Optimization Specialist

Job Type: Contract
Job Location: United States
Salary: Up to $140/hr
Company Name: Robert Half

About the job

Apache Zookeeper Optimization Consultant Role:

We are seeking an experienced Apache Zookeeper Optimization Consultant to enhance the resiliency and performance of our distributed systems infrastructure. The ideal candidate will possess deep expertise in Zookeeper configuration, tuning, and troubleshooting, with a strong understanding of distributed systems, high-availability requirements, and related technologies such as RabbitMQ, Redis, and Kafka.

 

Key Responsibilities:

 

Performance Optimization:

● Analyze the current Zookeeper setup and identify bottlenecks affecting performance.

● Implement tuning measures for read/write latency, throughput, and leader election times.

● Optimize JVM parameters and Zookeeper settings (e.g., tick time, heap size).

 

Resiliency Enhancement:

● Architect solutions for fault tolerance and disaster recovery.

● Design and implement multi-region and multi-data center deployments.

● Establish robust configurations for quorum consistency and failover mechanisms.

 

Monitoring and Alerting:

● Review monitoring tools (e.g., Prometheus, Grafana) to track Zookeeper health for resiliency.

● Develop custom alerts for potential issues such as latency spikes, memory usage, and connection limits.

 

Collaboration:

● Work closely with engineering teams to ensure Zookeeper is optimized and resilient alongside other components like Kafka, RabbitMQ, Redis, and custom services.

● Conduct capacity planning to ensure scalability for future workloads.

 

Experience:

● 10+ years of hands-on experience managing and optimizing Apache Zookeeper in production environments at large scale.

● Proven track record of designing resilient distributed systems.

● Experience with RabbitMQ, Redis, and Kafka in distributed architectures.

 

Technical Expertise:

● Deep understanding of distributed systems, including Zookeeper internals (leader election, session management, quorum design).

● Expertise in associated technologies like RabbitMQ, Redis, and Kafka, with an understanding of their integration into distributed environments.

● Proficiency in monitoring and troubleshooting tools such as Prometheus, Grafana, or similar.

 

Skills:

● Strong scripting skills (e.g., Bash, Python) for automation.

● Excellent problem-solving and communication abilities.

 

Certifications (optional):

● Relevant certifications in distributed systems, messaging technologies, or DevOps practices are a plus


APPLY

Apply for this position

Allowed Type(s): .pdf, .doc, .docx