Principal Kafka Support & Reliability Engineer
Purple Drive
Canton, MA, USA
The average salary range is between $110,000 and $177,645 per year, with the average salary around $131,215 per year.
Role: Principal Kafka Support & Reliability Engineer
Location: Canton, MA
Role Descriptions:

1. Tier 3 Incident Management & Escalation Support
- Act as the highest technical escalation point for Kafka production incidents (Sev 1 / Sev 2).
- Lead deep troubleshooting across:
  - Broker instability, controller elections, ISR shrinkage
  - Under-replicated partitions and leader imbalance
  - Producer/consumer failures, lag spikes, and rebalance storms
  - Disk, network, JVM, and request handler saturation
- Provide hands-on remediation for complex issues, including:
  - Partition reassignment and leader rebalance
  - Broker configuration tuning
  - Throttle/quota strategies for noisy producers or consumers
- Coordinate with vendor support during service incidents, providing logs, metrics, and forensic details.
- Guide Tier 2 teams during major incidents and validate restoration actions.

2. Kafka Performance Engineering & Optimization
- Analyze Kafka workloads for performance and scalability risks:
  - Partition skew and hot partitions
  - Inefficient producer batching/compression
  - Consumer lag root cause analysis
  - Thread pool, IO, and network bottlenecks
- Recommend and validate:
  - Topic design (partition count, replication factor, retention, compaction)
  - Producer and consumer configuration best practices
  - Quotas, quota enforcement, and multi-tenant controls
- Support onboarding of high-throughput or latency-sensitive workloads, ensuring Kafka is correctly sized and tuned.

3. Platform Stability, Reliability & Resilience
- Diagnose and resolve systemic Kafka stability issues:
  - Repeated broker failures or flapping
  - Metadata/controller instability (ZooKeeper or KRaft)
  - Recovery issues following failovers or maintenance events
- Support resilience initiatives:
  - Multi-AZ cluster health validation
  - Replication and DR strategies (MirrorMaker 2, Replicator, or app-level DR patterns)
  - Failover testing and validation
- Define and improve Kafka SLOs for availability, durability, and latency.

4. Change, Upgrade & Configuration Leadership
- Lead medium- to high-risk Kafka changes, including:
  - Broker and cluster configuration changes
  - Partition expansion or large-scale reassignment
  - Topic policy changes impacting durability or performance
- Support and plan:
  - Kafka version upgrades
  - MSK/Confluent upgrade cycles
  - Client compatibility and rollout strategies
- Participate in CAB reviews, assess risk, and design rollback and validation plans.

5. Root Cause Analysis & Continuous Improvement
- Own RCA documentation for major incidents with clear corrective and preventive actions (CAPA).
- Identify recurring failure patterns and architectural gaps.
- Recommend platform-level improvements:
  - Automation opportunities
  - Guardrails and standards
  - Monitoring and alerting enhancements
- Contribute to continuous improvement of runbooks, knowledge base articles, and operational playbooks.
Essential Skills:
Role Overview: The Kafka Tier 3 Support Engineer is a senior technical role responsible for expert-level support, advanced troubleshooting, performance engineering, and platform stabilization of enterprise Apache Kafka environments. This role functions as the final technical escalation point for Kafka-related production incidents and is accountable for root cause analysis (RCA), complex remediation, and long-term prevention. The engineer works closely with Tier 2 operations, Platform Engineering, SRE teams, application teams, and vendor support (AWS MSK / Confluent Cloud providers) to ensure Kafka remains a highly reliable, scalable, and secure streaming backbone.
Desirable Skills:
Keyword:
Skills: Digital: Kafka, Digital: Amazon Connect, Digital: Kubernetes
Experience Required: 10 & Above