Cloudera Data Engineer
We have an exciting opening for a full-time Cloudera Data Engineer to join our innovative Software Development team in Madison, Wisconsin!
Join a team recognized as one of Madison Magazine's Best Places to Work, where innovation thrives, collaboration drives success, and your work makes a real-world impact—because at Yahara, we don't just build data pipelines, we empower people and transform industries.
Important Notes about this Position:
- This position offers remote work flexibility but is only open to candidates who reside in or are willing to relocate to the greater Madison, WI area.
- We are unable to provide sponsorship at this time.
Summary
The Cloudera Data Engineer designs and maintains enterprise-scale data pipelines using the Cloudera Data Platform (CDP) and related big data technologies. The role focuses on building scalable ETL / ELT workflows, optimizing distributed compute, and enabling secure, high‑performance data services across multiple domains. Work is highly collaborative within cross‑functional Agile teams.
Our Approach
We build data solutions grounded in strong engineering fundamentals—reliable architecture, quality controls, and scalable design. We use modern cloud platforms, integrating analytics and ML where valuable while prioritizing data integrity and governance.
What You'll Do
- Design and maintain enterprise-scale pipelines using CDP and big data tooling.
- Build scalable ETL / ELT workflows for structured and unstructured data.
- Develop distributed processing jobs using big data framework components.
- Design data storage solutions balancing performance and cost.
- Collaborate with analysts, scientists, and developers to deliver data solutions.
- Develop technical documentation for pipelines and architectures.
What You'll Bring
Experience & Education:
- 5–7 years in data engineering with big data or distributed systems.
- Experience with CDP, CDH, or similar enterprise big data platforms.
- Degree in CS, Data Science, Information Systems, or equivalent experience.
- Strong background in distributed data processing.
- Ability to obtain and maintain Public Trust clearance.
Mindset & Approach:
- Self‑starter with a passion for data engineering.
- Strong analytical and problem‑solving skills.
- Enthusiastic about big data technologies and performance optimization.
- Detail‑oriented with a commitment to accuracy and reliability.
- Ability to translate business requirements into effective solutions.
- Collaborative, able to recognize blockers and leverage team strengths.
Technical Background:
- Experience with Agile development environments.
- Proven experience designing and implementing production pipelines.
- Experience in biohealth, laboratory, or scientific data environments is a plus.
- Familiarity with HIPAA, FDA, or GxP preferred but not required.
Specific Technical Qualifications
- Cloudera ecosystem experience: CDP, HDFS, Hive / Impala, Spark.
- Programming: Python, Scala, or Java.
- Advanced SQL and distributed compute (Spark, MapReduce).
- Shell scripting and version control (Git).
- Data storage formats: Parquet, Avro, ORC.
- Workflow orchestration and scheduling.
- Cloud experience (Azure, AWS, or GCP) and understanding of hybrid patterns.
Company Benefits & Perks
- 20+ days of PTO accruable in the first year!
- Comprehensive health insurance (Medical, Dental, Vision) with HMO and PPO options
- Health Savings Account (HSA) with annual employer contributions
- 401(k) with guaranteed company match (Traditional and Roth options)
- 100% company-paid short-term and long-term disability
- 100% company-paid life insurance with option to increase coverage
- 100% company-paid identity theft protection
- On-site gym with basketball court
- Hybrid / remote schedule with home office stipend
- Fresh fruit, healthy snacks, and beverages provided daily
- Bonus certification program (Microsoft, AWS, PMP, IIBA, etc.)
- Employee Assistance Program (counseling, legal, financial services)
- Monthly and Quarterly Recognition Awards with spot bonuses
- Company-supported community outreach and volunteer opportunities
- Employee-run committee involvement opportunities
- Collaborative culture founded on realized values and incredible people
If you need an accommodation as part of the employment process, please contact Human Resources via email at hradmin@yaharasoftware.com.
Yahara Software LLC is an Equal Employment Opportunity / Affirmative Action Employer.
This is a full-time, salaried position with competitive salary and benefits. Candidates must be eligible to work in the U.S. on a permanent basis and be able to work on-site at our office in Madison, Wisconsin.