Talent.com
Databricks Data Engineer with DevOps
Databricks Data Engineer with DevOpsCloudious LLC • Los Angeles, CA, United States
[error_messages.no_longer_accepting]
Databricks Data Engineer with DevOps

Databricks Data Engineer with DevOps

Cloudious LLC • Los Angeles, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
  • [filters_job_card.quick_apply]
[job_card.job_description]

Job Title: Databricks Data Engineer with DevOps Skills

Location : Los Angeles CA (Hybrid)

Hire type : FTE / CTH

Salary : $130K

Job Summary

We are looking for an experienced Databricks Data Engineer with strong DevOps expertise to join our data engineering team. The ideal candidate will design, build, and optimize large-scale pipelines on the Databricks Lakehouse Platform on AWS, while driving automated CI/CD and deployment practices. This role requires strong skills in PySpark, SQL, AWS cloud services, and modern DevOps tooling. You will collaborate closely with cross-functional teams to deliver scalable, secure, and high-performance data solutions.

Must Demonstrate (Critical Skills & Architectural Competencies)

  • Designing and implementing Databricks-based Lakehouse architectures on AWS
  • Clear separation of compute vs. serving layers
  • Ability to design low-latency data/API access strategies (beyond Spark-only patterns)
  • Strong understanding of caching strategies for performance and cost optimization
  • Data partitioning, storage optimization, and file layout strategy
  • Ability to handle multi-terabyte structured or time-series datasets
  • Skill in requirement probing, identifying what matters architecturally
  • A player-coach mindset: hands-on engineering + technical leadership

Key Responsibilities

1. Data Pipeline Development

  • Design, build, and maintain scalable ETL/ELT pipelines using Databricks on AWS.
  • Develop high-performance data processing workflows using PySpark/Spark and SQL.
  • Integrate data from Amazon S3, relational databases, and semi/non structured sources.
  • Implement Delta Lake best practices including schema evolution, ACID, OPTIMIZE, ZORDER, partitioning, and file-size tuning.
  • Ensure architectures support high-volume, multi-terabyte workloads.

2. DevOps & CI/CD

  • Implement CI/CD pipelines for Databricks using Git, GitLab, GitHub Actions, or AWS-native tools.
  • Build and manage automated deployments using Databricks Asset Bundles.
  • Manage version control for notebooks, workflows, libraries, and environment configuration.
  • Automate cluster policies, job creation, environment provisioning, and configuration management.
  • Support infrastructure-as-code via Terraform (preferred) or CloudFormation.

3. Collaboration & Business Support

  • Work with data analysts and BI teams to prepare curated datasets for reporting and analytics.
  • Collaborate closely with product owners, engineering teams, and business partners to translate requirements into scalable implementations.
  • Document data flows, technical architecture, and DevOps/deployment workflows.

4. Performance & Optimization

  • Tune Spark clusters, workflows, and queries for cost efficiency and compute performance.
  • Monitor pipelines, troubleshoot failures, and maintain high reliability.
  • Implement logging, monitoring, and observability across workflows and jobs.
  • Apply caching strategies and workload optimization techniques to support low-latency consumption patterns.

5. Governance & Security

  • Implement and maintain data governance using Unity Catalog.
  • Enforce access controls, security policies, and data compliance requirements.
  • Ensure lineage, quality checks, and auditability across data flows.

Technical Skills

  • Strong hands-on experience with Databricks, including:
    • Delta Lake
    • Unity Catalog
    • Lakehouse Architecture
    • Delta Live Pipelines
    • Databricks Runtime
    • Table Triggers
    • Databricks Workflows
  • Proficiency in PySpark, Spark, and advanced SQL.
  • Expertise with AWS cloud services, including:
    • S3
    • IAM
    • Glue / Glue Catalog
    • Lambda
    • Kinesis (optional but beneficial)
    • Secrets Manager
  • Strong understanding of DevOps tools:
    • Git / GitLab
    • CI/CD pipelines
    • Databricks Asset Bundles
  • Familiarity with Terraform is a plus.
  • Experience with relational databases and data warehouse concepts.

Preferred Experience

  • Knowledge of streaming technologies like Structured Streaming/Spark Streaming.
  • Experience building real-time or near real-time pipelines.
  • Exposure to advanced Databricks runtime configurations and performance tuning.

Certifications (Optional)

  • Databricks Certified Data Engineer Associate / Professional
  • AWS Data Engineer or AWS Solutions Architect certification

[job_alerts.create_a_job]

Databricks Data Engineer with DevOps • Los Angeles, CA, United States

[internal_linking.similar_jobs]

Data Infrastructure Engineer

HeyGenLos Angeles, CA, United States
[job_card.full_time]

At HeyGen, our mission is to make visual storytelling accessible to all.Over the last decade, visual content has become the preferred method of information creation, consumption, and retention.But ...[internal_linking.show_more]

 • [job_card.promoted]

Vice President, Senior Data Engineer

Oaktree Capital Management, L.P.Los Angeles, CA, United States
[job_card.full_time]

Oaktree is a leader among global investment managers specializing in alternative investments, with more than $220 billion in assets under management.The firm emphasizes an opportunistic, value‑orie...[internal_linking.show_more]

 • [job_card.promoted]

Senior Data Engineer - Build Scalable Data Pipelines

EnigmaLos Angeles, CA, United States
[job_card.full_time]

A data intelligence firm in San Francisco is looking for a Senior Data Engineer to design and maintain their core small business data product.The role requires extensive experience in data systems,...[internal_linking.show_more]

 • [job_card.promoted]

Senior Data Engineer: 25-03739 (No C2C)

Akraya IncSanta Monica, California, United States
[job_card.temporary]
[filters_job_card.quick_apply]

Skills: SQL (Expert), Spark (Intermediate), Python(Proficient), Data Modeling (Expert), ETL Data Pipelining (Proficient).Duration: 7 Months Contract with possible extension.Location: Santa Monica, ...[internal_linking.show_more]

Senior DevOps, Platform Engineer

MoonGlendale, California, United States, 91203
[job_card.full_time]
[filters_job_card.quick_apply]

An ambitious and independent stealth SaaS company incubated by Home Organizers, a market leader with decades of proven success in designing and delivering exceptional, innovative home organization ...[internal_linking.show_more]

 • [job_card.new]

Data Engineer, Snowflake & HR Analytics (Remote Options)

DisneyBurbank, CA, United States
[filters.remote]
[job_card.full_time]

A leading entertainment company is seeking a data architect in Burbank to optimize Snowflake data solutions and integrate HR case management.Candidates should have 5years of experience in data mana...[internal_linking.show_more]

 • [job_card.promoted]

Lead Data Engineer, Real-Time On-Chain Trading

zora.coLos Angeles, CA, United States
[job_card.full_time]

A leading on-chain social network is seeking a Lead Data Engineer to build data infrastructure that powers their trading platform.The role involves designing scalable data pipelines, maintaining da...[internal_linking.show_more]

 • [job_card.promoted]

AWS Data Engineer – Qualtrics Integration

Veracity Software IncTorrance, CA, United States
[job_card.full_time]

AWS Data Engineer - Qualtrics Integration.Enterprise Survey Platform / Qualtrics.AWS | Python | SQL | Qualtrics API.The AWS Data Engineer - Qualtrics Integration is responsible for designing, build...[internal_linking.show_more]

 • [job_card.promoted]

Lead Data Engineer

zora.coLos Angeles, CA, United States
[job_card.full_time]

Zora is a new kind of on-chain social network where you can express yourself freely, connect with others, and discover the value of your imagination.We believe that by using crypto, we can enable a...[internal_linking.show_more]

 • [job_card.promoted]

Senior Data Engineer - Data Cloud & Ingestion Lead (Remote)

Sopra Steria USALos Angeles, CA, United States
[filters.remote]
[job_card.full_time]

Une entreprise majeure de conseil IT recherche un Data Engineer exp riment pour rejoindre son centre d'excellence.Vous travaillerez sur l'ingestion et la transformation des donn es, en collaborant ...[internal_linking.show_more]

 • [job_card.promoted]

Senior Platform Engineer – CloudOps & Data Integrations

Rainmaker Entertainment IncEl Segundo, CA, United States
[job_card.full_time]

A leading technology firm in California is seeking a Senior Engineer to develop an operational platform that connects cloud seeding operations to business and scientific analysis.The candidate will...[internal_linking.show_more]

 • [job_card.promoted]

Senior Data Platform Engineer (Remote)

LinktreeLos Angeles, CA, United States
[filters.remote]
[job_card.full_time]

A leading data platform company in San Francisco is looking for a Software Engineer to design and implement a robust data platform.Your role will directly impact the experiences of millions of user...[internal_linking.show_more]

 • [job_card.promoted]

Data Engineer

Skillerszone LLCLos Angeles, California, United States
[filters.remote]
[job_card.full_time]
[filters_job_card.quick_apply]

About Optiboostmedia is a leading provider of affiliate marketing and recruitment software solutions.We help businesses optimize their marketing and recruitment processes through innovative, user-f...[internal_linking.show_more]

Principal IT Software Engineer -- Remote Data & Cloud Lead

DIRECTVEl Segundo, CA, United States
[filters.remote]
[job_card.full_time]

A leading telecommunications company seeks a Principal, IT Software Engineer 2 to design and develop solutions for their customer profile operational data store.The ideal candidate has 7years of so...[internal_linking.show_more]

 • [job_card.promoted]

Cloud Engineer

Henderson ScottCulver City, CA, United States
[job_card.full_time]

Senior Cloud & Automation Engineer (AWS / Terraform / Ansible).We're working with a globally recognised organisation operating at serious scale, looking to bring in a.This isn't a pure architect or...[internal_linking.show_more]

 • [job_card.promoted]

Senior Software Engineer, Big Data

TixrSanta Monica, CA, United States
[job_card.full_time]

Tixr's on a mission to transform the ticket buying experience with a modern approach to a legacy business.Born from a fan-focused frame of mind, we empower large-scale events, music venues, and spo...[internal_linking.show_more]

 • [job_card.promoted]

Lead II - ML Engineering Data Science Engineer (Onsite)

Axelon Services CorporationLos Angeles, CA, US
[job_card.full_time]

Job Title: Lead II - ML Engineering Data Science Engineer (Onsite) Location: Woodland Hills, CA Pay rate: $53/hr Role Overview We are seeking a highly skilled Data Science Engineer to design and de...[internal_linking.show_more]

 • [job_card.promoted]

DevOps Engineer

DivIHN Integration IncKagel Canyon, CA, US
[job_card.permanent]

DivIHN (pronounced “divine”) is a CMMI ML3-certified Technology and Talent solutions firm.Driven by a unique Purpose, Culture, and Value Delivery Model, we enable meaningful connections between tal...[internal_linking.show_more]