Talent.com
Software Engineer - Distributed Data Systems San Francisco, California
Software Engineer - Distributed Data Systems San Francisco, CaliforniaDatabricks Inc. • San Francisco, CA, United States
Software Engineer - Distributed Data Systems San Francisco, California

Software Engineer - Distributed Data Systems San Francisco, California

Databricks Inc. • San Francisco, CA, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. Founded by engineers — and customer obsessed — we leap at every opportunity to solve technical challenges, from designing next-gen UI / UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started.

Modern data analysis employs sophisticated methods such as machine learning that go well beyond the roll-up and drill-down capabilities of traditional SQL query engines. As a software engineer on the Runtime team at Databricks, you will be building the next generation distributed data storage and processing systems that can outperform specialized SQL query engines in relational query performance, yet provide the expressiveness and programming abstractions to support diverse workloads ranging from ETL to data science.

Below are some example projects :

Apache Spark : Develop the de facto open source standard framework for big data.

Data Plane Storage : Provide reliable and high performance services and client libraries for storing and accessing humongous amount of data on cloud storage backends, e.g., AWS S3, Azure Blob Store.

Delta Lake : A storage management system that combines the scale and cost-efficiency of data lakes, the performance and reliability of a data warehouse, and the low latency of streaming. Its higher level abstractions and guarantees, including ACID transactions and time travel, drastically simplify the complexity of real-world data engineering architecture.

Delta Pipelines : It's difficult to manage even a single data engineering pipeline. The goal of the Delta Pipelines project is to make it simple and possible to orchestrate and operate tens of thousands of data pipelines. It provides a higher level abstraction for expressing data pipelines and enables customers to deploy, test & upgrade pipelines and eliminate operational burdens for managing and building high quality data pipelines.

Performance Engineering : Build the next generation query optimizer and execution engine that's fast, tuning free, scalable, and robust.

What we look for :

  • BS (or higher) in Computer Science, related technical field or equivalent practical experience.
  • Comfortable working towards a multi-year vision with incremental deliverables.
  • Motivated by delivering customer value and impact.
  • 5+ years of production level experience in either Java, Scala or C++.
  • Strong foundation in algorithms and data structures and their real-world use cases.
  • Experience with distributed systems, databases, and big data systems (Apache Spark, Hadoop).

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles. Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here .

Local Pay Range

$166,000 — $225,000 USD

About Databricks

Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark, Delta Lake and MLflow. To learn more, follow Databricks on Twitter , and Facebook .

Benefits

At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please .

Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.

Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

#J-18808-Ljbffr

[job_alerts.create_a_job]

Software Engineer • San Francisco, CA, United States

[internal_linking.related_jobs]
Software Engineer, Distributed Data Systems (Sora)

Software Engineer, Distributed Data Systems (Sora)

OpenAI • San Francisco, CA, United States
[job_card.full_time]
Software Engineer, Distributed Data Systems (Sora).The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Distributed Systems Software Engineer - Public Cloud (Mid / Senior / Lead / Principal)

Distributed Systems Software Engineer - Public Cloud (Mid / Senior / Lead / Principal)

Salesforce • San Francisco, CA, United States
[job_card.full_time]
Distributed Systems Software Engineer - Public Cloud (Mid / Senior / Lead / Principal).To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure y...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Distributed Systems Engineer - AI Data Platform

Senior Distributed Systems Engineer - AI Data Platform

Alluxio, Inc. • Foster City, CA, United States
[job_card.full_time]
A data orchestration company in California is seeking a Senior Software Engineer to advance their data layer for modern AI and analytics. You will work on optimizing distributed systems and enhancin...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Distributed Systems Software Engineer - Public Cloud (Mid / Senior / Lead / Principal)

Distributed Systems Software Engineer - Public Cloud (Mid / Senior / Lead / Principal)

salesforce.com, inc. • San Francisco, CA, United States
[job_card.full_time]
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Salesforce is the #1 AI CRM, where humans with age...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Engineer (San Francisco)

Data Engineer (San Francisco)

Midjourney • San Francisco, CA, US
[job_card.part_time]
Midjourney is a research lab exploring new mediums to expand the imaginative powers of the human species.We are a small, self-funded team focused on design, human infrastructure, and AI.We have no ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Software Engineer, Data Infrastructure San Francisco

Software Engineer, Data Infrastructure San Francisco

Persona • San Francisco, CA, United States
[job_card.full_time]
Persona is the configurable identity platform built for businesses in a digital-first world.Verifying individuals and organizations is harder — but more important — than ever, with AI enabling frau...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Principal Software Engineer (San Francisco)

Principal Software Engineer (San Francisco)

Xcede • San Francisco, CA, US
[job_card.part_time]
A SF Bay Area based cutting-edge startup, which is leading the charge in transforming observability for today's data-driven world is looking for seasoned software engineer to join them as a princip...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Software Engineer, Data Platform San Francisco

Senior Software Engineer, Data Platform San Francisco

Highnote Health Inc. • San Francisco, CA, United States
[job_card.full_time]
Founded in 2020 by a team of leaders from Braintree, PayPal, and Lending Club, Highnote is an embedded finance company that sets the standard in modern card platform management.As an all-in-one car...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Lead Data Engineer (San Francisco)

Lead Data Engineer (San Francisco)

Mentor Talent Acquisition • San Francisco, CA, United States
[job_card.full_time]
Were looking for a Lead Data Engineer to spearhead the design, implementation, and iteration of a world-class, modern data infrastructure that powers analytics, data science, and ML / AI systems.You ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Platform Engineer : Scale & Data Systems

Data Platform Engineer : Scale & Data Systems

Monograph • San Francisco, CA, United States
[job_card.full_time]
A technology company in San Francisco is looking for an experienced engineer to own and develop data systems and infrastructure. The successful candidate will work collaboratively across teams to de...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Engineer - Scientific Data Ingestion (San Francisco)

Data Engineer - Scientific Data Ingestion (San Francisco)

Mithrl • San Francisco, CA, US
[job_card.part_time]
We envision a world where novel drugs and therapies reach patients in months, not years, accelerating breakthroughs that save lives. Mithrl is building the worlds first commercially available AI Co-...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Solutions Engineer - US (San Francisco)

Solutions Engineer - US (San Francisco)

Encord • San Francisco, CA, United States
[job_card.full_time]
Solutions Engineer - US (San Francisco).Solutions Engineer - US (San Francisco).Get AI-powered advice on this job and more exclusive features. At Encord, we're building the AI infrastructure of the ...[show_more]
[last_updated.last_updated_30] • [promoted]
Software Engineer, Data Platform San Francisco; Hybrid

Software Engineer, Data Platform San Francisco; Hybrid

Superhuman Labs, Inc. • San Francisco, CA, United States
[job_card.full_time]
Superhuman offers a dynamic hybrid working model for this role.This flexible approach gives team members the best of both worlds : plenty of focus time along with in-person collaboration that helps ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Engineer (San Francisco)

Data Engineer (San Francisco)

Odiin. • San Francisco, CA, United States
[job_card.full_time]
Youll work closely with engineering, analytics, and product teams to ensure data is accurate, accessible, and efficiently processed across the organization. Design, develop, and maintain scalable da...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Engineer (San Francisco)

Data Engineer (San Francisco)

Fluency • San Francisco, CA, United States
[job_card.full_time]
Fluency is enabling the autonomous Enterprise.You're needed to help pioneer a new software category that will change how enterprises work. Welcome to the data layer of the future.Fluency is looking ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Distributed Systems Engineer / AI Workloads (San Francisco Bay Area)

Distributed Systems Engineer / AI Workloads (San Francisco Bay Area)

The Crypto Recruiters • San Francisco Bay Area, CA, United States
[job_card.permanent]
We are actively searching for a Distributed Systems Engineer to join our team on a permanent basis.In this founding engineer role you will focus on building next-generation data infrastructure for ...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Data Software Engineer

Data Software Engineer

micro1 • San Francisco, CA, United States
[job_card.full_time]
Alaris Security is building the core technology stack for intelligent cyber security.Designed for enterprise and defense organizations, our platform unifies security data.Together these technologie...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Software Engineer (Data & AI) San Francisco, CA

Software Engineer (Data & AI) San Francisco, CA

Suptask • San Francisco, CA, United States
[job_card.full_time]
Design, implement, and optimize data processing algorithms and AI models that enable intelligent decision-making and predictive capabilities. Collaborate with cross-functional teams to integrate, cl...[show_more]
[last_updated.last_updated_30] • [promoted]