Talent.com
Staff Data Engineer
Staff Data EngineerLucasfilm • Nicasio, Californie, États-Unis
Staff Data Engineer

Staff Data Engineer

Lucasfilm • Nicasio, Californie, États-Unis
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Résumé du poste:

The Skywalker Sound Development Group is seeking an experienced Data Engineer to specialize in the creation, management, and optimization of data pipelines to support cutting-edge AI/ML research. This is a critical role in preparing high-quality datasets for the training, retraining, and evaluation of machine learning models tailored to immersive and multichannel audio applications.

As a Data Engineer, you will focus on developing robust pipelines for processing complex media datasets, enabling AI/ML researchers to build transformative solutions for speech processing, style transfer, and source separation. Your work will directly contribute to creating innovative soundtrack workflows for global media production.

What You'll Do

  • Design, implement, and maintain scalable, automated data pipelines for the ingestion, preprocessing, and transformation of large-scale audio datasets.

  • Ensure pipelines support efficient model training and retraining workflows, enabling continuous improvement of AI/ML models.

  • Collaborate with AI/ML researchers to define data requirements and integrate feedback to improve data pipeline functionality.

  • Develop advanced preprocessing techniques for immersive and multichannel audio formats (e.g., Dolby Atmos, high-order ambisonics).

  • Automate data cleaning, normalization, and augmentation processes to prepare datasets for various model architectures, including foundational models and transformers.

  • Integrate external datasets and APIs while ensuring compliance with legal and ethical data usage standards.

  • Monitor and optimize pipeline performance to handle complex and dynamic data structures effectively.

  • Create tools and workflows for annotating, labeling, and curating datasets, including the use of active learning methods.

  • Perform exploratory data analysis to uncover trends, validate dataset quality, and identify data gaps.

What We’re Looking For

  • Master’s Degree with preference for PhD in Data Engineering/Science, Computer Science, Signal Processing, or a related field.

  • 8+years of experience in data engineering or data science with a focus on building pipelines for AI/ML applications.

  • Proficiency in Python, with expertise in data manipulation libraries such as Pandas, NumPy, and PyTorch’s data utilities.

  • Hands-on experience with audio processing libraries and tools (e.g., Librosa, FFmpeg, SoX) for handling complex audio formats.

  • Familiarity with scalable pipeline tools like GitLab, Apache Spark, Airflow, or Luigi, and experience with containerized workflows (Docker, Kubernetes).

  • Strong understanding of data pipeline requirements for model training, retraining, and evaluation in iterative research workflows.

  • Experience with immersive and multichannel audio formats.

  • Knowledge of cloud-based platforms and tools for storage and processing, such as AWS S3, Redshift, or Google BigQuery.

  • Strong problem-solving skills, with a proactive mindset for addressing evolving data challenges.

Preferred Qualifications

  • Experience integrating data pipelines with AI/ML workflows, including active learning and model retraining.

  • Familiarity with audio-specific datasets and metadata management strategies.

  • Knowledge of machine learning principles and how data quality impacts model performance.

  • Experience with distributed training pipelines and large-scale dataset processing.

  • Contributions to open-source projects or published research in the fields of data science or audio processing.

  • Experience with visualization tools (e.g., Tableau, Matplotlib) for quality assurance and exploratory data analysis.

  • Expertise in designing systems to support AI/ML model monitoring and retraining over time.


The hiring range for this position in Nicasio, CA is $170,500 to $228,600 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
[job_alerts.create_a_job]

Staff Data Engineer • Nicasio, Californie, États-Unis

[internal_linking.similar_jobs]
Staff Software Engineer - Remote

Staff Software Engineer - Remote

TradeJobsWorkForce • 92122 San Diego, CA, US
[filters.remote]
[job_card.full_time]
Staff Software Engineer Remote Job Duties: • Implement and evolve a Data Lake storage system with low latency and high throughput for bulk data ingestion and query • Implement metadata, data govern...[show_more]
[last_updated.last_updated_30] • [promoted]
Remote Data Entry & Survey Job – Earn Extra Income Online, Up to $25 Per Survey

Remote Data Entry & Survey Job – Earn Extra Income Online, Up to $25 Per Survey

Earn Haus • Carlsbad, CA, US
[filters.remote]
[job_card.full_time] +1
We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Area Supervisor

Area Supervisor

Ross Stores, Inc • La Jolla, CA, United States
[job_card.full_time]
Our values start with our people, join a team that values you! Bring your talents to Ross, our leading off-price retail chain with over 2,200 stores, and a strong track record of success and growth...[show_more]
[last_updated.last_updated_30] • [promoted]
Stationary Engineer

Stationary Engineer

SodexoMagic • OCEANSIDE, CA, US
[job_card.full_time]
GILEAD SCIENCES, OCEANSIDE - 21002101.Varying shifts, days/hours (open availability preferred).More details will be provided during the interview process.Working with SodexoMagic is more than a job...[show_more]
[last_updated.last_updated_variable_days]
ML Engineer, Insurance AI — Pipelines & Production

ML Engineer, Insurance AI — Pipelines & Production

Nearmap • Carlsbad, CA, United States
[job_card.full_time]
A leading technology company is seeking a Machine Learning Engineer to join their Insurance AI team in California.You will be responsible for building and maintaining ML infrastructure that support...[show_more]
[last_updated.last_updated_30] • [promoted]
Team Manager

Team Manager

Panera Bread • Coronado, CA, United States
[job_card.full_time]
Come Join Panera Bread- an award-winning leader in the restaurant industry and employer of choice for 2022 and 2023!.We are also proud to be named a Top Workplace for 2024!.A competitive hourly wag...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Engineer, AI Software - Onsite

Senior Engineer, AI Software - Onsite

Leica Biosystems • Vista, CA, United States
[job_card.full_time]
Senior Engineer, AI Software - Onsite.Are you ready to accelerate your potential and make a real difference within life sciences, diagnostics and biotechnology? At Leica Biosystems, one of Danaher’...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Senior Software Engineer

Senior Software Engineer

Leidos • Vista, CA, United States
[job_card.full_time]
The Senior Software Engineer will serve as an experienced technical contributor solving complex technical issues and driving innovation within the team.This individual will play a key role in leadi...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Team Lead

Team Lead

CAVA • La Jolla, CA, United States
[job_card.full_time]
At CAVA, we love what we do, and we try and make every day as fulfilling as the last.Our restaurants need Team Members to make the magic happen every day.Everyone matters and we're here to celebrat...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Sofware Engineer

Sofware Engineer

TradeJobsWorkForce • 92132 San Diego, CA, US
[job_card.full_time]
Analyze, design and develop tests and test-automation suites.Design, create and develop a processing platform using various configuration management technologies.Test software development methodolo...[show_more]
[last_updated.last_updated_30] • [promoted]
Netflix Data Analyst

Netflix Data Analyst

TradeJobsWorkforce • 92161 San Diego, CA, US
[job_card.full_time]
Join our growing team as a Netflix Data Analyst to perform daily responsibilities with dedication.Provide excellent interactions with customers and colleagues.Provide excellent interactions with cu...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Software Engineer — AI-Driven C# Leader

Senior Software Engineer — AI-Driven C# Leader

Leidos Inc • Vista, CA, United States
[job_card.full_time]
A leading technology solutions provider in California is seeking a Senior Software Engineer to solve complex technical challenges and drive innovation.This role involves leading projects, mentoring...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Remote Senior SQL Engineer - AI Trainer

Remote Senior SQL Engineer - AI Trainer

SuperAnnotate • Carlsbad, California, US
[filters.remote]
[job_card.full_time]
As a Senior SQL Engineer, you will work remotely on an hourly paid basis to review AI-generated SQL queries, database designs, and data-processing logic, as well as generate high-quality reference ...[show_more]
[last_updated.last_updated_30]
Senior Software Engineer (Python / AI Agent)

Senior Software Engineer (Python / AI Agent)

XILO • Carlsbad, CA, United States
[job_card.full_time]
Senior Software Engineer (Python / AI Agent).Typescript/NestJS), Angular, AI Agents (LangChain/CrewAI).You will act as a Full Stack Developer with a heavy emphasis on Backend and AI logic.While Pyt...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Work from Home Data Entry Clerk

Work from Home Data Entry Clerk

GL Inc. • Encinitas, California
[filters.remote]
[job_card.full_time]
We’re looking for Data Entry Specialists for Customer Products across the US to work from home and help top brands improve their products before they hit the market.[show_more]
[last_updated.last_updated_30] • [promoted]
Data Scientist

Data Scientist

Sellers & Associates, LLC • Coronado, CA, United States
[job_card.full_time]
Sellers & Associates, LLC (S&A) is a Veteran Owned Small Business (VOSB) that provides effective and affordable Programmatic and Engineering Support Services and Solutions to our Government and Com...[show_more]
[last_updated.last_updated_30] • [promoted]
Data Scientist — AI/ML for National Defense Ops

Data Scientist — AI/ML for National Defense Ops

Sellers & Associates, LLC • Coronado, CA, United States
[job_card.full_time]
A Veteran Owned Small Business is seeking an experienced Data Scientist with AI/ML expertise to support Navy mission objectives.This role focuses on developing data-driven solutions through advance...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Python AI Agent Engineer — Hybrid

Senior Python AI Agent Engineer — Hybrid

XILO • Carlsbad, CA, United States
[job_card.full_time]
A tech-driven solutions provider is seeking a Senior Software Engineer to focus on backend services and AI logic in a hybrid role.The candidate should have over 5 years of experience, with deep exp...[show_more]
[last_updated.last_updated_variable_days] • [promoted]