Talent.com
Staff Data Engineer
Staff Data EngineerLucasfilm • Nicasio, Californie, États-Unis
Staff Data Engineer

Staff Data Engineer

Lucasfilm • Nicasio, Californie, États-Unis
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Résumé du poste:

The Skywalker Sound Development Group is seeking an experienced Data Engineer to specialize in the creation, management, and optimization of data pipelines to support cutting-edge AI/ML research. This is a critical role in preparing high-quality datasets for the training, retraining, and evaluation of machine learning models tailored to immersive and multichannel audio applications.

As a Data Engineer, you will focus on developing robust pipelines for processing complex media datasets, enabling AI/ML researchers to build transformative solutions for speech processing, style transfer, and source separation. Your work will directly contribute to creating innovative soundtrack workflows for global media production.

What You'll Do

  • Design, implement, and maintain scalable, automated data pipelines for the ingestion, preprocessing, and transformation of large-scale audio datasets.

  • Ensure pipelines support efficient model training and retraining workflows, enabling continuous improvement of AI/ML models.

  • Collaborate with AI/ML researchers to define data requirements and integrate feedback to improve data pipeline functionality.

  • Develop advanced preprocessing techniques for immersive and multichannel audio formats (e.g., Dolby Atmos, high-order ambisonics).

  • Automate data cleaning, normalization, and augmentation processes to prepare datasets for various model architectures, including foundational models and transformers.

  • Integrate external datasets and APIs while ensuring compliance with legal and ethical data usage standards.

  • Monitor and optimize pipeline performance to handle complex and dynamic data structures effectively.

  • Create tools and workflows for annotating, labeling, and curating datasets, including the use of active learning methods.

  • Perform exploratory data analysis to uncover trends, validate dataset quality, and identify data gaps.

What We’re Looking For

  • Master’s Degree with preference for PhD in Data Engineering/Science, Computer Science, Signal Processing, or a related field.

  • 8+years of experience in data engineering or data science with a focus on building pipelines for AI/ML applications.

  • Proficiency in Python, with expertise in data manipulation libraries such as Pandas, NumPy, and PyTorch’s data utilities.

  • Hands-on experience with audio processing libraries and tools (e.g., Librosa, FFmpeg, SoX) for handling complex audio formats.

  • Familiarity with scalable pipeline tools like GitLab, Apache Spark, Airflow, or Luigi, and experience with containerized workflows (Docker, Kubernetes).

  • Strong understanding of data pipeline requirements for model training, retraining, and evaluation in iterative research workflows.

  • Experience with immersive and multichannel audio formats.

  • Knowledge of cloud-based platforms and tools for storage and processing, such as AWS S3, Redshift, or Google BigQuery.

  • Strong problem-solving skills, with a proactive mindset for addressing evolving data challenges.

Preferred Qualifications

  • Experience integrating data pipelines with AI/ML workflows, including active learning and model retraining.

  • Familiarity with audio-specific datasets and metadata management strategies.

  • Knowledge of machine learning principles and how data quality impacts model performance.

  • Experience with distributed training pipelines and large-scale dataset processing.

  • Contributions to open-source projects or published research in the fields of data science or audio processing.

  • Experience with visualization tools (e.g., Tableau, Matplotlib) for quality assurance and exploratory data analysis.

  • Expertise in designing systems to support AI/ML model monitoring and retraining over time.


The hiring range for this position in Nicasio, CA is $170,500 to $228,600 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
[job_alerts.create_a_job]

Staff Data Engineer • Nicasio, Californie, États-Unis

[internal_linking.similar_jobs]
Field Application Engineer

Field Application Engineer

AIRGAIN INC • San Diego, California, United States, 92130
[job_card.full_time]
[filters_job_card.quick_apply]
Airgain simplifies wireless connectivity across a diverse set of devices and markets, from solving complex connectivity issues to speeding time to market to enhancing wireless signals.Our products ...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Remote Data Entry & Survey Job – Earn Extra Income Online, Up to $25 Per Survey

Remote Data Entry & Survey Job – Earn Extra Income Online, Up to $25 Per Survey

Earn Haus • Encinitas, CA, US
[filters.remote]
[job_card.full_time] +1
We are urgently looking for people interested in taking online surveys for Fortune 500 brands.If you are a self-starter, looking for flexible hours throughout the week, this may be for you! Earn up...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
SENIOR & JUNIOR STAFF ACCOUNTANT

SENIOR & JUNIOR STAFF ACCOUNTANT

FRANK AKEF & CO • Preuss, CA, United States
[job_card.full_time]
We’re seeking dynamic individuals to fill Senior and Junior CPA, EA and / or accounting positions for daily tax activities.If you have a CPA, EA and /or accounting experience in public accounting f...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Area Supervisor

Area Supervisor

Ross Stores, Inc • La Jolla, CA, United States
[job_card.full_time]
Our values start with our people, join a team that values you! Bring your talents to Ross, our leading off-price retail chain with over 2,200 stores, and a strong track record of success and growth...[show_more]
[last_updated.last_updated_30] • [promoted]
Stationary Engineer

Stationary Engineer

SodexoMagic • OCEANSIDE, CA, US
[job_card.full_time]
GILEAD SCIENCES, OCEANSIDE - 21002101.Varying shifts, days/hours (open availability preferred).More details will be provided during the interview process.Working with SodexoMagic is more than a job...[show_more]
[last_updated.last_updated_variable_days]
Team Manager

Team Manager

Panera Bread • Coronado, CA, United States
[job_card.full_time]
Come Join Panera Bread- an award-winning leader in the restaurant industry and employer of choice for 2022 and 2023!.We are also proud to be named a Top Workplace for 2024!.A competitive hourly wag...[show_more]
[last_updated.last_updated_30] • [promoted]
Team Lead

Team Lead

CAVA • La Jolla, CA, United States
[job_card.full_time]
At CAVA, we love what we do, and we try and make every day as fulfilling as the last.Our restaurants need Team Members to make the magic happen every day.Everyone matters and we're here to celebrat...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
ML Engineer – Insurance AI Pipelines & Production

ML Engineer – Insurance AI Pipelines & Production

Betterview • Carlsbad, CA, United States
[job_card.full_time]
A technology-driven SaaS company is looking for a Machine Learning Engineer to be part of their Insurance AI team.The role involves building and maintaining ML infrastructure, collaborating with Da...[show_more]
[last_updated.last_updated_variable_hours] • [promoted] • [new]
AI Data Scientist

AI Data Scientist

Vets Hired • Camp Pendleton, California, United States
[job_card.full_time]
[filters_job_card.quick_apply]
This role focuses on data integration, advanced analytics, and the development and deployment of secure AI/ML solutions in both real-world and theoretical environments.The position supports decisio...[show_more]
[last_updated.last_updated_30]
Remote Senior SQL Engineer - AI Trainer

Remote Senior SQL Engineer - AI Trainer

SuperAnnotate • Coronado, California, US
[filters.remote]
[job_card.full_time]
As a Senior SQL Engineer, you will work remotely on an hourly paid basis to review AI-generated SQL queries, database designs, and data-processing logic, as well as generate high-quality reference ...[show_more]
[last_updated.last_updated_30]
Staff Software Engineer - Remote

Staff Software Engineer - Remote

TradeJobsWorkForce • 92109 San Diego, CA, US
[filters.remote]
[job_card.full_time]
Staff Software Engineer Remote Job Duties: • Implement and evolve a Data Lake storage system with low latency and high throughput for bulk data ingestion and query • Implement metadata, data govern...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Software Engineer (Python / AI Agent)

Senior Software Engineer (Python / AI Agent)

XILO • Carlsbad, CA, United States
[job_card.full_time]
Senior Software Engineer (Python / AI Agent).Typescript/NestJS), Angular, AI Agents (LangChain/CrewAI).You will act as a Full Stack Developer with a heavy emphasis on Backend and AI logic.While Pyt...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Sofware Engineer

Sofware Engineer

TradeJobsWorkForce • 92166 San Diego, CA, US
[job_card.full_time]
Analyze, design and develop tests and test-automation suites.Design, create and develop a processing platform using various configuration management technologies.Test software development methodolo...[show_more]
[last_updated.last_updated_30] • [promoted]
Senior Python AI Agent Engineer — Hybrid

Senior Python AI Agent Engineer — Hybrid

XILO • Carlsbad, CA, United States
[job_card.full_time]
A tech-driven solutions provider is seeking a Senior Software Engineer to focus on backend services and AI logic in a hybrid role.The candidate should have over 5 years of experience, with deep exp...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Online Survey Participant: Work Remote and Earn Up To $25 Per Survey

Online Survey Participant: Work Remote and Earn Up To $25 Per Survey

Earn Haus • Coronado, CA, US
[filters.remote]
[job_card.full_time] +1
Looking for people to participate in taking online surveys for Fortune 500 brands.All you need to do is complete online surveys by sharing your opinion.You will help influence brand decisions on se...[show_more]
[last_updated.last_updated_30] • [promoted]
Survey Taker: Earn up to $25 per survey (Remote)

Survey Taker: Earn up to $25 per survey (Remote)

Earn Haus • Coronado, CA, US
[filters.remote]
[job_card.full_time] +1
Looking for people to participate in taking online surveys for Fortune 500 brands.All you need to do is complete online surveys by sharing your opinion.You will help influence brand decisions on se...[show_more]
[last_updated.last_updated_30] • [promoted]
Engineering Manager - ML Delivery & DevOps

Engineering Manager - ML Delivery & DevOps

SDV International • Coronado, California, United States
[job_card.full_time]
Salary: $195,000 - 195,000 per year.Active DoD Secret clearance (must be current and maintained throughout the contract).Onsite presence in Coronado, CA (this role is not remote).Over 10 years of p...[show_more]
[last_updated.last_updated_variable_days] • [promoted]
Netflix Data Analyst

Netflix Data Analyst

TradeJobsWorkforce • 92132 San Diego, CA, US
[job_card.full_time]
Make an impact in the role of Netflix Data Analyst to analyze viewer data to guide content decisions.Ensure all safety and quality standards are met.Work closely with your team to maintain high per...[show_more]
[last_updated.last_updated_30] • [promoted]