Talent.com
TikTok
Large Model Training Acceleration EngineerTikTok • San Jose
Large Model Training Acceleration Engineer

Large Model Training Acceleration Engineer

TikTok • San Jose
30+ days ago
Job type
  • Full-time
Job description

Team Introduction

The Intelligent Creation - AI Platform team is a team focusing on building advanced end-to-end AI production pipelines, including deep learning model training, optimization, deployment and applications. We provide AI capabilities to empower content creation and consumption on TikTok and serve billions of users. We are seeking an experienced AI model optimization engineer with expertise in optimizing AI model training and inference, including distributed training/inference and acceleration. The ideal candidate will work at the cutting edge of AI efficiency, enhancing the performance, scalability, and deployment of large-scale generative AI models. Responsibilities - Optimize large model training pipelines to improve efficiency, speed, and scalability. - Develop and improve distributed training strategies such as data parallelism, model parallelism, pipeline parallelism and communication to accelerate model training. - Benchmark and profile deep learning models to identify performance bottlenecks and optimize computational resources.

Minimum Qualifications: - Master’s or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or a related field. - 5+ years of experience in AI model training optimization. - Strong software engineering skills, including proficiency in Python, C++, and CUDA. - Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed. - Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism. - Knowledge of transformers and diffusion models.
Create a job alert for this search

Large Model Training Acceleration Engineer • San Jose

Similar jobs

Model Serving Engineer

Bright Vision TechnologiesFremont, CA, US
$100,000.00 yearly
Full-time
Quick Apply

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations.We leverage cutt... Show more

Forward Deployment Engineer (Inference & RL POC)

Glint Tech Solutions LLCMountain View, California, United States
Full-time
Quick Apply

Bay area (frequent customer interaction).Inference & Reinforcement Learning Platform.Forward Deployment Engineer (FDE).You'll be embedded with customers during early-stage deployments—turning r... Show more

Senior Analog & Mixed-Signal Electronics Engineer

Zealogics.comSan Jose CA, CA, US
Full-time
Quick Apply

Duties & Responsibilities • Lead the design, analysis, and implementation of high-performance analog and mixed-signal circuits, including:.High-speed (>50 MSPS), high-precision (&#8805... Show more

Machine Learning Engineer (Agentic AI Platform)

Barker Staffing Solutions LLCMountain View, California, United States
Full-time
Quick Apply

We're building the next generation of.AI agents live, learn, and operate.This is a high-impact role for a.AI infrastructure from the ground up.You'll work at the intersection of.LLMs, distributed s... Show more

Reliability Engineer

3i Infotech Ltd.Milpitas, CA, United States
Full-time

We do have Immediate Job opening.Job Title: Reliability Engineer.Worker Location: USA-CA-Milpitas- Onsite.Create, edit and validate Reliability Block Models of complex equipment, using Reliasoft Ap... Show more

 • Promoted

Hardware Design Engineer - Santa Clara, CA

Sunrise SystemsSanta Clara, California, United States
$80.00 hourly
Full-time
Quick Apply

This role is an exciting opportunity in SBIO team to create FPGA hardware validation platforms and debugging complex issues involving both hardware and software.Collaborate with design and firmware... Show more

Foundational ML Researcher -- Live AI, LLM Training, Remote

PathwayPalo Alto, CA, United States
Remote
Full-time

Pathway is seeking R&D Engineers to work on ambitious AI projects involving attention-based models.The role offers a competitive salary with an employee stock option plan and the opportunity to... Show more

 • Promoted

Remote Corporate Development Associate - AI Trainer ($50-$60 per hour)

Data AnnotationSanta Cruz, California
$50.00 hourly
Remote
Full-time +1

DataAnnotation is committed to creating high-quality AI.Join our team to help train the next generation of AI while enjoying the flexibility of remote work and the freedom to set your own&nbsp... Show more

 • Promoted

Manufacturing Bring-up Engineer L2

CerebrasSunnyvale, CA, United States
Full-time

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs.Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programm... Show more

 • Promoted

EHS Training Engineer

Halo IndustriesSanta Clara, CA, United States
Full-time

Halo Industries has developed breakthrough technology to revolutionize a decades-old semiconductor material slicing process.Our laser-based solution minimizes waste, enhances material cost efficien... Show more

 • Promoted

EHS Training Engineer

Halo Industries, Inc.Santa Clara, CA, US
Full-time
Quick Apply

Halo Industries has developed breakthrough technology to revolutionize a decades-old semiconductor material slicing process.Our laser-based solution minimizes waste, enhances material cost efficien... Show more

Lead Machine Learning Engineer - Remote (US) or CA - Only W2

Saransh IncMountain View, CA, United States
Remote
Full-time
Quick Apply

Role: Lead Machine Learning Engineer</b></div> <div><b>Location: Mountain View, CA (3 days a week onsite) (OR) Remote</b></div> <div><b>Job Type: W2 ... Show more

Staff ML Engineer - Open-Domain QA & LLMs (RL)

Apple Inc.Santa Clara, CA, United States
Full-time

A leading technology company based in Santa Clara, California, is seeking a Staff Machine Learning Engineer to enhance AI features across Apple products.The ideal candidate will have extensive expe... Show more

 • Promoted

Multimodal LLM Engineer for Robotics & Autonomous Driving

XPENG & Volkswagen GroupSanta Clara, CA, United States
Full-time

A leading smart technology company seeks to hire a specialist to fine-tune pre-trained language models for Humanoid Robots and Autonomous Driving Cars.The role emphasizes strong Python skills and r... Show more

 • Promoted

Engineering Trainer - PLM, CAD, and Configuration Enablement

ArcherSan Jose, CA, United States
Full-time

Engineering Trainer - PLM, CAD, and Configuration Enablement.San Jose, California, United States.Archer is an aerospace company based in San Jose, California building an all-electric vertical takeo... Show more

 • Promoted

Aerospace Engineer

TradeJobsWorkforce95153 San Jose, CA, US
Full-time

Aerospace Engineer Job Duties: Contributes to the design, manufacturing, and testing of aircraft and a... Show more

 • Promoted

Physiatry/Physical Medicine & Rehabilitation Physician

CommonSpirit HealthSanta Cruz, CA, US
Full-time

Job Summary and Responsibilities.Physiatrist (PM&R) - Dignity Health Medical Group - Dominican, Santa Cruz, CA.Join the physician-led team at.Dignity Health Medical Group - Dominican.We are seeking... Show more

 • Promoted

Senior ML Infra Engineer - Training Efficiency

WaymoMountain View, CA, United States
Full-time

A leading autonomous driving technology company in Mountain View is seeking an experienced professional to enhance ML infrastructure for training workloads.Responsibilities include designing distri... Show more

 • Promoted

Autonomous Vehicle Simulation Validation Engineer

General MotorsSunnyvale, CA, United States
Full-time

A leading automotive company is seeking a Senior Software Simulation Validation Engineer in Sunnyvale, California.The role involves ensuring the quality and reliability of autonomous vehicle simula... Show more

 • Promoted

SAP HANA Modeling & Performance Engineer

Bright Vision TechnologiesFremont, CA, US
$100,000.00 yearly
Full-time
Quick Apply

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations.We leverage cutt... Show more