Talent.com
Member of Technical Staff- Machine Learning Operations Engineer
Microsoft • Mountain View, CA, United States
Job type: Full-time

Overview

At Microsoft Copilot, we focus on building the best AI-powered products in the world. We’re building applied AI products designed to improve over time, and we need someone to architect and build the infrastructure that makes that possible.

As a Machine Learning Operations (MLOps) Engineer, you’ll build the connective tissue between our models and the real world. You’re not just deploying models – you’re building the systems that accelerate model improvement and drive continuous learning from production.

This is a high‑impact, high‑autonomy role where your infrastructure decisions directly shape product quality and our ability to iterate. If you’ve ever felt frustrated by the gap between ML’s potential and its messy reality in production, this is your chance to close it.

The next wave of AI products won’t win on model architecture alone – they’ll win with robust infrastructure for continuous improvement. You’ll build the infrastructure that makes our AI products genuinely intelligent, not just generative. Every system you create shortens the loop between user feedback and model improvement, directly impacting product quality and user experience.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Starting January 26, 2026, MAI employees are expected to work from a designated Microsoft office at least four days a week if they live within 50 miles (U.S.) or 25 miles (non‑U.S., country‑specific) of that location. This expectation is subject to local law and may vary by jurisdiction.

Responsibilities / What you’ll build

  • Training pipelines that scale elegantly – design and implement robust training infrastructure that handles everything from data ingestion to model versioning, making it trivial for ML engineers to experiment and deploy with confidence
  • The data flywheel – build the infrastructure and product features that capture user interactions, ground truth labels, and edge cases, then automatically route them back into training loops. Turn every production interaction into a training example
  • Inference systems that deliver – dive deep into model serving architecture, optimize latency, manage costs, implement intelligent caching, and build the observability needed to maintain reliability at scale
  • Deployment pipelines with guardrails – create deployment systems that balance velocity with safety: automated testing, gradual rollouts, performance monitoring, and quick rollback mechanisms
  • Cross‑functional infrastructure – partner closely with ML engineers, platform engineers, and data scientists to build APIs and tools that enable tight, rapid feedback loops from production back to model development

Required Qualifications

  • Doctorate in Computer Science, Statistics, Software Engineering, or related field AND 3+ years applied ML engineering experience
  • OR Master’s Degree in Computer Science, Statistics, Software Engineering, or related field AND 4+ years applied ML engineering experience
  • OR Bachelor’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 6+ years applied ML experience
  • OR equivalent experience.
  • 6+ years of experience building and operating ML systems in production, with real stories about what breaks at scale and how you fixed it
  • 5+ years of experience in software engineering fundamentals with experience in distributed systems, containerization (Docker / Kubernetes), and cloud platforms (AWS / GCP / Azure)
  • 5+ years of hands‑on experience with ML orchestration tools (Airflow, Kubeflow, Metaflow), experiment tracking, model registries, and feature stores
  • 5+ years of experience optimizing model inference, including wrestling with GPU utilization and weighing the tradeoffs between latency, throughput, and cost

Preferred Qualifications

  • Doctorate in Computer Science, Data Engineering, Software Engineering, or related field AND 6+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)
  • OR Master’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 8+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)
  • OR Bachelor’s Degree in Computer Science, Data Engineering, Software Engineering, or related field AND 10+ years data engineering experience (e.g., building ETL pipelines, managing distributed data systems, implementing data quality frameworks)
  • OR equivalent experience.
  • Familiarity with LLM deployment patterns, vector databases, prompt management, and the unique challenges of serving foundation models
  • Experience working with RAG, fine‑tuning pipelines, or evaluation frameworks
  • The ability to see beyond individual components to design holistic systems where data flows naturally from production through improvement cycles and back
  • Desire and preference to work at the intersection of teams, translating between ML researchers who want flexibility and engineers who need reliability

Data Science IC4 – The typical base pay range for this role across the U.S. is USD 119,800 – 234,700 per year. There is a different range applicable to specific work locations within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD 158,400 – 258,000 per year.

Data Science IC5 – The typical base pay range for this role across the U.S. is USD 139,900 – 274,800 per year. There is a different range applicable to specific work locations within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD 188,000 – 304,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications and process offers for these roles on an ongoing basis.
