Talent.com
AI Engineer & Researcher - Inference
AI Engineer & Researcher - InferenceXai • Palo Alto, California, United States
AI Engineer & Researcher - Inference

AI Engineer & Researcher - Inference

Xai • Palo Alto, California, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. Engineers are encouraged to work across multiple areas of the company, and as a result, all engineers and researchers share the title "Member of Technical Staff."

We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

xAI does not have recruiters. Every application is reviewed directly by a technical member of the team.

Tech Stack

  • Python / Rust
  • PyTorch / JAX
  • NCCL
  • CUDA (C++ and Triton)

Location

The role is based in the Bay Area [San Francisco and Palo Alto]. Candidates are expected to be located near the Bay Area or open to relocation.

Focus

  • Optimizing the latency and throughput of model inference.
  • Building reliable production serving systems to serve millions of users.
  • Accelerating research on scaling test-time compute.
  • Innovating new ideas that bring us closer to our goal : developing AI systems that can accurately understand the universe and generate new knowledge.
  • Ideal Experiences

  • Worked on system optimizations for model serving, such as batching, caching, load balancing, and model parallelism.
  • Worked on low-level optimizations for inference, such as CUDA kernels and code generation.
  • Worked on algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding.
  • Interview Process

    After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews :

  • Coding assessment in a language of your choice.
  • Systems hands-on : Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive : Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.
  • Our goal is to finish the main process within one week. We don’t rely on recruiters for assessments. Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet.

    Annual Salary Range

    $180,000 - $440,000 USD

    California Consumer Privacy Act (CCPA) Notice

    [job_alerts.create_a_job]

    Ai Researcher • Palo Alto, California, United States

    [internal_linking.related_jobs]
    Sr Machine Learning Engineer - GenAI, LLM, Agentic AI

    Sr Machine Learning Engineer - GenAI, LLM, Agentic AI

    Eightfold • Santa Clara, California, United States
    [job_card.full_time]
    Research, design, development, and deployment of advanced AI agents and agentic systems.Architect and implement complex multi-agent systems, including planning, decision-making, and execution capab...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    Generative AI Engineer

    Generative AI Engineer

    Lumex Talent • Santa Clara, CA, United States
    [job_card.full_time]
    Location : Santa Clara, CA (Hybrid).Base Salary Range : $225,000-$300,000 + Equity.A well-funded technology company in the. AI capabilities across their platform.This role is ideal for senior engineer...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer & Researcher - Search

    AI Engineer & Researcher - Search

    Xai • Palo Alto, California, United States
    [job_card.full_time]
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Research Engineer, Multimodal AI

    Research Engineer, Multimodal AI

    Deepmind • Mountain View, California, United States
    [job_card.full_time]
    Research Engineer position to help advance the state of the art in multimodal AI, and bring its benefits to Google products used by billions of people worldwide. Our team at Google DeepMind works on...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI Engineer

    AI Engineer

    Mlabs • Sunnyvale, California, United States
    [job_card.full_time]
    AI Engineer (Automotive Software).We are a technology leader driving the transformation to.AI-enabled software-defined vehicles. Our platform is essential for accelerating the shift from traditional...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer II

    AI Engineer II

    VirtualVocations • Fremont, California, United States
    [job_card.full_time]
    A company is looking for an AI Engineer II to develop and deploy machine learning solutions in healthcare.Key Responsibilities Develop predictive models and machine learning algorithms to enhance...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI Engineer

    AI Engineer

    Yupp Ai • Mountain View, California, United States
    [job_card.full_time]
    We are a well-funded, rapidly growing, early-stage AI startup headquartered in Silicon Valley that is building a two-sided product one side meant for global consumers and the other side for AI b...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Research Engineer

    Senior AI Research Engineer

    Deep Abacus • Fremont, CA, United States
    [job_card.full_time]
    VC-backed Series B startup on a high-growth trajectory.Revolutionizing commerce through direct engagement with powerful conversation AI. Join a world-class team with opportunity for rapid career adv...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    AI Research Engineer

    AI Research Engineer

    VirtualVocations • Fremont, California, United States
    [job_card.full_time]
    A company is looking for an AI Research Engineer to optimize deep learning models for edge AI platforms.Key Responsibilities Research and develop quantization-aware training (QAT) and post-traini...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Threat Intelligence Research Engineer

    Senior Threat Intelligence Research Engineer

    Fortinet • Sunnyvale, CA, United States
    [job_card.full_time]
    Join Fortinet, a cybersecurity pioneer with over two decades of excellence, as we continue to shape the future of cybersecurity and redefine the intersection of networking and security.At Fortinet,...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Research Engineer, Data Infrastructure

    AI Research Engineer, Data Infrastructure

    1x Technologies As • Palo Alto, California, United States
    [job_card.full_time]
    AI Research Engineer, Data Infrastructure.We build humanoid robots that work alongside people to solve labor shortages and create abundance. As a Research Engineer in Infrastructure, you will design...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer

    AI Engineer

    Zone It Solutions • San Jose, California, United States
    [job_card.full_time]
    We are on the lookout for an innovative and driven.In this role, you will be responsible for designing, developing, and deploying AI models that will enhance our products and improve our services.B...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Founding AI Engineer

    Founding AI Engineer

    Polar Sky • Palo Alto, California, United States
    [job_card.full_time]
    We're looking for a Founding AI Engineer who loves building with AI crafting context pipelines, integrating and evaluating LLMs into production systems, and delivering AI-native product experien...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Applications Engineer

    AI Applications Engineer

    InsideHigherEd • Stanford, California, United States
    [job_card.full_time]
    Business Affairs : University IT (UIT), Redwood City, California, United States.Information Technology Services📅Sep 08, 2025 Post Date📅107213 Requisition #. Are you an experienced AI / GenAI en...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Researcher / Engineer (Foundational Models)

    Machine Learning Researcher / Engineer (Foundational Models)

    Pathway • Palo Alto, California, United States
    [job_card.full_time] +1
    At Pathway we are shaking the foundations of artificial intelligence by introducing the world’s first post-transformer model that adapts and thinks just like humans. Our breakthrough architecture ou...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer

    AI Engineer

    Eudia • Palo Alto, California, United States
    [job_card.full_time]
    Eudia is redefining the future of legal work with AI-powered Augmented Intelligence, enabling Fortune 500 legal teams to move faster, manage risk more effectively, and unlock new business value.Bac...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Engineer

    Senior AI Engineer

    Linkedin • Sunnyvale, California, United States
    [job_card.full_time]
    LinkedIn is the worlds largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover excit...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Engineer LLM, RAG

    Senior AI Engineer LLM, RAG

    A • Sunnyvale, California, United States
    [job_card.full_time]
    Our Wayfinder team is building scalable, certifiable autonomy systems to power the next generation of commercial aircraft. Our team of experts is driving the maturation of machine learning and other...[show_more]
    [last_updated.last_updated_30] • [promoted]