Talent.com
AI Engineer & Researcher - Inference
AI Engineer & Researcher - InferenceXai • Palo Alto, California, United States
AI Engineer & Researcher - Inference

AI Engineer & Researcher - Inference

Xai • Palo Alto, California, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

About xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge.

Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. Engineers are encouraged to work across multiple areas of the company, and as a result, all engineers and researchers share the title "Member of Technical Staff."

We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important.

All engineers and researchers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

xAI does not have recruiters. Every application is reviewed directly by a technical member of the team.

Tech Stack

  • Python / Rust
  • PyTorch / JAX
  • NCCL
  • CUDA (C++ and Triton)

Location

The role is based in the Bay Area [San Francisco and Palo Alto]. Candidates are expected to be located near the Bay Area or open to relocation.

Focus

  • Optimizing the latency and throughput of model inference.
  • Building reliable production serving systems to serve millions of users.
  • Accelerating research on scaling test-time compute.
  • Innovating new ideas that bring us closer to our goal : developing AI systems that can accurately understand the universe and generate new knowledge.
  • Ideal Experiences

  • Worked on system optimizations for model serving, such as batching, caching, load balancing, and model parallelism.
  • Worked on low-level optimizations for inference, such as CUDA kernels and code generation.
  • Worked on algorithmic optimizations for inference, such as quantization, distillation, and speculative decoding.
  • Interview Process

    After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews :

  • Coding assessment in a language of your choice.
  • Systems hands-on : Demonstrate practical skills in a live problem-solving session.
  • Project deep-dive : Present your past exceptional work to a small audience.
  • Meet and greet with the wider team.
  • Our goal is to finish the main process within one week. We don’t rely on recruiters for assessments. Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet.

    Annual Salary Range

    $180,000 - $440,000 USD

    California Consumer Privacy Act (CCPA) Notice

    [job_alerts.create_a_job]

    Ai Researcher • Palo Alto, California, United States

    [internal_linking.related_jobs]
    Generative AI Engineer

    Generative AI Engineer

    Lumex Talent • Santa Clara, CA, United States
    [job_card.full_time]
    Location : Santa Clara, CA (Hybrid).Base Salary Range : $225,000-$300,000 + Equity.A well-funded technology company in the. AI capabilities across their platform.This role is ideal for senior engineer...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Staff AI Researcher, Foundation Models

    Staff AI Researcher, Foundation Models

    Verily Life Sciences • Mountain View, CA, United States
    [job_card.full_time]
    Verily is a subsidiary of Alphabet that is using a data-driven approach to change the way people manage their health and the way healthcare is delivered. Launched from Google X in 2015, our purpose ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer & Researcher - Search

    AI Engineer & Researcher - Search

    Xai • Palo Alto, California, United States
    [job_card.full_time]
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Research Engineer, Multimodal AI

    Research Engineer, Multimodal AI

    Deepmind • Mountain View, California, United States
    [job_card.full_time]
    Research Engineer position to help advance the state of the art in multimodal AI, and bring its benefits to Google products used by billions of people worldwide. Our team at Google DeepMind works on...[show_more]
    [last_updated.last_updated_30] • [promoted]
    AI Engineer

    AI Engineer

    Mlabs • Sunnyvale, California, United States
    [job_card.full_time]
    AI Engineer (Automotive Software).We are a technology leader driving the transformation to.AI-enabled software-defined vehicles. Our platform is essential for accelerating the shift from traditional...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior AI Research Engineer

    Senior AI Research Engineer

    Deep Abacus • Fremont, CA, United States
    [job_card.full_time]
    VC-backed Series B startup on a high-growth trajectory.Revolutionizing commerce through direct engagement with powerful conversation AI. Join a world-class team with opportunity for rapid career adv...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    AI Engineer

    AI Engineer

    Yupp Ai • Mountain View, California, United States
    [job_card.full_time]
    We are a well-funded, rapidly growing, early-stage AI startup headquartered in Silicon Valley that is building a two-sided product one side meant for global consumers and the other side for AI b...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer

    AI Engineer

    MLabs • Sunnyvale, California, USA
    [job_card.full_time]
    AI Engineer (Automotive Software).We are a technology leader driving the transformation to.AI-enabled software-defined vehicles. Our platform is essential for accelerating the shift from traditional...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Senior Threat Intelligence Research Engineer

    Senior Threat Intelligence Research Engineer

    Fortinet • Sunnyvale, CA, United States
    [job_card.full_time]
    Join Fortinet, a cybersecurity pioneer with over two decades of excellence, as we continue to shape the future of cybersecurity and redefine the intersection of networking and security.At Fortinet,...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Research Engineer, Data Infrastructure

    AI Research Engineer, Data Infrastructure

    1x Technologies As • Palo Alto, California, United States
    [job_card.full_time]
    AI Research Engineer, Data Infrastructure.We build humanoid robots that work alongside people to solve labor shortages and create abundance. As a Research Engineer in Infrastructure, you will design...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Engineer

    AI Engineer

    Zone It Solutions • San Jose, California, United States
    [job_card.full_time]
    We are on the lookout for an innovative and driven.In this role, you will be responsible for designing, developing, and deploying AI models that will enhance our products and improve our services.B...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Distinguished AI Researcher

    Distinguished AI Researcher

    VECTRA • San Jose, CA, United States
    [job_card.full_time]
    Vectra® is the leader in AI-driven threat detection and response for hybrid and multi-cloud enterprises.The Vectra AI Platform delivers integrated signal across public cloud, SaaS, identity, and da...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Machine Learning Researcher / Engineer (Foundational Models)

    Machine Learning Researcher / Engineer (Foundational Models)

    Pathway • Palo Alto, California, United States
    [job_card.full_time] +1
    At Pathway we are shaking the foundations of artificial intelligence by introducing the world’s first post-transformer model that adapts and thinks just like humans. Our breakthrough architecture ou...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Applied Research Engineer

    Applied Research Engineer

    Confidential • Hayward, CA, US
    [job_card.full_time]
    About the Company (Confidential).Our client is a cutting-edge AI research company specializing exclusively in.Series A from top-tier investors. Matrix Partners, Swift Ventures, Y Combinator, and AI ...[show_more]
    [last_updated.last_updated_variable_hours] • [promoted] • [new]
    ML Research Engineer for Home Robotics & Embodied AI

    ML Research Engineer for Home Robotics & Embodied AI

    Sunday Robotics • Mountain View, CA, United States
    [job_card.full_time]
    A tech innovation company in Mountain View, California, is seeking a Machine Learning Research Engineer.You'll design sophisticated robot learning algorithms to enhance dexterous manipulation in ho...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Lead AI Engineer / Head of R&D

    Lead AI Engineer / Head of R&D

    Thoth AI • Hayward, CA, US
    [job_card.full_time]
    To engineer the next generation of.AI-Assisted Human Annotation Systems.Our goal is to scale the production of high-quality, personalized, and safety-aligned datasets. Participate in the customer so...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Artificial Intelligence Engineer (San Jose)

    Artificial Intelligence Engineer (San Jose)

    NextGenPros Inc • San Jose, CA, US
    [job_card.part_time]
    Location : San Jose, CA (Hybrid).Investigate and develop methods for.Design and experiment with LLM.SQL-based processing and advanced text preprocessing techniques. LangChain, LangGraph, LangSmith fo...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    AI Researcher for Mobility PhD

    AI Researcher for Mobility PhD

    AVA Consulting • Mountain View, CA, United States
    [job_card.full_time]
    Job Title : AI Researcher for Mobility PhD.Our client, a major employer in the area, is looking for an.Client is seeking a PhD graduate in Computer Science, Electrical Engineering, Mechanical Engine...[show_more]
    [last_updated.last_updated_30] • [promoted]