Talent.com
Reinforcement Learning Research Engineer (Sonoma)
Strativ Group • Sonoma, CA, US

Reinforcement Learning Research Engineer

Location: Remote (SF / MA Based)

Package: $200-250k Cash + Benefits

A scaling, state-of-the-art (SOTA) generative AI startup with a world-class team (the founders have multiple prior exits) and talent from OpenAI, IBM, MIT, and several other top organizations. The company focuses on pioneering work in large language models (LLMs), code generation, and code translation. Their projects directly involve industry-leading partners, where they're applying advanced AI to solve meaningful, practical challenges with real-world impact.

Broad Responsibilities:

  • Build and maintain robust distributed training systems using PyTorch and JAX
  • Build and train production-ready reinforcement learning infrastructure
  • Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.
  • Drive innovation by researching and developing scalable reinforcement learning (RL) algorithms and training paradigms for complex, high-dimensional optimization and decision-making tasks, including recent advancements in RL for feedback-driven optimization in LLMs.
  • Design and train large-scale RL environments for optimization problems spanning multiple industries.
  • Engage with frontier research through open-source projects and potential publications.

Requirements:

  • 2+ years of experience in distributed or decentralized RL (multi-agent preferred) using PyTorch and JAX.
  • Research experience with RL for high-dimensional optimization problems, particularly in multi-agent reinforcement learning settings.
  • Experience implementing advanced RL techniques such as task decomposition, hierarchical RL, goal-conditioned RL, or human-AI collaboration.
  • Experience deploying and managing multi-GPU training infrastructure at scale.
  • Eligible for TS/SCI clearance.

Get in touch today for more details and immediate consideration / interview!
