Talent.com
Client Services
Software Engineer (Model Evaluation & Benchmarking)Client Services • San Francisco, CA, United States
No longer accepting applications
Software Engineer (Model Evaluation & Benchmarking)

Software Engineer (Model Evaluation & Benchmarking)

Client Services • San Francisco, CA, United States
6 days ago
Job type
  • Full-time
Job description
Software Engineer (Model Evaluation & Benchmarking)

San Francisco, California (Hybrid)

$140,000 - $170,000 + Equity + Healthcare + 401(k) + PTO

Are you a Software Engineer interested in working on the systems that measure and validate cutting-edge AI models before they reach production, while joining a high-growth AI start-up at an exciting stage of expansion?

This is an opportunity to join a high-growth AI company building advanced computer vision and generative AI systems used by global brands. The engineering team is focused on developing reliable multimodal AI platforms where model performance, consistency, and reliability are critical for real-world deployment.

In this role, you will build evaluation and benchmarking systems that test generative and vision models across image, video, and multimodal pipelines. You will work closely with researchers and ML engineers to ensure new models meet production-quality standards and that regressions are quickly detected as systems evolve.

This role would suit a software engineer who enjoys building infrastructure, testing frameworks, and evaluation tooling for complex machine learning systems.

The Role:
*Build automated evaluation pipelines for AI and vision models
*Develop benchmarking systems for generative and multimodal models
*Detect regressions across model checkpoints and versions
*Design metrics for model realism, consistency, and performance
*Integrate evaluation tooling into CI/CD and deployment workflows

The Person:
*Strong programming skills in Python
*Experience building testing frameworks or benchmarking tools
*Understanding of machine learning experimentation workflows
*Experience working with data analysis tools such as NumPy or Pandas
*Interest in computer vision, generative AI, or multimodal systems

Reference Number: BBBH270868
To apply for this role or to be considered for further roles, please click "Apply Now" or contact Ben Herridge at Rise Technical Recruitment.

Rise Technical Recruitment Inc of 1011 Centre Rd, Suite 322, Wilmington, DE 19805 act as an employer-paid private personnel agency.

The salary advertised is the bracket available for this position. The actual salary paid will be dependent on your level of experience, qualifications and skill set and will be decided by our client, the employer. Rise are not responsible or liable for any hiring decisions made by the end client.

We are an equal opportunities company and welcome applications from all suitable candidates.
Create a job alert for this search

Software Engineer (Model Evaluation & Benchmarking) • San Francisco, CA, United States

Similar jobs

Public Sector ML Engineer: Robust Evaluation Pipelines

Scale AISan Francisco, California, United States
Full-time

A leading AI technology firm in Washington DC is seeking a Machine Learning Engineer to design automated evaluation pipelines.The role involves working with advanced AI systems in government settin... Show more

 • Promoted

Software Engineer, Developer Productivity & AI Tooling

GleanSan Francisco, CA, United States
Full-time

A leading Work AI platform in San Francisco is seeking a Software Engineer for Developer Productivity to enhance build systems and CI/CD pipelines.This hybrid role involves developing tooling integ... Show more

 • Promoted

AI/ML Software Engineer for Drones & Space Systems

Dreki SystemsSan Francisco, CA, United States
Full-time

A leading technology development company in San Francisco is seeking a passionate Software Developer.You will be integral in building software systems for drones and robotic platforms.The ideal can... Show more

 • Promoted

Solutions Engineer

GetcleraSan Francisco, California, United States
Full-time

Hybrid • San Francisco, Chicago, San Francisco, Chicago • $100k - $200k.Solutions Engineer at Openlayer (S21) $100K - $200K The fastest way to ship airtight AI San Francisco, CA, US / Chicago, IL, ... Show more

 • Promoted

Robotics Simulation & Verification Engineer

AndromedaSan Francisco, California, United States
Full-time

A leading robotics company in San Francisco is looking for a Simulation and Test Engineer to develop simulation platforms and test infrastructures for its humanoid robot, Abi.This role will involve... Show more

 • Promoted

Software Engineer (Technical Leadership) - Machine Learning

MetaSan Francisco, CA, United States
Full-time

Meta is seeking talented principal engineers to join our teams in building cutting‑edge products that connect billions of people around the world.As a member of our team, you will oversee complex t... Show more

 • Promoted

Embedded ML Engineer – Gesture Recognition

Imago.aiSan Francisco, California, United States
Full-time

Embedded ML Engineer role at Imago.We are seeking an engineer at the intersection of embedded systems and machine learning to enable rich, reliable interactions on wearable devices.The candidate wi... Show more

 • Promoted

LLM Applications Engineer

Uncountable Inc.San Francisco, California, United States
Full-time

Thank you for your interest in Uncountable Engineering!.Uncountable is seeking experienced engineers to lead the transformation of our platform into an AI-first R&D ecosystem.We are building the ne... Show more

 • Promoted

Founding Engineer - Build the Physics Foundation Model

Godela (YC X25)San Francisco, California, United States
Full-time

A cutting-edge tech startup is seeking a Founding Software Engineer to build systems that leverage physics-informed models.The ideal candidate will have strong software engineering skills, experien... Show more

 • Promoted

Senior Software Engineer, Model Serving

DatabricksSan Francisco, CA, United States
Full-time

Senior Software Engineer, Model Serving at Databricks.Senior Software Engineer, Model Serving.At Databricks, we are passionate about enabling data teams to solve the world's toughest problems — fro... Show more

 • Promoted

AI Software Engineer (Mid-Level)

Full ScopeSan Francisco, California, United States
Full-time

AI Software Engineer (Mid-Level).We are seeking a talented and motivated Mid-Level AI Software Engineer to join our AI team.The ideal candidate will have a solid foundation in software engineering ... Show more

 • Promoted

Applied ML Engineer (LLMs & RAG)

AlldusSan Francisco, CA, United States
Full-time

This role is a combination of research and engineering.We are looking for someone who's a talented software engineer at their core, but has contributed to AI research, especially in the field of RA... Show more

 • Promoted

Staff Engineer - ML Inference & Model Efficiency

CohereSan Francisco, CA, United States
Full-time

A leading AI research firm in San Francisco is seeking a Member of Technical Staff specialized in Model Efficiency.In this role, you will enhance LLM inference systems by tackling performance issue... Show more

 • Promoted

Research Engineer, ML Systems (All Industry Levels)

Character.AISan Francisco, California, United States
Full-time

Research Engineer, ML Systems (All Industry Levels).Research Engineer, ML Systems (All Industry Levels).Research Engineer, ML Systems (All Industry Levels).Research Engineer, ML Systems (All Indust... Show more

 • Promoted

Software Engineer, Computational Geometry (Autonomy)

KodiakSan Francisco, CA, United States
Full-time

Software Engineer with deep expertise in computational geometry to join the core algorithms team.You will design and implement the geometric foundations that power mission-critical systems across o... Show more

 • Promoted

Software Engineer - AI Technologies (Remote)

Outlier AISan Francisco, California, United States
Remote
Full-time

Outlier helps the world’s most innovative companies improve their AI agents by providing human feedback.We collaborate with leading AI organizations to train Large Language Models (LLMs) to functio... Show more

 • Promoted

Senior AI Evaluation & Reliability Engineer

The Mice Groups, Inc.Redwood City, CA, United States
Full-time

A leading AI solutions firm in Redwood City seeks a Senior Engineer specializing in AI Evaluation & Reliability.The role focuses on designing evaluation metrics, ensuring operational excellence for... Show more

 • Promoted

Software Engineer

Autoscience InstituteMenlo Park, CA, United States
Full-time

This range is provided by Autoscience Institute.Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.At Autoscience Institute, we create AI systems t... Show more

 • Promoted

Deployed Engineer

Reval RecruitingSan Francisco, CA, United States
Full-time

This role sits at the intersection of engineering, customer engagement, and product innovation.You’ll work directly with companies building cutting‑edge LLM applications—helping them turn ideas int... Show more

 • Promoted

ML Engineer - Personalization & Recommendation Systems

krea.aiSan Francisco, CA, United States
Full-time

ML Engineer - Personalization & Recommendation Systems.At Krea, we are building next-generation AI creative tools.We are dedicated to making AI intuitive and controllable for creatives.Our mission ... Show more