Salesforce, Inc. • San Francisco, CA, United States
[job_card.variable_days_ago]
[job_preview.job_type]
[job_card.full_time]
[job_card.job_description]
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.
Job CategorySoftware EngineeringJob Details
About Salesforce
Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzword — it’s a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all.
Opportunity Description :
We’re Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you’ve come to the right place. We are looking for a technical leader who understands that
building
an AI agent is only 10% of the work—the real engineering challenge is
measuring
it. We need a thought leader who can solve the "problem nobody talks about" : evaluating non-deterministic agentic systems in production. You will lead the team responsible for defining what "good" looks like for agents, moving beyond basic accuracy to rigorous evals that bridge agent spec’s to business outcomes. You will thread together
Applied Science
(defining metrics, curation of golden datasets, establishing ground truth) and
Product Engineering
(shipping software)
Responsibilities :
Build the "Evaluation Core" :
Lead the engineering of a scalable evaluation platform that runs in parallel with agent execution.
Thread Science & Engineering :
Operationalize applied science by turning theoretical benchmarks into production regression tests and bring about a discipline of eval driven development
Thought Leadership :
Act as the internal SME for AI testing. Educate cross-functional partners (Product, UX, ML) on the difference between stochastic AI behavior and traditional deterministic software
You are an Engineering leader who can lead the group through technical leadership, process management, maintain a good discipline of high quality code delivery aided with AI tools as necessary.
You are a People leader who ensures teams have clear priorities and adequate resources. You are a multiplier and have a passion for team and team members’ success providing technical guidance, career development, and mentoring.
Required Skills
Specialized Agent Evaluation Experience :
You have specific experience building evaluation harnesses for LLMs or Agents
Applied Science & Engineering Hybrid :
You have a track record of managing "Research Engineering" or "Applied Science" teams where you had to operationalize vague scientific goals into shipping code. You are comfortable curating "Golden Sets" of data and building custom benchmarks from scratch.
Deep Knowledge of Eval Methodologies :
You are fluent in modern evaluation techniques, including : - LLM-as-a-Judge : Validating judges against human ground truth to prevent self-bias.- Behavioral Analysis : Evaluating
how
an agent thinks (Reasoning Traces / Chain of Thought), not just the final output.
Production-Grade AI Experience :
You have shipped AI products where you had to manage real-world constraints like token budgets, inference latency, and cost-normalized accuracy. Pragmatic orientation to building ML solutions that work in production at scale
Familiarity with academic and industry benchmarks and their limitations in a business environment.
Experience building
simulation environments
(mock APIs, virtual users) to stress-test agents safely before deployment.
Experience with Data engineering, specifically around data acquisition, creating data pipelines, metric measurement, and analysis
Experience owning highly available services and putting processes in place to maintain uptime
Prior experience working with global teams
Strong verbal and written communication skills, organizational and time management skills
Advanced degree in Computer Science, Machine Learning, or related field with a focus on system evaluation or reliability#LI-YUnleash Your PotentialWhen you join Salesforce, you’ll be limitless in all areas of your life. Our benefits and resources support you to find balance and
be your best
, and our AI agents accelerate your impact so you can
do your best
. Together, we’ll bring the power of Agentforce to organizations of all sizes and deliver amazing experiences that customers love. Apply today to not only shape the future — but to redefine what’s possible — for yourself, for AI, and the world.AccommodationsIf you require assistance due to a disability applying for open positions please submit a request via this .Posting StatementAny employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including : time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link : https : / / www.salesforcebenefits.com.Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.### ### ### ### At Salesforce, we believe in equitable compensation practices that reflect the dynamic nature of labor markets across various regions.The typical base salary range for this position is $211,500 - $306,600 annually. In select cities within the San Francisco and New York City metropolitan area, the base salary range for this role is $230,800 - $334,600 annually.The range represents base salary only, and does not include company bonus, incentive for sales roles, equity or benefits, as applicable.### ### ### ###
#J-18808-Ljbffr
[job_alerts.create_a_job]
Director Agentforce Testing Center Engineering • San Francisco, CA, United States
[internal_linking.similar_jobs]
Director, Agentforce Testing Center Engineering
Salesforce, Inc.. • San Francisco, CA, United States
[job_card.full_time]
We’re Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM.Leading with our core values, we help companies across every industry blaze new trails and connect with c...[show_more]
Anomali is headquartered in Silicon Valley and is the Leading AI-Powered Security Operations Platform that is modernizing security operations.
At the center of it is an omnipresent, intelligent, and...[show_more]
Founded in 2015, Shield AI is a venture‑backed deep‑tech company with the mission of protecting service members and civilians with intelligent systems.
Its products include the V‑BAT and X‑BAT aircr...[show_more]
Fonoa Technologies ltd. • San Francisco, CA, United States
[job_card.full_time]
A leading software company is seeking a Regional Enterprise Sales Director in San Francisco, California.This role involves leading a team of Account Executives and developing strategies for sales g...[show_more]
[last_updated.last_updated_30] • [promoted]
Director, GTM Automation & Digital Wallet
Salesforce • San Francisco, CA, United States
[job_card.full_time]
A leading cloud-based software company in San Francisco seeks a Director, Go To Market Automation / Digital Wallet.This role entails leading engineering teams, establishing best practices, and overse...[show_more]
Activision Blizzard Media • San Francisco, CA, United States
[job_card.full_time]
A leading gaming technology company in San Francisco seeks an Associate Engineering Director to lead multiple engineering teams and drive operational excellence.
You will transform ambiguous busines...[show_more]
N28 Technologies is a boutique AI+ Salesforce Implementation Partner based in San Francisco with operations in Canada and India as well.
N28 was founded by a Salesforce alum whose goal is to provide...[show_more]
[last_updated.last_updated_30] • [promoted]
Director, Product Management, Data Center Power Solutions
Qcells North America • San Francisco, CA, United States
[job_card.full_time]
Director of Product Management to lead the development of a new Energy‑Management‑as‑a‑Service solution that integrates operational technology, information technology, and artificial intelligence t...[show_more]
We're looking for a seasoned analytics leader to drive data-informed decisions for the Business Experiences organization.
The Business Experiences Analytics Data Science team is a group of expert da...[show_more]
University of California Berkeley • Berkeley, CA, United States
[job_card.full_time] +1
Add to Favorite Jobs Email this Job.At the University of California, Berkeley, we are dedicated to fostering a community where everyone feels welcome and can thrive.
Our culture of openness, freedom...[show_more]
Online Consumer Panels America • Vallejo, California, US
[filters.remote]
[job_card.part_time] +1
Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies.
We guarantee 15-25 hours per week with an hourly pay of bet...[show_more]
[last_updated.last_updated_30] • [promoted]
Facilities Maintenance & Life Support Manager $100,000 - $150,000
Six Flags Discovery Kingdom Careers • VALLEJO, CA, United States
[job_card.full_time]
The Maintenance Division is currently seeking a qualified person to manage all activities related to facilities maintenance to include paint, carpentry, sign / art, landscape, pest control, life supp...[show_more]
[last_updated.last_updated_1_day] • [promoted]
Patient Access Director-Redwood City, CA
Optum • Redwood City, CA, United States
[job_card.full_time]
Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives.The work you do with our team will directly improve health outcomes by connect...[show_more]
A leading global payments company is seeking a Director of Growth & Revenue Systems in San Francisco to strategize and manage systems that enhance revenue growth.
This critical leadership role requi...[show_more]
A leading financial services firm is seeking a Senior Director, Distinguished Engineer to shape the future of banking in the cloud.
This role involves driving technical innovation, mentoring talent,...[show_more]
Work From Home - Product Specialist - $45 per hour
GL1 • Richmond, California
[filters.remote]
[job_card.part_time] +1
Product Testers are wanted to work from home nationwide in the US to fulfill upcoming contracts with national and international companies.
We guarantee 15-25 hours per week with an hourly pay of bet...[show_more]
[last_updated.last_updated_30] • [promoted]
Director, Agentforce Testing Center Engineering
salesforce.com, inc. • San Francisco, CA, United States
[job_card.full_time]
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.
Salesforce is the #1 AI CRM, where humans with age...[show_more]