Talent.com
Manager, Agent Evaluation
Manager, Agent EvaluationComcast Corporation • Washington D.C., District of Columbia, United States
Manager, Agent Evaluation

Manager, Agent Evaluation

Comcast Corporation • Washington D.C., District of Columbia, United States
[job_card.variable_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Make your mark at Comcast a Fortune 30 global media and technology company. From the connectivity and platforms we provide, to the content and experiences we create, we reach hundreds of millions of customers, viewers, and guests worldwide. Become part of our award-winning technology team that turns big ideas into cutting-edge products, platforms, and solutions that our customers love. We create space to innovate, and we recognize, reward, and invest in your ideas, while ensuring you can proudly bring your authentic self to the workplace. Join us. You’ll do the best work of your career right here at Comcast. (In most cases, Comcast prefers to have employees on-site collaborating unless the team has been designated as virtual due to the nature of their work. If a position is listed with both office locations and virtual offerings, Comcast may be willing to consider candidates who live greater than 100 miles from the office for the remote option.)

Job Summary

The Agent Evaluation team is responsible for testing whether AI agents return the correct and expected responses. We build the framework, metrics, and test cases that validate agent behavior, accuracy, and reliability before release. Our goal is to ensure agents perform consistently and meet product and user expectations.

Job Description

Role Summary :

The Manager, Agent Evaluation will lead the team responsible for building and scaling the evaluation framework that tests whether AI agents return accurate, reliable, and expected responses across real-world scenarios.

Key Responsibilities :

  • Lead and grow a team focused on agent and model evaluation
  • Define the strategy, roadmap, and standards for agent testing and validation
  • Oversee development of metrics, benchmarks, and testing frameworks to measure response quality, accuracy, safety, and performance
  • Ensure evaluation coverage aligns with product, UX, and business requirements
  • Partner closely with Product, Engineering, Research, and Platform teams to integrate evaluation into the development lifecycle
  • Drive experimentation and continuous improvement of evaluation methodologies
  • Establish reporting mechanisms to clearly communicate evaluation results and trade-offs to leadership
  • Implement best practices for model versioning, monitoring, and release validation
  • Stay current with advancements in LLMs, AI agents, and evaluation techniques

Required Skills :

  • Strong foundation in machine learning fundamentals and applied ML systems
  • Hands-on experience with model and agent evaluation methodologies
  • Familiarity with LLMs, AI agents, and prompt-driven systems
  • Proficiency in Python and modern ML frameworks (e.g., PyTorch, TensorFlow)
  • Experience defining metrics, benchmarks, and experimentation frameworks
  • Solid understanding of MLOps practices, including model versioning, monitoring, and CI / CD
  • Ability to collaborate effectively with product, platform, and research teams
  • Clear communicator of technical trade-offs, evaluation insights, and results
  • Disclaimer :

  • This information has been designed to indicate the general nature and level of work performed by employees in this role. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities and qualifications.
  • Comcast is an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law.

    Skills :

    Machine Learning (ML); Metrics Reporting; Natural Language Processing (NLP); Cross-Functional Collaboration; Large Language Models (LLMs); AI Frameworks; Python (Programming Language)

    Salary :

    Primary Location Pay Range : $183,063.62 - $274,595.42

    Comcast intends to offer the selected candidate base pay within this range, dependent on job-related, non-discriminatory factors such as experience. The application window is 30 days from the date job is posted, unless the number of applicants requires it to close sooner or later.

    Base pay is one part of the Total Rewards that Comcast provides to compensate and recognize employees for their work. Most sales positions are eligible for a Commission under the terms of an applicable plan, while most non-sales positions are eligible for a Bonus. Additionally, Comcast provides best-in-class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That?s why we provide an array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality ? to help support you physically, financially and emotionally through the big milestones and in your everyday life. Please visit the compensation and benefits summary on our careers site for more details.

    Education

    Master's Degree

    While possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.

    Relevant Work Experience

    5-7 Years

    [job_alerts.create_a_job]

    Manager Agent Evaluation • Washington D.C., District of Columbia, United States

    [internal_linking.similar_jobs]
    TPC - Design, Monitoring, and Evaluation Specialist

    TPC - Design, Monitoring, and Evaluation Specialist

    BlueForce • Washington, DC, United States
    [job_card.full_time]
    Design, Monitoring, and Evaluation Specialist.CONUS and OCONUS in support of the US Department of State (DoS) Bureau of International Narcotics and Law Enforcement Affairs (INL) Program.If you want...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Test and Evaluation Technician

    Test and Evaluation Technician

    Syntelligent Analytic Solutions • Arlington, VA, United States
    [job_card.full_time]
    Syntelligent Analytic Solutions, LLC.Our customers' and Syntelligent's success are built upon the core values of People First, Integrity & Accountability, Mission Driven, Community Focus and Team O...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    PVAT Evaluation, Analysis, and Integration Engineer

    PVAT Evaluation, Analysis, and Integration Engineer

    The Johns Hopkins University Applied Physics Laboratory • Laurel, Maryland, United States
    [job_card.temporary]
    Are you interested in the research, development, design, testing, and analysis of Position, Velocity, Attitude, and Timing (PVAT) systems including Global Navigation Satellite System (GNSS) receive...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Remote Strategic Analytics Lead – Consumer Banking

    Remote Strategic Analytics Lead – Consumer Banking

    KeyBank • Washington, DC, United States
    [filters.remote]
    [job_card.full_time]
    A financial services company is seeking a leader for its Consumer Analytics & Advisory team.This role involves leading a team of analysts, translating data into strategic recommendations, and colla...[show_more]
    [last_updated.last_updated_30] • [promoted]
    TEP Project Manager (Hybrid / DC)

    TEP Project Manager (Hybrid / DC)

    General Dynamics • Washington, DC, United States
    [job_card.full_time]
    Technical Evaluation Panel Project Manager.GDIT is seeking an experienced Technical Evaluation Panel Project Manager to plan, coordinate, and deliver activities for a multidisciplinary panel suppor...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Audience Development Manager

    Audience Development Manager

    Government Executive Media Group LLC • Washington, District of Columbia, United States
    [job_card.permanent]
    GovExec is seeking an Audience Development Manager to join our team.In this role, you will be responsible for growing, engaging, and retaining our audiences across New York and Pennsylvania.This po...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Engagement Manager

    Engagement Manager

    Trilagen • Bethesda, MD, United States
    [job_card.full_time]
    Trilagen is on the lookout for an experienced Engagement Manager to join our team and drive client success through effective project management and relationship building. In this critical role, you ...[show_more]
    [last_updated.last_updated_30] • [promoted]
    Senior Manager-Engagement Management-Engagement Lead

    Senior Manager-Engagement Management-Engagement Lead

    EXL • Washington, DC, United States
    [job_card.full_time]
    Engagement Management-Engagement Lead.This role combines engagement management, client relationship leadership, sales enablement, and delivery oversight with deep knowledge of data platforms, gover...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Monitoring and Evaluation Specialist II

    Monitoring and Evaluation Specialist II

    Human Capital Resources and Concepts • Washington, DC, DC, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Human Capital Resources and Concepts is seeking a Monitoring and Evaluation Specialist II who will work collaboratively within a team and across multiple agencies, focusing on foreign assistance an...[show_more]
    [last_updated.last_updated_30]
    Physics Subject Matter Expert - AI Evaluation

    Physics Subject Matter Expert - AI Evaluation

    sepal • Arlington, VA, United States
    [job_card.full_time]
    About the Project\nSepal is conducting a qualitative research study to define benchmarks of professional excellence in physics education. We are looking for experienced professionals to contribute t...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Beverage Manager

    Beverage Manager

    Marriott • Washington, DC, United States
    [job_card.full_time]
    K St NW, Washington, District of Columbia, United States, 20006 VIEW ON MAP (https : / / www.C%20923%2016th%20%26%20K%20St%20NW%2C%20Washington%2C%20District%20of%20Columbia%2C%20United%20States%2C%202...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Revenue Optimization Manager : Pricing, Billing & Profitability

    Revenue Optimization Manager : Pricing, Billing & Profitability

    Liberty Personnel Services, Inc. • Washington, DC, United States
    [job_card.full_time]
    A national legal services firm in Washington, DC is seeking a Revenue Optimization Manager to lead the optimization of their revenue cycle. Key responsibilities include monitoring revenue metrics, i...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Agile Project Manager

    Agile Project Manager

    Nationwide IT Services • Fort Belvoir, VA, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Agile Project Manager (2 Positions) 2026-017 Work Location : .Remote with regular meetings at Ft.Belvoir, VA Clearance Requirements : . IT-II Non-Critical Sensitive / Tier 3 (T3) required Position Overv...[show_more]
    [last_updated.last_updated_variable_days]
    Digital Test and Evaluation Practitioner

    Digital Test and Evaluation Practitioner

    Booz Allen Hamilton • Arlington, VA, United States
    [job_card.full_time] +1
    Digital Test and Evaluation Practitioner.Create, integrate, and apply interdisciplinary digital models of products from concept throughout the product lifecycle. Apply advanced consulting skills or ...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Beverage Manager

    Beverage Manager

    Gecko Hospitality • Washington, DC, United States
    [job_card.full_time]
    Beverage Manager - Iconic High-End Restaurant, Washington, D.Are you a seasoned Beverage Manager ready to lead in one of Washington, D. This is your opportunity to craft unforgettable guest experien...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    25-6034 : Customer Engagement Manager - DC Metro

    25-6034 : Customer Engagement Manager - DC Metro

    Navitas • Washington, DC, US
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Customer Engagement Manager Job ID : .Clearance : Minimum Secret clearance with ability to obtain TS / SCI Location : DC Metro Who We Are : Since our inception back in 2006, Navitas has grown to be an in...[show_more]
    [last_updated.last_updated_30]
    Manager, Engagement

    Manager, Engagement

    JBG SMITH • Bethesda, Maryland, United States
    [job_card.full_time]
    At JBG SMITH, our team of caring, enthusiastic professionals is dedicated to delivering exceptional customer service.We prioritize creating tailored experiences for our communities, ensuring that t...[show_more]
    [last_updated.last_updated_variable_days] • [promoted]
    Transcript Evaluator II (Hybrid)

    Transcript Evaluator II (Hybrid)

    Henry M. Jackson Foundation for the Advancement of Military Medicine • Bethesda, Maryland, United States
    [job_card.full_time]
    [filters_job_card.quick_apply]
    Jackson Foundation for the Advancement of Military Medicine (HJF) is a nonprofit organization dedicated to advancing military medicine. We serve military, medical, academic and government clients by...[show_more]
    [last_updated.last_updated_30]