Artificial Intelligence Engineer (Redwood City)

The Mice Groups, Inc. • Redwood City, CA, US

AI Engineer, Evaluation and Reliability / Contract-to-Hire or Direct Hire / Redwood City / Hybrid, onsite 3 days per week / Pay: $70-80 / hr. on W2 for contract, $140-190K annually upon conversion / US Citizens and Green Card holders only

Summary :

Our client is looking for a Senior Engineer, AI Evaluation & Reliability to lead the design and execution of evaluation, quality assurance, and release gating for our agentic AI features.

You'll develop the pipelines, datasets, and dashboards that measure and improve agent performance across real-world SOC workflows, ensuring every release is safe, reliable, efficient, and production-ready.

You will ensure that our agentic AI features operate at full production scale, ingesting and acting on millions of SOC alerts per day, with measurable impact on analyst productivity and risk mitigation. This role partners closely with the Product team to deliver operational excellence and trust in every AI-driven capability.

Responsibilities :

Define quality metrics : Translate SOC use cases into measurable KPIs (e.g., precision / recall, MTTR, false-positive rate, step success, latency / cost budgets).
Build continuous evaluations : Develop offline / online evaluation pipelines, regression suites, and A / B or canary tests; integrate them into CI / CD for release gating.
Curate and manage datasets : Maintain gold-standard datasets and red-team scenarios; establish data governance and drift monitoring practices.
Ensure safety, reliability, and explainability : Partner with Platform and Security Research to encode guardrails, policy enforcement, and runtime safety checks.
Expand adversarial test coverage (prompt injection, data exfiltration, abuse scenarios).
Ensure explainability and auditability of agent decisions, maintaining traceability and compliance of AI-driven workflows.
Production reliability & observability : Monitor and maintain reliability of agentic AI features post-release; define and uphold SLIs / SLOs, establish alerting and rollback strategies, and conduct incident post-mortems.
Design and implement infrastructure to scale evaluation and production pipelines for real-time SOC workflows across cloud environments.
Drive agentic system engineering : Experiment with multi-agent systems, tool-using language models, retrieval-augmented workflows, and prompt orchestration.
Manage the model and prompt lifecycle : Track versions, rollout strategies, and fallbacks; measure impact through statistically sound experiments.
Collaborate cross-functionally : Work with Product, UX and Engineering to prioritize high-leverage improvements, resolve regressions quickly, and advance overall system reliability.

Required Skills :

6+ years building evaluation or testing infrastructure for ML / LLM systems or large-scale distributed systems.

Proven ability to translate product requirements into measurable metrics and test plans.

Strong Python skills.

Strong experience with modern data tooling.

Hands-on experience running A / B tests, canaries, or experiment frameworks.

Experience defining and maintaining operational reliability metrics (SLIs / SLOs) for AI-driven systems.

Familiarity with large-scale distributed or streaming systems serving AI / agent workflows (millions of events or alerts / day).

Excellent communication skills; able to clearly convey technical results and trade-offs to engineers, PMs, and analysts.

Pay for this position is based on market location and may vary depending on job-related knowledge, skills, and experience. As a contractor, you may also be eligible for benefits such as health, dental, and vision coverage, as well as access to a 401K plan.

A sign-on payment and restricted stock units may be provided as part of the compensation package, in addition to a full range of medical, financial, and / or other benefits, dependent on the position offered by our client.

Applicants should apply via The Mice Groups Inc. website (www.micegroups.com) or through this careers site posting.

We are an equal opportunity employer and value diversity at The Mice Groups Inc. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Pursuant to the Los Angeles Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

The Mice Groups Inc. values your privacy. Please consult our Candidate Privacy Notice for information about how we collect, use, and disclose personal information of our candidates.

Privacy Policy

One of the basic principles The Mice Groups follows in designing and operating this website is that we ask for only the information we need to provide the service you've requested.

The Mice Groups does not currently collect personal identifying information via its website except (i) to the extent that you provide this information in an online job application and (ii) to the extent that your web browser provides personal identifying information.

The Mice Groups will use your personally identifying information solely for the purpose for which you submitted the information. The Mice Groups may, however, aggregate certain elements of your personal identifying information with the information of other users of our website to analyze the usefulness and popularity of various web pages on its website.

The Mice Groups reserves the right to change this policy at any time by posting a new privacy policy at this location. Questions regarding this statement should be directed to info@micegroups.com
