Performance EngineerEtched • San Jose, CA, US

Performance Engineer

Etched • San Jose, CA, US

[job_card.variable_days_ago]

[job_preview.job_type]

[job_card.full_time]

[job_card.job_description]

Job Description

About Etched

Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.

Key responsibilities

Develop comprehensive performance models and projections for Sohu's transformer-specific architecture across varying workloads and configurations

Profile and analyze deep learning workloads on Sohu to identify micro-architectural bottlenecks and optimization opportunities

Build analytical and simulation-based models to predict performance under different architectural configurations and design trade-offs

Collaborate with hardware architects to inform micro-architectural decisions based on workload characteristics and performance analysis

Drive hardware / software co-optimization by identifying opportunities where architectural features can unlock significant performance improvements

Characterize and optimize memory hierarchy performance, interconnect utilization, and compute resource efficiency

Develop performance benchmarking frameworks and methodologies specific to transformer inference workloads

Key Responsibilities

Build detailed roofline models and performance projections for Sohu across diverse transformer architectures (Llama, Mixtral, etc.)

Profile production inference workloads to identify and eliminate micro-architectural bottlenecks

Analyze memory bandwidth, compute utilization, and interconnect performance to guide next-generation architecture decisions

Develop performance modeling tools that predict chip behavior across different batch sizes, sequence lengths, and model configurations

Characterize the performance impact of architectural features like specialized datapaths, memory hierarchies, and on-chip interconnects

Compare Sohu's architectural efficiency against conventional GPU architectures through detailed bottleneck analysis

Inform hardware design decisions for future generations (Caelius and beyond) based on workload analysis and performance projections

You may be a good fit if you have

Deep expertise in computer architecture and micro-architecture, particularly for accelerators or domain-specific architectures

Strong performance modeling and analysis skills with experience building analytical or simulation-based performance models

Experience profiling and optimizing deep learning workloads on hardware accelerators (GPUs, TPUs, ASICs, FPGAs)

Strong understanding of hardware / software co-design principles and cross-layer optimization

Solid foundation in digital circuit design and how micro-architectural decisions impact performance

Experience with reconfigurable or heterogeneous architectures

Ability to reason quantitatively about performance bottlenecks across the full stack from circuits to workloads

Strong candidates may also have

PhD or equivalent research experience in Computer Architecture or related fields

Experience with ASIC, FPGA, or CGRA-based accelerator development

Published research in computer architecture, ML systems, or hardware acceleration

Deep knowledge of GPU architectures and CUDA programming model

Experience with architecture simulators and performance modeling tools (gem5, trace-driven simulators, custom models)

Track record of informing architectural decisions through rigorous performance analysis

Familiarity with transformer model architectures and inference serving optimizations

Benefits

Medical, dental, and vision packages with generous premium coverage

$500 per month credit for waiving medical benefits

Housing subsidy of $2k per month for those living within walking distance of the office

Relocation support for those moving to San Jose (Santana Row)

Various wellness benefits covering fitness, mental health, and more

Daily lunch + dinner in our office

How we’re different

Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

We are a fully in-person team in San Jose (Santana Row), and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

Compensation Range : $175K - $275K

[job_alerts.create_a_job]

Performance Engineer • San Jose, CA, US

[internal_linking.similar_jobs]

Principal Customer Applications Engineer

CyberCoders • San Jose, CA, US

[job_card.full_time]

Principal Customer Applications Engineer.Colorado Springs, CO or San Jose, CA.Must have customer facing experience from technical project management OR customer applications engineering in the semi...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior / Staff Fullstack Engineer

Booster • Mountain View, CA, United States

[job_card.full_time]

Gatik, the leader in autonomous middle-mile logistics, is revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution and prioritizing safe, consistent del...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Full Stack Engineer

Velocity Tech • Santa Clara, CA, United States

[job_card.full_time]

Full Stack Engineer (AI, Real-Time Systems, Product-Led Growth).We’re an AI-first startup replacing traditional ad agencies with autonomous systems that optimize spend in real time.LLMs for natural...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Springhill Suites Fremont - Engineer Manager OEM

Aimbridge Hospitality • Fremont, California, United States, 94536

[job_card.full_time]

Springhill Suites Fremont - Engineer Manager OEM.The Engineer Manager is responsible for ensuring proper operations, maintenance, service and repair of all equipment, while supporting the Aimbridge...[show_more]

[last_updated.last_updated_variable_days]

Principal Performance & PNP Modeling Architect

Arm Limited • San Jose, CA, United States

[job_card.full_time]

We are seeking highly skilled and motivated System-on-Chip (SoC) Performance and Power modeling (PnP) Architects to join our diverse team at Arm! Our team focuses on PnP Analysis of Arm SoCs / SoPs (...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Machine Learning GPU Performance Engineer

Google • Mountain View, CA, United States

[job_card.full_time]

Machine Learning GPU Performance Engineer.Google’s software engineers are at the forefront of developing next-generation technologies that impact billions of users globally.We go beyond web search,...[show_more]

[last_updated.last_updated_30] • [promoted]

Performance Engineer (Lidar Algorithms) - Mountain View, CA

Aeva Inc. • Mountain View, CA, United States

[job_card.full_time]

Lidar Algorithms SW engineer (Optimization)_ Mountain View, CA.Aeva’s mission is to bring the next wave of perception to a broad range of applications from automated driving to industrial robotics,...[show_more]

[last_updated.last_updated_30] • [promoted]

Performance Engineer (Lidar Algorithms) - Mountain View, CA

Aeva • Mountain View, CA, United States

[job_card.full_time]

Performance Engineer (Lidar Algorithms) - Mountain View, CA.Be among the first 25 applicants.This range is provided by Aeva. Your actual pay will be based on your skills and experience — talk with y...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior Principal Engineer- Flow Innovation and Efficiency

Cadence Design Systems, Inc. • San Jose, CA, United States

[job_card.full_time]

At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology.The Cadence Artisan Foundation IP group develops industry-leading Foundation IP to enabl...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Vehicle Performance Engineer

Archer • San Jose, CA, United States

[job_card.full_time]

Archer is an aerospace company based in San Jose, California building an all-electric vertical takeoff and landing aircraft with a mission to advance the benefits of sustainable air mobility.We are...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior GPU Performance Engineer — Scale Training

AMD • San Jose, CA, United States

[job_card.full_time]

A leading semiconductor company in San Jose is seeking a Principal / Senior GPU Software Performance Engineer to enhance multi-GPU model training performance. The role involves kernel performance opti...[show_more]

[last_updated.last_updated_30] • [promoted]

Principal Hardware Engineer - Product Performance and Power

NVIDIA Corporation • Santa Clara, CA, United States

[job_card.full_time]

Principal Hardware Engineer - Product Performance and Power page is loaded## Principal Hardware Engineer - Product Performance and Powerlocations : US, CA, Santa Claratime type : Full timeposted ...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior Fullstack Engineer, PatientKeeper

Commure • Mountain View, CA, United States

[job_card.full_time]

At Commure, our mission is to simplify healthcare.We have bold ambitions to reimagine the healthcare experience, setting a new standard for how care is delivered and experienced across the industry...[show_more]

[last_updated.last_updated_variable_days] • [promoted]

Senior Software Engineer - Fleet Response

Waymo • Mountain View, CA, United States

[job_card.full_time]

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on buildin...[show_more]

[last_updated.last_updated_30] • [promoted]

Performance Engineer (Lidar Algorithms) - Mountain View, CA

Clutch Canada • Mountain View, CA, United States

[job_card.full_time]

Aeva’s mission is to bring the next wave of perception to a broad range of applications from automated driving to industrial robotics, consumer electronics, consumer health, security, and beyond.Ae...[show_more]

[last_updated.last_updated_30] • [promoted]

Senior Fullstack Engineer, Agents

Monograph • Mountain View, CA, United States

[job_card.full_time]

[last_updated.last_updated_30] • [promoted]

Director, System Performance & Reliability Analytics

Bloom Energy • San Jose, CA, United States

[job_card.full_time]

Director, System Performance & Reliability Analytics page is loaded## Director, System Performance & Reliability Analyticslocations : San Jose, Californiatime type : Full timeposted on : Posted ...[show_more]

[last_updated.last_updated_30] • [promoted]

Architecture & Performance Modeling Engineer

Eridu Corporation • Saratoga, California, United States, 95070

[job_card.full_time]

Eridu AI is a Silicon Valley-based hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today’s AI performance is frequently limited...[show_more]

[last_updated.last_updated_variable_days]