Talent.com

Performance engineer [h1.location_city]

[job_alerts.create_a_job]

Performance engineer • berkeley ca

[last_updated.last_updated_variable_days]
Software Engineer (AI Performance)

Software Engineer (AI Performance)

Gimlet Labs, IncSan Francisco, CA, United States
[job_card.full_time]
Gimlet Labs is building the foundation for the next generation of AI applications.As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck.Gimlet is redefi...[show_more][last_updated.last_updated_30]
Senior Software Engineer - Compute Performance

Senior Software Engineer - Compute Performance

LambdaSan Francisco, California, United States
[job_card.full_time]
In 2012, Lambda started with a crew of AI engineers publishing research at top machine-learning conferences.We began as an AI company built by AI engineers. Today, we're on a mission to be the world...[show_more][last_updated.last_updated_variable_days]
Founding Resilience Engineer — Performance & Observability

Founding Resilience Engineer — Performance & Observability

PersonaSan Francisco, CA, United States
[job_card.full_time]
A leading identity platform company in San Francisco is seeking a Resilience Engineering specialist to join their new function. This role involves partnering with product teams to solve performance ...[show_more][last_updated.last_updated_variable_days]
Performance ML Engineer : CUDA, GPU Systems

Performance ML Engineer : CUDA, GPU Systems

RelaceSan Francisco, CA, United States
[job_card.full_time]
A tech company specializing in ML infrastructure is seeking a Machine Learning Engineer who excels at making models faster and more efficient through performance tuning and optimization.The ideal c...[show_more][last_updated.last_updated_30]
  • [promoted]
Building Performance Engineer

Building Performance Engineer

Harrison Consulting SolutionsSan Francisco, California, USA
[job_card.full_time]
Leading architectural and engineering firm is adding a.Senior Building Performance Engineer.Lead commissioning and optimization of building systems (HVAC systems air / water distribution systems buil...[show_more][last_updated.last_updated_variable_days]
GPU Systems Engineer : High-Performance C++

GPU Systems Engineer : High-Performance C++

10X Recruiting PartnersSan Francisco, CA, United States
[job_card.full_time]
A technology consulting firm is seeking a highly skilled Software Engineer (C++ Systems) to join their client's team in San Francisco. This role focuses on optimizing GPU virtualization performance ...[show_more][last_updated.last_updated_variable_days]
Senior ML Inference Engineer - PyTorch Performance

Senior ML Inference Engineer - PyTorch Performance

ComfySan Francisco, CA, United States
[job_card.full_time]
A leading AI platform company in San Francisco is seeking a talented individual to optimize model inference for their advanced visual AI product. The ideal candidate will engage in building efficien...[show_more][last_updated.last_updated_variable_days]
GPU Performance Engineer

GPU Performance Engineer

Genmo Inc.San Francisco, CA, United States
[job_card.full_time]
We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the bo...[show_more][last_updated.last_updated_variable_days]
Sr Building Performance Engineer

Sr Building Performance Engineer

HGASan Francisco, CA, US
[job_card.full_time]
Help Shape the Future of Sustainable Building Performance at HGA as a .Senior Building Performance Engineer on the West Coast!. Senior Building Performance Engineer.If you are passionate about optim...[show_more][last_updated.last_updated_30]
Senior HPC Performance Engineer

Senior HPC Performance Engineer

NVIDIARemote, CA, US
[filters.remote]
[job_card.full_time]
As a member of our team in NVIDIA's NVHPC compilers & tools group, you will analyze and run High Performance Computing (HPC) applications on HPC servers and systems to gain insight into the per...[show_more][last_updated.last_updated_30]
Backend Engineer - High-Performance Search Systems

Backend Engineer - High-Performance Search Systems

ExaSan Francisco, CA, United States
[job_card.full_time]
A cutting-edge search engine startup in San Francisco is seeking a backend engineer to contribute to innovative projects involving high performance systems. The ideal candidate has experience with l...[show_more][last_updated.last_updated_variable_days]
Performance Engineer

Performance Engineer

Menlo VenturesSan Francisco, CA, United States
[job_card.full_time]
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...[show_more][last_updated.last_updated_30]
Performance Modelling Engineer

Performance Modelling Engineer

PageBolt WordPressSan Francisco, CA, United States
[job_card.permanent]
We’re searching for a Staff Performance Modelling Engineer to create and own the analytical and simulation models that steer OTPU architecture and software evolution. You will build functional simul...[show_more][last_updated.last_updated_variable_days]
  • [promoted]
HPC / AI Data Performance Engineer

HPC / AI Data Performance Engineer

Lawrence Berkeley National LaboratoryBerkeley, CA, United States
[job_card.full_time] +1
In this exciting role, you will serve as a Data Performance Engineer in NERSC's Application Performance Group, architecting HPC and AI data services that advance fundamental science.You'll optimize...[show_more][last_updated.last_updated_30]
Sr. Software Engineer - Performance

Sr. Software Engineer - Performance

DatabricksSan Francisco, California
[job_card.full_time]
At Databricks, we are passionate about enabling data teams to solve the world's toughest problems.We do this by building and running the world's best data and AI infrastructure platform so our cust...[show_more][last_updated.last_updated_30]
Senior Performance Engineer

Senior Performance Engineer

VirtualVocationsOakland, California, United States
[job_card.full_time]
Key Responsibilities : Participate in design discussions and lead performance test projects across multiple streams Collaborate with Development and QA teams to identify automation and performanc...[show_more][last_updated.last_updated_30]
Performance Engineer SMTS

Performance Engineer SMTS

Salesforce, Inc.San Francisco, CA, United States
[job_card.full_time]
To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job CategorySoftware EngineeringJob Details • • • •Abo...[show_more][last_updated.last_updated_variable_days]
Product Performance Engineer

Product Performance Engineer

OpenAISan Francisco
[job_card.full_time]
We bring OpenAI's technology to the world through products like ChatGPT and the OpenAI API.We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool...[show_more][last_updated.last_updated_30]
Software Engineer (AI Performance)

Software Engineer (AI Performance)

Gimlet Labs, IncSan Francisco, CA, United States
[job_card.30_days_ago]
[job_preview.job_type]
  • [job_card.full_time]
[job_card.job_description]

Gimlet Labs is building the foundation for the next generation of AI applications. As generative AI workloads rapidly scale, inference efficiency is becoming the critical bottleneck. Gimlet is redefining AI inference from the ground up, combining cutting-edge research with an integrated hardware-software stack that delivers breakthrough performance, efficiency, and model quality. Gimlet pairs its inference stack with a seamless developer experience, allowing users to deploy, manage, and monitor AI workloads from frameworks like PyTorch and LangChain at production scale in seconds.

Gimlet is spun out of a Stanford research project under Professors Zain Asgar and Sachin Katti. The founding team has deep experience across AI, distributed systems, and hardware with previous successful exits.

Gimlet Labs is seeking a Software Engineer focused on AI Performance. You will be researching and implementing techniques to drive performance and quality optimizations across the latest AI models. You will implement techniques such as quantization, KV caching, and FlashAttention to enable inference efficiency. You will design parallelism strategies to distribute data and workloads across compute nodes at production scale. You will dive deep into GPU code and kernel optimizations to accelerate AI workloads.

Responsibilities

  • Evaluating and implementing cutting-edge AI research for model performance and efficiency
  • Architecting infrastructure for distributed AI workloads across both the software stack and GPU kernel layers
  • Profiling, benchmarking, and analyzing system performance, identifying bottlenecks and optimization opportunities in execution runtimes targeting various hardware systems

Qualifications

  • Bachelor’s degree in computer science, engineering, applied mathematics or comparable area of study
  • Experience with performance optimization
  • Preferred Qualifications

  • Graduate degree in computer science, engineering, applied mathematics or comparable area of study
  • Familiarity with compilers and compiler frameworks such as MLIR
  • Experience with PyTorch, TensorFlow, vLLM, ONNX and other AI frameworks
  • Software development experience with Python, C++, and CUDA
  • #J-18808-Ljbffr