Talent.com

Failure analysis engineer Jobs in Austin, TX

Last updated: 3 days ago

Failure Analysis Engineer

Advanced Micro Devices, Inc
Austin, Texas, United States
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING. At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded syst...

Platform HW Competitive Analysis Engineer

Sunrise Systems
Austin, Texas, United States
Full-time

Platform HW Competitive Analysis Engineer. Analyze CPU/APU/GPU printed circuit boards (PCBs) from various vendors and systems, including notebooks, PC motherboards, and GPU cards. Identify electrical...

Pediatric Cardiac Heart Failure & Transplant Advanced Practice Provider

UT Austin
Austin, US
Full-time

In collaboration with supervising physicians, provides for the expansion of individualized healthcare services by diagnosing and treating pediatric and congenital heart conditions. Counsels and educ...

Platform HW Competitive Analysis Engineer

TPI Global (formerly Tech Providers, Inc.)
Austin, TX, US
Temporary

Role: Platform HW Competitive Analysis Engineer. Location: Austin, TX (onsite/hybrid). Analyze CPU/APU/GPU printed circuit boards (PCBs) from various vendors and systems, including notebooks, PC moth...

HW Competitive Analysis Engr.

Staffing Technologies
Austin, TX
Full-time

Platform HW Competitive Analysis Engineer. Job Duties: 1. Board Analysis: Analyze CPU/APU/GPU printed circuit boards (PCBs) from various vendors and systems, including notebooks, PC motherboards, and ...

Financial Planning & Analysis Specialist

e-MDs
Austin, TX, United States
Full-time

Financial Planning & Analysis Specialist. Manager will lead the business planning process across all lines of business. Also responsible for preparing and supervising the preparation of the financial...

Business Analysis Manager - Industrial EHS / ORM Domain Expert

Wolters Kluwer
Southwest Pkwy, Austin, TX, USA
Full-time

Wolters Kluwer is a global leader in professional information services, combining deep domain expertise with specialized technology to help professionals make confident decisions. Our Enablon platfo...

Failure Analysis Engineer (Austin Site)

Foxconn Industrial Internet - FII
Austin, TX, US
Full-time

Required Skillsets. Technical Skills. Engineering Principles: Strong understanding of mechanical, electrical, and materials engineering principles. Failure Analysis Techniques: Proficiency in techniqu...

Pediatric Heart Failure & Transplant Cardiologist

The University of Texas at Austin
Austin, TX, US
Full-time

As part of The University of Texas at Austin, one of the nation’s leading research universities, the Dell Medical School pursues innovation in the redesign of healthcare delivery, excellence in hea...

Director, Strategic Planning & Analysis

Armanino
Austin, Texas
Full-time

At Armanino, you determine your career path. This means it's possible to pursue challenges you are passionate about, in industries you care about. Among the top 20 Largest Firms in the Nation. We have...

Platform HW Competitive Analysis Engineer

TPI Global Solutions
Austin, TX, US
Temporary

Role: Platform HW Competitive Analysis Engineer. 12 Months Contract. Location: Austin, TX (onsite/hybrid). Job Duties: 1. Board Analysis: Analyze CPU/APU/GPU printed circuit boards (PCBs) from variou...

Platform HW Competitive Analysis Engineer

TekWissen LLC
Austin, TX, United States
Full-time

Overview: TekWissen is a global workforce management provider headquartered in Ann Arbor, Michigan that offers strategic talent solutions to our clients world-wide. This Client is an American multin...

Financial Planning & Analysis Manager

Cloud Imperium Games
Austin, TX, United States
Full-time

Financial Planning & Analysis Manager. We are a crowdfunded company and have a dedicated and enthusiastic community of backers who are helping us create the "Best Damn Space Sim Ever". We want to bui...

Pediatric Cardiac Heart Failure & Transplant Advanced Practice Provider

HealthEcareers - Client
Austin, TX, USA
Full-time

In collaboration with supervising physicians, provides for the expansion of individualized healthcare services by diagnosing and treating pediatric and congenital heart conditions. Counsels and educ...

Work-at-Home Data Analysis Associate

FocusGroupPanel
Austin, Texas, United States
Remote
Full-time +1

We appreciate you checking us out! Work At Home Data Entry Research Panelist Jobs - Part Time, Full Time. This work-from-home position is ideal for anyone with a diverse professional background, inc...

Failure Analysis Engineer

AMD
Austin, TX, US
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING. At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded syst...

Remote Financial Planning & Analysis Manager - AI Trainer ($50-$60 per hour)

Data Annotation
Austin, Texas
Remote
Full-time +1

DataAnnotation is committed to creating high-quality AI. Join our team to help train the next generation of AI while enjoying the flexibility of remote work and the freedom to set your own schedule....

Financial Planning and Analysis Director

K&L Gates
Austin
Full-time

At K&L Gates, we are looking for smart, imaginative and hard-working people with diverse backgrounds, experiences and ideas to join us. Perhaps our search for talented visionaries and your search fo...

Sr. Financial Planning & Analysis (FP&A) Analyst

Made In Cookware
Austin, TX, United States
Full-time

Financial Planning & Analysis (FP&A) Analyst. Made In is the leader in the digitally-native kitchen space. We bring Chef expertise and centuries-old cookware manufacturing techniques to craft profess...

Failure Analysis Engineer

Advanced Micro Devices, Inc
Austin, Texas, United States
30+ days ago
Job type
  • Full-time
Job description


WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.




THE ROLE:

As a Failure Analysis Engineer, you will play a critical role in diagnosing, isolating, and resolving complex failures across GPU-accelerated server platforms deployed in rack-level and data center environments. This is a highly technical, hands-on role focused on server bring-up, system-level debug, and rack integration troubleshooting involving CPU, GPU, memory, PCIe, networking, power delivery, and thermal subsystems.

You will leverage advanced electrical, firmware, and platform-level diagnostic tools to uncover root causes of failures impacting system stability, performance, and reliability in multi-node server environments. You will collaborate closely with platform design, firmware, validation, manufacturing, and quality teams to improve product robustness, accelerate debug cycles, and drive corrective actions across deployed infrastructure.

This role requires deep experience with server platforms — not just component-level FA — including rack-level interactions, BIOS/BMC behavior, and data center operational environments.

THE PERSON:

The ideal candidate is analytical, detail-oriented, and thrives in fast-paced, high-visibility debug environments. You are comfortable owning complex investigations involving hardware, firmware, power sequencing, and system interoperability across multi-node GPU servers.

You are experienced in server bring-up and rack-level troubleshooting and can clearly translate technical findings into structured RCA/FMEA documentation and actionable design improvements. You are equally comfortable working independently in lab environments and cross-functionally with design, firmware, validation, and manufacturing teams.

KEY RESPONSIBILITIES:

  • Perform component-, server-, and rack-level failure analysis on GPU-accelerated server platforms.
  • Debug and isolate complex platform issues involving CPU, GPU, memory, PCIe, networking, storage, and peripheral subsystems.
  • Troubleshoot server bring-up failures including POST issues, BIOS/UEFI misconfiguration, PCIe enumeration failures, and firmware interaction problems.
  • Analyze BIOS, BMC, IPMI, and system logs to identify hardware/firmware interaction issues impacting system stability.
  • Diagnose rack-level failures including power distribution issues, thermal interactions, multi-node communication problems, and system integration defects.
  • Utilize advanced electrical and system-level test equipment including oscilloscopes, logic analyzers, protocol analyzers, and power integrity measurement tools.
  • Replicate and isolate field failures within lab environments to accelerate root cause identification.
  • Investigate system-level failures such as boot instability, performance degradation, thermal throttling, GPU errors, and interoperability issues.
  • Create formal, concise RCA/FMEA reports detailing failure reproduction steps, data analysis, root causes, and corrective actions.
  • Drive corrective actions with design, firmware, validation, and manufacturing teams to improve Design for Reliability (DfR), Design for Testability (DfT), and Design for Serviceability (DfS).
  • Develop and maintain debug guides, SOPs, and knowledge bases to accelerate future troubleshooting efforts.

PREFERRED EXPERIENCE:

  • Hands-on experience debugging server platforms in data center or rack-level environments.
  • Strong background in GPU server systems and multi-node infrastructure.
  • Experience troubleshooting power sequencing, PCIe subsystems, memory training issues, and thermal management in server platforms.
  • Familiarity with BIOS/UEFI, BMC, IPMI, and firmware-level debugging.
  • Proficiency using oscilloscopes, logic analyzers, protocol analyzers, and power analyzers for system-level debug.
  • Experience collaborating cross-functionally to drive systemic corrective actions and reliability improvements.
  • Strong technical documentation and communication skills.

ACADEMIC CREDENTIALS:

Bachelor’s Degree in Electrical Engineering required; Master’s Degree in Electrical or Systems Engineering preferred.

LOCATION: Austin, TX (Onsite)

This role is not eligible for visa sponsorship.




Benefits offered are described in "AMD benefits at a glance."

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

This posting is for an existing vacancy.
