Job Description
Job Description
Position Summary
We are seeking a proactive and results-oriented Reliability Engineer to join our Server R&D team. In this role, you will focus on ensuring the long-term reliability and performance of server and rack systems through rigorous validation, failure analysis, and cross-functional collaboration. The ideal candidate is experienced in reliability test methodologies and passionate about improving product quality from early development to mass production.
Key Responsibilities
- Plan and execute reliability validation strategies across system levels (Server / Storage / Rack), including thermal cycling, vibration, drop, operational stress, and HALT / HASS.
- Lead design reviews with EE, ME, Power, Thermal, and BIOS teams to identify and mitigate potential reliability risks early in the development cycle.
- Conduct system-level integration testing to validate hardware / software compatibility, stability, feature completeness, and long-term operational reliability.
- Perform root cause analysis (RCA) and corrective actions for reliability and quality issues; provide design feedback to improve future iterations.
- Define and maintain reliability test specifications aligned with industry standards (e.g., Telcordia, GR-63, JESD22, MIL-STD-810).
- Create and maintain test plans, procedures, and technical reports; present findings to internal stakeholders and external customers.
- Collaborate closely with global R&D centers (e.g., Taipei, Tianjin) and support customer audits and reviews as needed.
- Operate and calibrate reliability test equipment; ensure lab safety, equipment integrity, and data traceability.
Qualifications
Bachelor’s or Master’s degree in Mechanical, Electrical, or Industrial Engineering, or a related technical field.Minimum of 2 years of experience in server, PC, notebook, or data center product testing or reliability engineering.Proficiency with Windows and Linux OS installation and basic command-line operations.Hands-on experience operating environmental and reliability testing equipment (e.g., thermal chambers, vibration / shock testers, power cycling systems).Strong analytical and problem-solving skills; familiarity with FA / RCA tools and debugging methods.Excellent verbal and written communication skills in English; Mandarin proficiency is a plus.Experience with external testing labs, certification processes, and documentation best practices is preferred.Preferred Experience
Knowledge of data center environmental standards and server architecture.Experience working with global customers and supporting OEM / ODM projects.Familiarity with failure analysis and working closely with design teams for corrective actions.Powered by JazzHR
IgiC1T4YCX