Sr. System Engineer/Rack Solution (27694)
Supermicro • San Jose, CA, United States
Job type: Full-time

Job Req ID: 27694

About Supermicro:

Supermicro is a top-tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC, and IoT / Embedded customers worldwide. We are the #5 fastest-growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has given us the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary:

As a Sr. System Engineer, you'll be the go-to person to roll out and maintain business-critical applications and services for Supermicro. You are also responsible for resolving escalated service issues, coaching other engineers toward resolutions, and engineering and implementing complex projects. You work independently, have the leadership to drive technical development, and bring excellent communication skills.

Essential Duties and Responsibilities:

The role includes the following essential duties and responsibilities (other duties may also be assigned):

  • Execute comprehensive system-level rack tests on the latest NVIDIA and AMD GPUs and on ARM-based, Intel Xeon, and AMD EPYC processors, encompassing functionality, compatibility, performance, stress, and reliability testing, leveraging proprietary in-house tools.
  • Establish expertise in HPC / AI applications and benchmarks, delivering impactful training sessions to customers and partners, while addressing complex customer support issues, demonstrating innovative problem-solving skills and building robust processes and procedures for HPC / AI solutions.
  • Conduct proof of concept design and testing, providing optimized benchmarks for HPC / AI applications in a timely manner. Fine-tune BIOS settings, optimize OS / network configurations, and develop diverse simulation configurations to enhance efficiency across various workloads.
  • Deliver on-site deployment services, ensuring customer acceptance verification and providing post-deployment Level 1 and Level 2 support. Create and maintain technical documentation, including technical notes, blogs, and diagrams, to facilitate knowledge dissemination.
  • Identify and document hardware and software quality issues and collaborate with Product Management and other Engineering teams to integrate customer feedback into future product enhancements.
  • Proactively engage in HPC roadmap development, planning software and hardware upgrades to sustain exceptional HPC infrastructure performance.
  • Document and analyze test plans, reports, and logs, and actively contribute to the development of test utilities and automation scripts to streamline testing processes.

Qualifications:

  • BS / MS in Electrical Engineering, Computer Engineering or Computer Science
  • 8+ years of work-related experience in Deep Learning and Machine Learning
  • 8+ years of Linux / networking debugging / testing or relevant experience preferred
  • Experience with leading AI / ML frameworks such as PyTorch, TensorFlow, ONNX, etc.
  • Experience with DevOps or in cloud environments, including but not limited to Docker / Containers and Kubernetes
  • Hands-on experience with workload managers / schedulers (Slurm) for racks / clusters
  • Familiarity with MLPerf Training / Inference benchmarks, LLMs, HPL-AI, or RCCL / NCCL
  • Programming experience with Windows and Linux shell scripting
  • Strong sense of teamwork, a good team player, with strong communication skills
  • Familiarity with Intel / AMD / NVIDIA development toolkits such as CUDA, oneAPI, or ROCm is a plus
  • Experience with server / network hardware debugging and troubleshooting is a plus
  • CCNA, OpenStack, OpenShift, Azure or AWS is a plus
  • Please note that this position requires regular in-office attendance. The successful candidate is expected to be present in the office during standard working hours as determined by the company. In-office collaboration and participation in team meetings, training sessions, and other on-site activities are essential aspects of this role. Candidates should consider the commuting distance and be prepared to fulfill their responsibilities in the designated office location.

    Salary Range

    $137,000 - $156,000

    The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

    EEO Statement

    Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.
