A company is looking for an AI Runtime Engineer to develop and optimize the execution stack for next-generation AI accelerators.
Key Responsibilities
Develop and optimize the AI runtime software stack for executing deep learning workloads on AI accelerators
Implement task scheduling, memory management, and kernel execution strategies for efficient computation
Design and implement high-performance APIs for AI Inference frameworks and ensure scalability and reliability of the AI runtime
Required Qualifications
Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field
3+ years of experience in developing low-level runtime software for AI accelerators, GPUs, or HPC systems
Strong proficiency in C / C++ and low-level systems programming
Deep understanding of task scheduling, concurrency, and memory hierarchy
Familiarity with deep learning execution frameworks and experience with low-latency, high-throughput workload execution for AI models
Ai Engineer • Omaha, Nebraska, United States