A company is looking for a Senior Deep Learning Software Engineer, TensorRT Performance.
Key Responsibilities
Establish performance benchmarking methodologies and identify performance issues in NVIDIA's inference ecosystem
Contribute features and code to OSS inference frameworks including TensorRT and Torch-TensorRT
Develop optimized model pipelines for NVIDIA's inference ecosystem, focusing on areas like quantization and distributed inference
Required Qualifications
Bachelor's, Master's, or PhD degree, or equivalent experience, in Computer Science, Computer Engineering, EECS, or AI
At least 3 years of relevant software development experience
Strong programming skills in C++ and Python
Experience with deep learning frameworks such as PyTorch and TensorFlow
Experience in performance analysis and optimization
Senior Deep Learning Engineer • Kansas City, Missouri, United States