We are looking for senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of Deep Learning training, inference and NVIDIA AI Services. Work with world class software engineers to implement blazingly fast SOTA deep learning models that help understanding the end-to-end performance of NVIDIAs DL software and hardware stack. We are working across all layers of the hardware/software stack, from GPU architecture to Deep Learning Framework, to achieve peak performance. * Implement deep learning models from multiple data domains (CV, NLP/LLMs, ASR, TTS, RecSys and others) in mu
more