Location: United States
Job Summary:
Job Duties
- Design, develop, and optimize high-performance AI kernels/operators for GPUs
- Achieve state-of-the-art performance using GPU software and micro-architectural features
- Collaborate with compiler, framework, runtime, and serving teams for end-to-end GPU performance
- Work with ML researchers to guide future ML system development
Required Skills (Keywords)
- Computer architecture
- GPU programming (CUDA, OpenCL)
- Performance optimization
- Problem-solving
- Team collaboration
Required Experiences (Topics)
- 3+ years in complex software systems
- Analysis and profiling of AI workloads
- Independent and self-motivated work ethic
Job URLs: