Location: San Francisco, CA, 94102, US
Job Summary:
Job Duties:
- Analyze, design, and implement optimized GPU algorithms for data analytics and machine learning.
- Architect and implement distributed GPU algorithms for single-node and multi-node environments.
- Enhance RAPIDS integration into high-level frameworks.
- Conduct performance analysis, benchmarking, and troubleshooting of libraries.
- Collaborate with a multi-functional team to implement or improve solutions.
Required Skills:
- Strong analytical problem-solving and mathematics fundamentals.
- Distributed system development experience.
- Excellent software development skills, including programming, debugging, and performance analysis.
- Good communication and documentation habits.
- Ability to work independently.
Required Experiences:
- MS or PhD in Computer Science or related field.
- 5+ years of experience in Computer Science, AI, Applied Math, or related fields.
- Experience developing distributed algorithms and systems (HPC, Cloud).
- Familiarity with the PyData Stack (NumPy, Pandas, etc.) is a plus.
- Prior work on open-source projects is advantageous.
Job URLs: