Location: New York, 10261, US
Job Duties and Scope:
- Deploy and scale ML models in a cloud environment.
- Design fault-tolerant, highly available systems for machine learning.
- Optimize system performance and debug production issues.
- Provide software design and architecture for scalable ML systems.
Required Skills:
- Proficiency in Python and Kubernetes.
- Expertise in designing and managing distributed systems.
- Strong understanding of low-level operating system concepts.
Required Experience:
- 5+ years of experience deploying and scaling ML models.
- Experience with Infrastructure as Code (IaC) and cloud environments.
- Background in Computer Science/Engineering, Statistics, or Mathematics.