Location: New York, 10261, US
Job Summary:
Job Duties and Scopes:
- Deploy and scale ML models in a cloud environment.
- Design fault-tolerant, highly available systems.
- Optimize system performance and troubleshoot production issues.
- Develop scalable ML infrastructure using Kubernetes.
- Focus on software design and architecture for machine learning applications.
Required Skills:
- Proficiency in Python and Kubernetes.
- Expertise in Infrastructure as Code (IaC).
- Strong understanding of low-level operating systems concepts (multi-threading, memory management, etc.).
Required Experiences:
- 5+ years in ML model deployment and scaling.
- Experience with distributed systems and managing inference at scale is a plus.
- Bachelor's/Master's degree in Computer Science, Statistics, Mathematics, or equivalent.
Job URLs: