Location: San Mateo, CA, USA
Job Summary:
Job Duties:
- Define and own the roadmap for machine learning platforms/tools.
- Develop distributed training infrastructure for LLMs.
- Optimize inference for LLMs.
- Align LLMs using RLHF.
- Collaborate with engineering teams to enhance LLM capabilities.
- Support team members in maintaining technical quality.
Required Skills (Keywords):
- Machine Learning
- Deep Learning
- Distributed Systems
- Python
- scikit-learn
- XGBoost
- PyTorch
- TensorFlow
- Collaboration
Required Experiences (Topics):
- 8+ years in machine learning infrastructure/support.
- Modeling in pre-training and fine-tuning.
- Experience with machine learning systems/platforms.
- Collaboration with data scientists and business analysts.
Job URLs: