Location: Palo Alto, CA, USA, 94306, US
Job Summary:
Job Duties:
- Build and manage a comprehensive semiconductor dataset.
- Develop software solutions for data scraping and handling.
- Extract and clean information from diverse data modalities.
- Prepare data for the Machine Learning team.
- Manage customer data transfer and feedback systems.
- Parse documents in various formats.
- Develop software pipelines for data labelers.
- Implement systems for dataset pre-processing for AI training.
Required Skills (Keywords):
- Scalable software solutions
- PDF parsing
- Data extraction
- Software engineering
- Custom data processing libraries
- AI training data preparation
- Cloud data management
Required Experiences (Topics):
- Electrical Engineering background (Optional)
- Machine learning model behavior and data quality
- Fine-tuning large language models
- Experience in hyper-growth startups
- Building systems for training foundation models
Job URLs: