Location: N/A
Job Summary:
Job Duties:
- Manage end-to-end data collection, cleaning, and preprocessing for HTML-based datasets.
- Use web analysis tools such as Selenium and BeautifulSoup to extract data from the DOM.
- Support feature engineering experiments with ML Engineers.
- Generate and augment synthetic datasets using LLMs.
- Analyze data with dimensionality reduction techniques.
- Automate data workflows.
- Maintain documentation for data workflows and processes.
- Create validation systems for data integrity.
Required Skills:
- Python (Pandas, NumPy)
- Web analysis tools (Selenium, BeautifulSoup)
- HTML & DOM familiarity
- NLP techniques
- Data quality assurance
- Cloud platforms (AWS, GCP, Azure)
Required Experience:
- 2+ years as a Data Analyst
- Experience in cybersecurity or ML-focused environments
- Collaboration with technical teams
- Problem-solving skills
- Relevant degree or equivalent experience
Job URLs: