Location: San Francisco, CA, 94105, US
Job Summary:
Job Duties and Scopes:
- Manage high availability of 1000+ clusters using Kubernetes and microservices.
- Contribute code and drive automation in operations using Python/Golang/Puppet/Jenkins.
- Enhance CI/CD pipelines with Terraform, Spinnaker, and Argo.
- Implement monitoring and self-healing mechanisms with Prometheus and Grafana.
- Collaborate with infrastructure teams to evaluate new technologies.
Required Skills:
- Strong experience in Linux Systems Administration and Kubernetes.
- Proficient in scripting/programming languages (Python, GoLang).
- Knowledgeable in networking protocols and observability tools.
Required Experiences:
- 7+ years in SRE/DevOps/Systems Engineering roles.
- Experience with large-scale distributed systems in cloud environments.
- Proficiency with AWS, Terraform, Spinnaker, and other DevOps tools.
Job URLs: