Senior Cloud Operations Engineer (PyTorch)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Senior Cloud Operations Engineer (AWS/Terraform): Managing and optimizing cloud infrastructure for the PyTorch project with an accent on multi-cloud environments, CI/CD pipeline automation, and scalability. Focus on implementing Infrastructure-as-Code, enhancing observability via Datadog, and collaborating with the open-source community to maintain a robust AI framework environment.
Location: Remote (USA). Candidates must be authorized to work in their country of residence without employer sponsorship.
Salary: $95,000 - $133,000 USD
Company
The is a driving force in fostering open source collaboration and supporting communities across a range of projects, including PyTorch.
What you will do
- Manage multi-cloud environments focusing on AWS services including EKS, EC2, S3, IAM, and ELB.
- Implement and maintain infrastructure-as-code using Terraform.
- Design and maintain CI/CD pipelines using GitHub Actions and ARC.
- Develop comprehensive monitoring and observability solutions using Datadog and AWS CloudWatch.
- Manage and optimize Cloudflare CDN deployments for project assets.
- Collaborate with external contributors and promote DevOps best practices within the open-source community.
Requirements
- 7+ years of experience in cloud operations with significant AWS expertise.
- Strong proficiency in Terraform, Docker, and Kubernetes.
- Experience with scripting languages such as Python, TypeScript, and Bash.
- Expertise in implementing monitoring solutions, specifically Datadog and AWS CloudWatch.
- Must be authorized to work in the USA without employer sponsorship.
- Bachelor's degree in Computer Science, Engineering, or a related field.
Nice to have
- Experience with PyTorch or other open source communities.
- Multi-cloud expertise spanning AWS, GCP, and Azure.
- Knowledge of FinOps principles and cloud cost optimization strategies.
- Previous contributions to open source projects in infrastructure management roles.
Culture & Benefits
- Predominantly remote workforce with a flexible and supportive work culture.
- Opportunity to work at the heart of open source collaboration.
- Environment that values autonomy and avoids the constraints of traditional office spaces.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →