TL;DR
Associate Staff Engineer (Devops): Maintaining and managing Kubernetes clusters, AWS/Azure environments, and GPU infrastructure for high-performance workloads with an accent on AI/ML workloads, infrastructure security best practices, cost optimization, and performance tuning. Focus on troubleshooting production issues and implementing proactive measures to prevent downtime, Continuously improve deployment processes and infrastructure reliability through automation and best practices.
Company
hirify.global is a Digital Product Engineering company that builds products, services, and experiences.
What you will do
- Maintain and manage Kubernetes clusters, AWS/Azure environments, and GPU infrastructure for high-performance workloads.
- Design and implement CI/CD pipelines for seamless deployments and faster release cycles.
- Set up and maintain monitoring and logging systems using Prometheus and Grafana to ensure system health and reliability.
- Collaborate with ML engineering teams to optimize inference performance and resource utilization.
- Automate infrastructure provisioning and configuration using Terraform and other IaC tools.
- Drive cost optimization initiatives for cloud resources and GPU utilization.
Requirements
- Experience: 5+ years in DevOps or Site Reliability Engineering (SRE) roles.
- Strong knowledge of Docker, Kubernetes, Terraform, and CI/CD pipelines.
- Hands-on experience with AWS, Azure, or other cloud platforms.
- Good understanding of monitoring and logging systems (Prometheus, Grafana).
- Ability to collaborate with ML teams for optimized inference and deployment.
- Strong troubleshooting and problem-solving skills in high-scale environments.
Nice to have
- Familiarity with GPU infrastructure and ML workloads is a plus.
- Knowledge of infrastructure security best practices, cost optimization, and performance tuning.
- Exposure to vector databases and AI/ML deployment pipelines is highly desirable.
Culture & Benefits
- Dynamic and non-hierarchical work culture.
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →