TL;DR
SRE/DevOps Engineer (AI Team): Provide production support, troubleshoot incidents, and improve stability for AI production environments with an accent on monitoring, alerting, and CI/CD automation. Focus on understanding complex service architectures, designing automated upgrades, and ensuring robust operations for real-time data streaming and cloud infrastructure.
Location: Hybrid role requiring on-site presence at our office 4 days a week. Specific office location not provided.
Company
hirify.global is a technology company specializing in cloud and AI solutions.
What you will do
- Provide ongoing production support, troubleshooting, and incident resolution for environments.
- Contribute to production stability improvements and implement CI/CD pipelines and automations.
- Manage monitoring, alerting, logging, and capacity for services and products.
- Design and automate software and product upgrades, change, and release management solutions.
- Interact and collaborate with internal customers including Development, QA, Infrastructure, and Security teams.
- Participate in on-call rotations and continuously improve system documentation.
Requirements
- Solid knowledge and strong experience in production support activities and SRE principles/DevOps practices.
- English: Intermediate (B1) proficiency.
- At least 2-3 years of experience as a Linux System Administrator.
- Strong understanding of Kubernetes (K8s) networking, security, and storage.
- Extensive AWS cloud experience (IAM, VPC, R53, AZs, EC2/EKS, RDS, S3, CloudFront, CloudWatch).
- Experience with RDBMS administration (PostgreSQL/AuroraDB or MySQL), GIT, Prometheus, Grafana, and Terraform.
Nice to have
- System thinking approach and real automation experience (Python, Bash, Golang).
- Experience with MS Azure/GCP cloud environments.
- Familiarity with Flux + Kustomize/Flagger/Strimzi + Istio.
- Experience with MongoDB, ELK stack usage, and Nginx.
- Knowledge of the Russian language.
Culture & Benefits
- Work within a well-coordinated professional team.
- Access to cutting-edge technologies, interesting and challenging tasks, and a dynamic project environment.
- Great opportunities for self-realization, professional and career growth.
- Additional Health and Life Insurance Package.
- Employee Assistance Program and 25 vacation days.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →