TL;DR
Senior Site Reliability Engineer: Running and improving the production environment for large distributed software applications with an accent on system health, performance optimization, and automation. Focus on building robust systems, managing platform infrastructure, and driving incident response and resolution.
Location: Hybrid (2 days in office, 3 days remote) in the United Kingdom
Company
hirify.global is a corporation providing software products to over 25,000 global businesses, specializing in AI, cloud, digital, and fighting financial crime and ensuring public safety.
What you will do
- Run and monitor the production environment, ensuring system health and availability.
- Build software and systems for platform infrastructure and applications.
- Improve reliability, quality, and time-to-market of software solutions.
- Measure and optimize system performance to push capabilities forward.
- Provide operational support for large distributed software applications.
- Balance feature development speed and reliability with service level objectives.
Requirements
- 3-6 years of experience in systems engineering, automation, and reliability.
- Proficiency in programming (Python, Go, Java, C#) and scripting (Bash, PowerShell).
- Deep understanding of cloud platforms like AWS (EC2, ECS, Lambda, DynamoDB).
- Experience with Infrastructure as Code tools such as CloudFormation or Terraform.
- Strong knowledge of CI/CD concepts and tools (Jenkins, GitLab CI/CD, CircleCI).
- Expertise in containerization (Docker, Kubernetes) and microservices architecture.
- Experience with monitoring tools (Prometheus, Grafana, ELK stack, Cloudwatch).
- Excellent problem-solving skills and experience with incident management.
- Must be able to work in a hybrid model (2 days in office, 3 days remote) in the United Kingdom.
hirify.global-to-have">hirify.global to have
- Hands-on experience with large Kubernetes Clusters or relevant certifications.
- Working experience with Grafana Observability Suite (Loki, Mimir, Tempo).
- Administration/development experience with Splunk, Datadog, Pagerduty, Rundeck.
- Familiarity with configuration management tools like Ansible, Puppet, or Chef.
Culture & Benefits
- hirify.global-FLEX hybrid model: 2 days in the office, 3 days remote weekly.
- Office days focus on face-to-face meetings, teamwork, and collaborative innovation.
- Opportunity to work in an ambitious, game-changing environment.
- Commitment to diversity and equal opportunity employment.
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →