TL;DR
Site Reliability Engineer II (DevOps): Building and operating foundational infrastructure to power hirify.global's real-time digital operations platform with an accent on network, compute, and ingress infrastructure. Focus on scaling and hardening existing systems to improve the reliability, scalability, and security of services.
Location: Must come into our Toronto office 2 days per week. Candidates must reside in an eligible location within Canada. Cannot employ candidates residing in: Alberta, Manitoba, Newfoundland, Northwest Territories, Nunavut, PEI, Quebec, Saskatchewan, Yukon
Salary: 115,000 - 165,000 CAD
Company
hirify.global is a global leader in digital operations management, providing a platform that empowers business resilience and drives operational efficiency for enterprises.
What you will do
- Support and improve foundational infrastructure, including networking, compute platforms, Kubernetes clusters, and ingress/traffic management systems.
- Contribute to the reliability and scalability of hirify.global's core platform by hardening existing systems and supporting the rollout of new infrastructure capabilities.
- Participate in agile rituals (standups, planning, retros) and communicate progress/risks early.
- Monitor system health using metrics, logs, and alerts, and participate in 24/7 on-call rotations to help detect, respond to, and resolve incidents.
Requirements
- 3+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles.
- Hands-on experience operating Linux-based systems in production environments.
- Working knowledge of networking fundamentals, such as load balancing, DNS, TLS, and ingress traffic flow.
- Experience with container orchestration (e.g., EKS, Kubernetes).
- Experience working on cloud-native infrastructure (e.g., AWS, GCP, Azure), including networking and compute concepts.
- Proficiency in at least one programming language (e.g., Python, Ruby, Go, etc.).
- Experience with Infrastructure as Code (e.g., Terraform, CloudFormation).
Nice to have
- Experience with AWS cloud networking concepts such as VPCs, subnets, routing, security groups, and load balancers.
- Experience operating or contributing to production Kubernetes platforms (e.g., EKS), including cluster upgrades, networking, or ingress configuration.
- Experience with monitoring, observability, and logging platforms (e.g., DataDog, New Relic, SumoLogic, Splunk, Prometheus, Grafana).
- Familiarity with service meshes, ingress controllers, or API gateways (e.g., Envoy, Istio, NGINX).
Culture & Benefits
- Flexible, hybrid workplace with in-person working as an integral part of the culture.
- Competitive salary and comprehensive benefits package.
- Generous paid vacation time, paid holidays and sick leave.
- Company-wide hack weeks and mental wellness programs.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →