Site Reliability Engineer/Developer (AWS)
Job description
TL;DR
Site Reliability Engineer/Developer (AWS/Kubernetes): Design and maintain reliable cloud infrastructure for a semiconductor intelligence platform, with an emphasis on automation, scalability, and multi-region AWS deployments. The role focuses on building CI/CD pipelines, implementing observability solutions, and reducing operational toil using Python, Go, or Java.
Location: Remote for candidates based in Canada
Salary: $109,600–116,100 CAD
Company
The company is the information platform for the semiconductor industry, providing deep intelligence and reverse engineering analysis to over 650 companies.
What you will do
- Design and maintain highly available, scalable infrastructure across multi-region AWS deployments.
- Develop and maintain SLOs and SLIs in collaboration with development teams to quantify system reliability.
- Monitor performance and resource utilization using CloudWatch, Datadog, and Prometheus, and conduct root cause analysis for outages.
- Implement infrastructure-as-code solutions using Terraform and GitOps methodologies.
- Build automation tools and scripts in Python, Go, or Java to eliminate manual operational tasks.
- Lead incident response for critical outages and conduct blameless post-mortems to drive preventive measures.
Requirements
- 5–7 years of experience in Site Reliability Engineering, DevOps, or cloud operations.
- Strong expertise in AWS (EC2, ECS/EKS, RDS, S3, Lambda, VPC) and hybrid cloud environments.
- Proficiency in Python, Go, or Java; experience with Docker and Kubernetes.
- Expertise in infrastructure-as-code (Terraform, Ansible, CloudFormation) and CI/CD pipelines.
- Experience with observability tools such as Prometheus, Grafana, and Datadog.
- Must be based in Canada.
Nice to have
- Experience in the semiconductor or technology industry.
- AWS (Solutions Architect, DevOps Engineer) or Kubernetes (CKA, CKAD) certifications.
- Knowledge of security frameworks and compliance requirements (SOC 2, ISO 27001).
- Experience with microservices architecture and distributed systems design.
Culture & Benefits
- Comprehensive benefits package including health, dental, vision, and RRSP matching.
- Company-sponsored training and development opportunities.
- Flexible vacation policy and wellness resources.
- Bring your own device program.
- Inclusive environment prioritizing diversity, equity, and accessibility.