Sr. Production Engineer (Cybersecurity)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Sr. Production Engineer (Cybersecurity): Building and optimizing high-availability, scalable infrastructure across multi-cloud and bare-metal environments with an accent on automation, observability, and reliability. Focus on reducing MTTM, implementing self-healing systems, and leading incident management for a global platform processing billions of transactions.
Location: Must be based in the USA (Remote or Hybrid in San Jose, CA)
Salary: $118,400 - $148,000 USD
Company
is an AI-forward enterprise providing a cloud-native Zero Trust Exchange platform to secure digital transformations for global customers.
What you will do
- Implement highly available, scalable infrastructure across AWS, GCP, and bare-metal environments.
- Drive an "automation-first" culture by writing code in Python or Go to eliminate manual toil and build self-healing systems.
- Implement and maintain sophisticated observability using Prometheus, Grafana, and OpenTelemetry, defining SLIs/SLOs and error budgets.
- Act as a lead Incident Commander, developing response playbooks and conducting deep-dive post-incident analyses.
- Partner with Engineering teams to conduct operability reviews to ensure service maturity.
Requirements
- Must be based in the USA.
- 3-5+ years of experience managing reliability, scalability, and availability for large-scale production services.
- Deep expertise in programming with Python, Go, or C/C++.
- Strong background in networking protocols, Linux/RHEL systems, and distributed architecture.
- Experience in high-stakes incident management and participation in 24/7 on-call rotations.
- Proven history of integrating AI tools to enhance daily workflows and problem-solving.
Nice to have
- Extensive experience with AWS, Azure, GCP, and IaC tools such as Ansible, Terraform, Helm, or Temporal.
- Experience with chaos engineering and disaster recovery planning at scale.
- Expertise in global routing (BGP), traffic tunneling (GRE, IPSec), L7 proxy architectures (HAProxy), and DNS at scale.
Culture & Benefits
- Comprehensive health plans and retirement options.
- Paid time off for vacation and sick leave.
- Parental leave options and education reimbursement.
- In-office perks for those working from the San Jose location.
- Culture centered on customer obsession, collaboration, ownership, and accountability.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →