TL;DR
Automation & Tools Engineer (DevOps/SRE): Automating and maintaining the infrastructure that powers the core platform, including data pipelines, ML workloads, and real-time analytics systems with an accent on build and deployment systems, as well as configuration management. Focus on improving system reliability and performance through automation and observability.
Location: Must be based in San Francisco
Salary: $170K - $190K
Company
hirify.global is solving real-world enterprise problems.
What you will do
- Automate and control datacenter and cloud-based infrastructure.
- Improve system reliability and performance through automation, observability, and proactive capacity planning.
- Create and configure tools for datacenter provisioning, configuration management, and observability.
- Improve developer experience by providing self-service tools.
- Implement and maintain monitoring, alerting, and incident response processes.
- Drive post-incident analysis and continuous improvement initiatives.
Requirements
- 5+ years of experience in Tools development, SRE, DevOps, or platform engineering roles.
- Good programming skills with IaC languages such as Ansible, Helm, Kustomize.
- Good programming skills with general-purpose languages such as Python or Go.
- Deep experience with containerization (Docker) and Kubernetes.
- Strong knowledge of Linux systems and networking fundamentals.
- Experience with monitoring and observability stacks (e.g., Prometheus, Grafana, Datadog, ELK, OpenTelemetry).
- Proficiency with CI/CD tools and pipelines (e.g., GitHub Actions, ArgoCD, etc.).
- Ability to debug complex systems and automate solutions in scripting languages (Python, Bash, etc.).
- Excellent communication skills and the ability to work cross-functionally.
Nice to have
- Familiarity with build configuration tools, software dependency management and C++.
- Experience with datacenter automation such as system imaging and configuration management.
- Experience supporting data-intensive platforms (Spark, Airflow, Kafka, etc.).
- Familiarity with security practices for cloud-native applications and infrastructure.
- Experience in high-compliance or SOC-2 environments.
Culture & Benefits
- Ownership of mission-critical infrastructure.
- A front-row seat to a high-performance engineering culture.
- The ability to influence how our platform scales.
- An environment that values curiosity, accountability, and impact.
Будьте осторожны: если вас просят войти в iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →