TL;DR
Staff Software Engineer (Observability): Lead development and optimization of scalable, reliable, and secure observability systems including logging, tracing, and metrics platforms. With an accent on global datacenter infrastructure and system reliability. Focus on designing monitoring, alerting, and automation for Kubernetes and microservices environments.
Location: Hybrid work with offices in Livingston, NJ; New York, NY; Sunnyvale, CA; Bellevue, WA. Remote work considered only if located more than 30 miles from an office. Must comply with U.S. export control regulations and be a U.S. person or eligible for export authorization.
Salary: $188,000–$250,000
Company
hirify.global is a publicly traded AI hyperscaler delivering a cloud platform with cutting-edge services powering AI innovation, operating data centers across the US and Europe.
What you will do
- Lead and mentor engineers fostering collaboration and continuous improvement.
- Scale logging, tracing, and metrics platforms supporting global datacenters.
- Develop and refine monitoring and alerting to enhance system reliability.
- Advise engineers on optimal use of observability systems.
- Automate interactions with compute infrastructure.
- Manage production clusters and enforce deployment best practices.
Requirements
- Must be a U.S. person or eligible for export controlled information access.
- 7+ years in software engineering, SRE, DevOps, or related fields.
- Deep expertise with observability tools like ClickHouse, Elastic, Loki, Victoria Metrics, Prometheus, Thanos, and Grafana.
- Expertise in Kubernetes, containerization, and microservices architectures.
- Proven leadership in incident management and post-mortem analysis.
- Excellent problem-solving, analytical, and communication skills.
Nice to have
- Experience running and scaling observability tools as a cloud provider.
- Experience administering large-scale Kubernetes clusters.
- Deep understanding of data-streaming systems.
Culture & Benefits
- Medical, dental, and vision insurance fully paid by the company.
- Company-paid life insurance and disability coverage.
- Flexible spending and health savings accounts.
- Tuition reimbursement and employee stock purchase program.
- Paid parental leave and family-forming support.
- 401(k) with employer match and flexible PTO.
- Catered lunch in office and data center locations.
- Casual work environment with a culture focused on innovative disruption.
Hiring process
- Onboarding at one of the company hubs within the first month.
- Teams gather quarterly for collaboration.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →