Назад
Company hidden
4 дня назад

Senior Site Reliability Engineer (SRE) (AI)

99Β 090 - 123Β 860$
Π€ΠΎΡ€ΠΌΠ°Ρ‚ Ρ€Π°Π±ΠΎΡ‚Ρ‹
onsite
Π’ΠΈΠΏ Ρ€Π°Π±ΠΎΡ‚Ρ‹
fulltime
Π“Ρ€Π΅ΠΉΠ΄
senior
Английский
b2
Π‘Ρ‚Ρ€Π°Π½Π°
US
Вакансия ΠΈΠ· списка Hirify.GlobalВакансия ΠΈΠ· Hirify Global, списка ΠΌΠ΅ΠΆΠ΄ΡƒΠ½Π°Ρ€ΠΎΠ΄Π½Ρ‹Ρ… tech-ΠΊΠΎΠΌΠΏΠ°Π½ΠΈΠΉ
Для мэтча ΠΈ ΠΎΡ‚ΠΊΠ»ΠΈΠΊΠ° Π½ΡƒΠΆΠ΅Π½ Plus

ΠœΡΡ‚Ρ‡ & Π‘ΠΎΠΏΡ€ΠΎΠ²ΠΎΠ΄

Для мэтча с этой вакансиСй Π½ΡƒΠΆΠ΅Π½ Plus

ОписаниС вакансии

ВСкст:
/

TL;DR

Senior Site Reliability Engineer (SRE) (AI): Design, build, and maintain scalable infrastructure and automation tools for traditional and AI-based systems with an accent on reliability, observability, and operational excellence. Focus on implementing CI/CD pipelines, supporting AI/ML model lifecycles, and leading incident response processes.

Location: Atlanta, GA (onsite)

Salary: $99,090 - $123,860 USD

Company

Financial services company focused on providing access to financial opportunities for individuals and communities.

What you will do

  • Design, build, and maintain scalable infrastructure and automation for traditional and AI systems.
  • Develop software to improve reliability and reduce manual operations.
  • Implement and manage CI/CD pipelines, including AI model deployment.
  • Monitor performance, availability, and security with observability tools.
  • Collaborate with data science and ML teams on model training, serving, and lifecycle management.
  • Lead incident response, root cause analysis, and postmortems.
  • Advocate SRE principles across engineering and AI teams.

Requirements

  • 5+ years in SRE, DevOps, or software engineering.
  • Strong programming in Python, Java, etc.
  • Experience with AI/ML workloads (model training, inference, GPU orchestration).
  • Deep knowledge of Linux, cloud platforms (primarily Azure, AWS), container orchestration.
  • Infrastructure-as-code (Terraform, Ansible, GitHub Actions).
  • Monitoring/logging (Dynatrace), networking, security, distributed systems.
  • Excellent communication and collaboration.

Nice to have

  • AI model observability, drift detection, performance monitoring.
  • Open-source contributions in SRE, DevOps, or ML infrastructure.
  • Cloud platform certifications.

Culture & Benefits

  • Competitive compensation and incentive opportunities.
  • Health, dental, vision, life insurance.
  • 401(k) with up to 6% company match; employer-paid retirement plan (4%).
  • Tuition reimbursement up to $5,250/year.
  • 20 days PTO, 9 company holidays, flexible Diversity Celebration Day.
  • 40 hours paid volunteer time per year.

Π‘ΡƒΠ΄ΡŒΡ‚Π΅ остороТны: Ссли Ρ€Π°Π±ΠΎΡ‚ΠΎΠ΄Π°Ρ‚Π΅Π»ΡŒ просит Π²ΠΎΠΉΡ‚ΠΈ Π² ΠΈΡ… систСму, ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΡ iCloud/Google, ΠΏΡ€ΠΈΡΠ»Π°Ρ‚ΡŒ ΠΊΠΎΠ΄/ΠΏΠ°Ρ€ΠΎΠ»ΡŒ, Π·Π°ΠΏΡƒΡΡ‚ΠΈΡ‚ΡŒ ΠΊΠΎΠ΄/ПО, Π½Π΅ Π΄Π΅Π»Π°ΠΉΡ‚Π΅ этого - это мошСнники. ΠžΠ±ΡΠ·Π°Ρ‚Π΅Π»ΡŒΠ½ΠΎ ΠΆΠΌΠΈΡ‚Π΅ "ΠŸΠΎΠΆΠ°Π»ΠΎΠ²Π°Ρ‚ΡŒΡΡ" ΠΈΠ»ΠΈ ΠΏΠΈΡˆΠΈΡ‚Π΅ Π² ΠΏΠΎΠ΄Π΄Π΅Ρ€ΠΆΠΊΡƒ. ΠŸΠΎΠ΄Ρ€ΠΎΠ±Π½Π΅Π΅ Π² Π³Π°ΠΉΠ΄Π΅ β†’