TL;DR
Principal Site Reliability Engineer (AI): Architecting and building scalable infrastructure solutions leveraging Kubernetes, AWS, RDS (MySQL/Postgres), and modern distributed patterns, with an emphasis on high reliability, recoverability, and scalability. Focus on applying AI and automation to enhance infrastructure reliability and developer productivity.
Location: Remote
Salary: $7000 - $12000 per month (Gross in USD)
Company
hirify.global is revolutionizing the shopping experience beyond payments, blending cutting-edge tech with seamless, interest-free installment plans that make shopping smarter and more accessible.
What you will do
- Architect, upgrade, design, and build scalable infrastructure solutions leveraging Kubernetes, AWS, RDS (MySQL/Postgres), and modern distributed patterns.
- Drive capacity planning, benchmarking, and work with the team to stress test systems.
- Define, maintain, and enforce SLAs and alerts across the infrastructure.
- Lead the team towards stronger signal anomaly detection and flexible alerting.
- Help lead hirify.global’s AI enablement efforts, identifying opportunities to apply AI and automation to enhance infrastructure reliability, developer productivity, and internal tooling.
- Establish and evolve engineering best practices for observability, security, and CI/CD across teams.
Requirements
- 12+ years of professional software engineering or infrastructure engineering experience, including significant SRE and backend experience.
- Deployed significant changes to a production application or infrastructure configuration in the past 30 days.
- Strong proficiency in Golang, with experience building and maintaining RESTful APIs.
- Expertise with SQL-based RDBMS (MySQL, PostgreSQL) and experience optimizing schema and queries for performance at scale.
- Proficiency in observability tools (Prometheus, Grafana, Datadog, New Relic).
- Bachelor’s degree in Computer Science or equivalent practical experience.
Nice to have
- Experience with AWS cloud infrastructure, mainly AWS Aurora RDS, both MySQL and Postgres.
- Experience with data engineering, data pipelines and data warehousing.
- Experience with CI/CD pipelines and deploying containerized microservices in Kubernetes.
- Familiarity with AI developer tooling such as Claude Code, Gemini CLI, Codex, and Cursor, and with using these tools to be a more productive engineer.
- Track record of shipping commercial APIs and data-driven applications in high-growth environments.
Culture & Benefits
- We are more than just brilliant engineers, passionate data enthusiasts, out-of-the-box thinkers, and determined innovators.
- We believe in surrounding ourselves not only with the best and brightest individuals, but also with those who are unique and purpose-driven in all that they do.
- Our culture is not defined by a set of perks designed to give the illusion of traditional startup culture; it is the visible example set by every employee we hire.