TL;DR
Principal Site Reliability Engineer: Architecting and scaling infrastructure solutions using Kubernetes, AWS, and RDS with an emphasis on reliability, recoverability, and scalability. Focus on applying AI and automation to enhance infrastructure reliability, developer productivity, and internal tooling.
Location: Remote
Salary: $7,000 - $12,000 per month (gross, in USD)
Company
hirify.global is revolutionizing the shopping experience beyond payments, blending cutting-edge tech with seamless, interest-free installment plans.
What you will do
- Architect, upgrade, design, and build scalable infrastructure solutions leveraging Kubernetes, AWS, RDS (MySQL/Postgres), and modern distributed patterns.
- Drive capacity planning, benchmarking, and stress testing of systems to identify bottlenecks and prepare for further growth.
- Define, maintain, and enforce SLAs and alerts across our infrastructure.
- Lead teams toward stronger signal anomaly detection and better, more flexible alerting.
- Help lead hirify.global’s AI enablement efforts, identifying opportunities to apply AI and automation to enhance infrastructure reliability, developer productivity, and internal tooling.
- Establish and evolve engineering best practices for observability, security, and CI/CD across teams.
Requirements
- 12+ years of professional software engineering or infrastructure engineering experience, including significant SRE and backend experience.
- Have deployed significant changes to a production application or infrastructure configuration in the past 30 days.
- Strong proficiency in Golang, with experience building and maintaining RESTful APIs.
- Expertise with SQL-based RDBMS (MySQL, PostgreSQL) and experience optimizing schema and queries for performance at scale.
- Proficiency in observability tools (Prometheus, Grafana, Datadog, New Relic).
- Solid understanding of distributed systems design patterns (e.g., transactional outbox, event-driven architecture and stream processing, queues).
Nice to have
- Experience with AWS cloud infrastructure, mainly AWS Aurora RDS, both MySQL and Postgres.
- Experience with data engineering, data pipelines and data warehousing.
- Experience with CI/CD pipelines and deploying containerized microservices in Kubernetes.
- Familiarity with AI developer tooling like Claude Code, Gemini CLI, Codex, Cursor.
- Track record of shipping commercial APIs and data-driven applications in high-growth environments.
Culture & Benefits
- A belief in surrounding ourselves not only with the best and brightest individuals, but also with those who are unique and purpose-driven in all that they do.
- A culture in which every employee we hire is a visible, living example of that belief.