TL;DR
Engineering Manager, Site Reliability (Fintech): Lead and grow a high-performing SRE team responsible for the scalability, reliability, and robustness of hirify.global’s platform, with an emphasis on incident management, on-call operations, and SLIs/SLOs. The role focuses on driving continuous improvement through post-incident reviews, guiding infrastructure investments, and championing operational excellence.
Location: Hybrid. We value meaningful collaboration and connection at our Toronto office twice a week, with lunch, snacks, and beverages on us.
Company
hirify.global is a digital banking platform that gives self-made business owners the tools and know-how to be great with money—bringing clarity, confidence, and control to every dollar earned.
What you will do
- Lead, coach, and grow a high-performing Site Reliability Engineering team, supporting career development and technical excellence.
- Own the reliability, scalability, and performance of hirify.global’s platform.
- Lead and evolve SRE best practices, including incident management, on-call operations, SLIs/SLOs, and error budgets.
- Partner with Engineering, Product, and Data teams to embed reliability and scalability into features.
- Drive continuous improvement via post-incident reviews, root cause analysis, and preventative actions.
- Define and track key reliability KPIs (e.g., uptime, latency, incident frequency, MTTR) to inform decisions.
Requirements
- 3+ years of experience managing or leading engineers and 6+ years of experience in Site Reliability Engineering, Platform Engineering, or Infrastructure roles.
- Strong track record of owning and improving system reliability, scalability, and performance in production environments.
- Experienced in improving observability, performance, or operational maturity at growing companies.
- Led teams through incident response, postmortems, and reliability improvements using data.
- Strong foundation in operating and scaling production systems in cloud environments (e.g., AWS) and modern infrastructure practices (IaC, CI/CD, monitoring, alerting).
- Proven record of partnering with Product and Engineering leaders to balance delivery velocity with long-term reliability and operational excellence.
- Highly collaborative and experienced in leading cross-functional initiatives.
Nice to have
- Built or scaled an SRE or platform team at a growing fintech startup.
- Experience supporting high-availability, customer-facing systems in fintech or other regulated environments.
- Familiar with SRE best practices such as SLOs, error budgets, and capacity planning at scale.
Culture & Benefits
- Competitive salary and meaningful equity for all employees.
- Comprehensive benefits from day one, including health, dental, and vision coverage for you and your dependents.
- Flexible vacation (15 days + 5 flex days) and an extra week of office closure for holidays.
- Parental leave (12 weeks with 100% salary top-up) for all full-time employees who become parents.
- Hybrid work environment in Toronto with lunch, snacks, and beverages provided.
- Dog-friendly office space in Toronto.
- Commitment to personal and professional growth through feedback, mentorship, and coaching.
- Mac-first company with top-tier equipment in Toronto offices.
- Social connection through annual company-wide get-togethers, quarterly team events, and happy hours.
Hiring process
- Stage 1: A 1-hour Google Meet call with a member of the People team.
- Stage 2: A 1-hour Google Meet video call with the hiring manager (VP of Engineering).
- Stage 3: A 1-hour in-person interview with a member of hirify.global’s senior leadership team.
- Stage 4: A 1-hour live technical assessment via Google Meet, or in person, with a panel of interviewers (technical leaders & stakeholders).