Software Engineer (Site Reliability, Crossborder)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Software Engineer (Site Reliability): Building and scaling the reliability foundation for a global marketplace platform with an accent on multi-region architecture, observability, and developer experience. Focus on automating workflows with AI, managing incident response, and ensuring system stability as the platform expands to over 100 countries.
Location: Must be based in Tokyo, Japan (Hybrid)
Company
is a global marketplace platform dedicated to circulating value and connecting people worldwide through technology.
What you will do
- Own the SRE function for the global app platform, defining SLIs/SLOs and leading production readiness reviews.
- Operate and scale infrastructure across multiple regions, evolving from single-region to multi-region configurations.
- Build and maintain the observability stack, including metrics, distributed tracing, and structured logging.
- Lead incident response, mitigation, and blameless post-mortems while reducing alert volume through AI-assisted automation.
- Develop CI/CD pipelines and developer tooling to improve engineering velocity and safety.
- Collaborate cross-functionally with product, platform, and operations teams to bridge infrastructure health with product reliability.
Requirements
- Japanese or English proficiency at C1 level required.
- Hands-on experience operating production services on GCP and Kubernetes.
- Experience defining SLIs/SLOs and managing on-call rotations with blameless post-mortem processes.
- Operational experience with databases and solid understanding of database fundamentals.
- Experience applying AI/LLMs to improve engineering productivity or building developer tooling.
- Proactive mindset toward troubleshooting and willingness to embed into product teams.
Nice to have
- Experience communicating SLIs/SLOs to business and product leaders.
- Experience in product development within a team environment.
- Experience developing AI agents.
Culture & Benefits
- Engineering principles focused on passion for the product, growing together, and open collaboration.
- Full flextime work hours with no core time.
- Commitment to Inclusion & Diversity and equal opportunity hiring.
- Opportunity to work on global-scale challenges in a multi-region environment.
Hiring process
- Application screening followed by a skill assessment on HackerRank or GitHub.
- Multiple interview rounds.
- Reference check prior to final offer.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →