Sr Site Reliability Engineer
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Sr Site Reliability Engineer (Network): Design, build, and scale a global network platform spanning physical datacenters and multi-cloud environments with an accent on network automation and operational experience supporting large scale production infrastructure. Focus on troubleshooting and performance tuning in Kubernetes and Docker environments, with a focus on networking.
Location: Sydney
Company
is a global technology company with a mission to create a better, more open internet for everyone through principled, intelligent advertising.
What you will do
- Design, build, and scale a global network platform spanning physical datacenters and multi-cloud environments across AWS, Azure, and Alibaba Cloud.
- Support thousands of hosts worldwide, engineering reliable and efficient solutions to petabyte-scale data challenges.
- Own troubleshooting and resolution of complex network issues, upholding high availability and performance across the entire infrastructure footprint.
- Lead root cause analysis and postmortems, turning incidents into actionable improvements that raise the bar for operational excellence.
- Eliminate toil by building tools, automating workflows, and continuously improving the processes your team depends on every day.
- Share responsibility for network integrity through participation in a global, follow-the-sun on-call rotation.
Requirements
- You have 6-8 years of hands on network automation and operational experience supporting large scale production infrastructure.
- You have a software-first mindset with strong development and networking experience, able to think like an engineer and operate like an architect.
- You bring deep expertise in TCP/IP, the OSI model, and large-scale IP networking protocols including BGP and OSPF.
- You have hands-on experience with Kubernetes networking technologies such as Cilium and Calico, and a solid understanding of container network interfaces (CNIs).
- You have managed software load balancers like NGINX Ingress, Envoy, or HAProxy in large-scale production environments.
- Proficient creating automation and building tools using Python or Go.
Nice to have
- Experience running Kubernetes clusters on bare-metal is a plus.
- Experience integrating AI tools (LLMs, MCP, agentic workflows) into engineering processes to automate tasks and improve development velocity.
Culture & Benefits
- Award-winning culture based on trust, ownership, empathy, and collaboration.
- Value the unique experiences and perspectives that each person brings to .
- Committed to fostering inclusive spaces where everyone can bring their authentic selves to work every day.
- Breadth of technical opportunity.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →