Software Engineer, Fleet Management (AI)
Job description
TL;DR
Software Engineer, Fleet Management (AI): Build systems to manage hardware and configurations for large-scale infrastructure, with an emphasis on optimizing performance and reliability. Focus on automating processes and collaborating with cross-functional teams to improve operational efficiency.
Location: Hybrid (3 days in the office per week, San Francisco, CA)
Company
is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
What you will do
- Design and build systems to manage both cloud and bare-metal fleets at scale.
- Develop tools that integrate low-level hardware metrics with high-level job scheduling and cluster management algorithms.
- Leverage LLMs to coordinate vendor operations and optimize infrastructure workflows.
- Automate infrastructure processes, reducing repetitive toil and improving system reliability.
- Collaborate with hardware, infrastructure, and research teams to ensure seamless integration across the stack.
- Continuously improve tools, automation, processes, and documentation to enhance operational efficiency.
Requirements
- Strong software engineering skills with experience in large-scale infrastructure environments.
- Broad knowledge of cluster-level systems (e.g., Kubernetes, CI/CD pipelines, Terraform, cloud providers).
- Deep expertise in server-level systems (e.g., systemd, containerization, Chef, Linux kernels, firmware management, host routing).
- Passionate about optimizing the performance and reliability of large compute fleets.
- Thrives in dynamic environments and is eager to solve complex infrastructure challenges.
- Values automation, efficiency, and continuous improvement in everything built.