Production Support Engineer (LMTS)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Production Support Engineer (LMTS) (AI Supply Chain): Scaling architecture for Agentforce platform from startup to enterprise-grade with an accent on reliability, performance tuning, and infrastructure automation. Focus on hardening observability, optimizing SQL/API latency, and ensuring AI/ML infrastructure scales for real-time insights.
Onsite: San Francisco CA, Seattle WA, or Bellevue WA
Company
Agile startup-within- team building AI-powered Agentforce for Supply Chain, backed by global scale.
What you will do
- Own reliability roadmap, transitioning systems to highly-available global-scale solutions.
- Partner with principal engineers on infrastructure strategy, capacity planning, and bottleneck resolution.
- Maintain Infrastructure as Code environments and evolve developer-friendly automation.
- Scale AI/ML infrastructure with GPU resources and data pipelines for supply chain insights.
- Lead observability stack hardening to prevent incidents and explain root causes.
- Deep-dive into performance engineering for SQL, APIs, and cross-service communication.
- Use AI tools to automate operations and contribute to shared system context repository.
- Evaluate human/AI-generated code for correctness, quality, security, and performance.
Requirements
- 5+ years in SRE, Production Engineering, or Backend Engineering focused on operations and scale.
- Proven experience scaling products through high-growth phases.
- Strong proficiency in Kubernetes, Terraform/OpenTofu, and AWS/GCP/Azure.
- Production-level coding in Golang, TypeScript, or Python.
- Deep understanding of distributed systems, microservices, databases, and AI agents.
- Low-ego collaboration in senior Principal engineer teams.
- AI-first engineering approach with experience using AI tools like Claude Code, GitHub Copilot.
- Advanced prompt engineering for reliable AI outputs.
Nice to have
- M.S. in Computer Science or equivalent.
- Strong PostgreSQL experience at scale (partitioning, indexing, query tuning).
- Advanced microservice orchestration, Temporal, service mesh.
- Supply chain/logistics domain experience.
- infrastructure, Hyperforce, or Data Cloud knowledge.
- Deep public cloud networking, security, identity management.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →