Senior DevOps Engineer
Job description
TL;DR
Senior DevOps Engineer (Kubernetes/AWS): Operate and improve the platform tools that product teams rely on for reliable deployments, with an emphasis on Kubernetes operations, CI/CD pipelines, and the observability stack. Focus on maintaining self-service workflows, monitoring with Prometheus/OpenTelemetry, and responding to incidents during on-call rotations.
Location: United States - Remote (any location), full overlap with EST working hours
Company
A digital innovation and enterprise AI services provider that helps startups and enterprises with AI-driven solutions and complex problem-solving.
What you will do
- Operate platform tools, triage tickets, fix build issues, and handle service requests like access and environment setup.
- Maintain self-service workflows by updating docs, examples, and guardrails under senior guidance.
- Perform Kubernetes operations: deploy Helm charts, manage namespaces, diagnose issues, and follow incident runbooks.
- Support CI/CD pipelines with GitLab CI: keep pipelines green, adjust jobs, implement quality gates, and promote safer deploy strategies.
- Run the observability stack (Prometheus, Alertmanager, Thanos): maintain alerts, dashboards, and SLOs, and reduce alert noise.
- Assist with service instrumentation using OpenTelemetry for tracing, logging, and metrics.
- Contribute to documentation like runbooks, FAQs, and onboarding guides; participate in on-call rotation.
- Carry out cost and performance optimizations such as right-sizing workloads and automating tasks.
Requirements
- 8+ years in a platform, SRE, DevOps, or infrastructure role with an automation focus
- Experience operating Kubernetes and tools like Helm, Docker, Ingress NGINX
- Hands-on CI/CD with GitLab CI: jobs, artifacts, environments, deployment strategies
- Scripting in Bash or Python
- AWS fundamentals: IAM, EC2/EKS, S3, CloudWatch, Secrets Manager
- Observability: Prometheus/Alertmanager/Thanos, OpenTelemetry basics
- Ticket-driven work (Jira/ServiceNow), change management, stakeholder communication
Nice to have
- Terraform for IaC
- API integration with Java, Python, or Go
- Deep Linux and container runtime knowledge
- Insurance or financial services experience, including work under compliance requirements
Culture & Benefits
- Entrepreneurial spirit with self-reliance, open communication, and collaboration
- Advanced training opportunities, with the chance to apply new skills in real business solutions
- Passion for technology, a sense of responsibility, and an appetite for challenging problems are expected