Principal MLOps Engineer (Healthcare)
Мэтч & Сопровод
Для мэтча с этой вакансией нужен Plus
Описание вакансии
TL;DR
Principal MLOps Engineer (Healthcare): Leading the operational architecture, deployment strategy, and reliability engineering for AI integration into high-stakes Healthcare Information Systems with an accent on production discipline, compliance, and system resilience. Focus on architecting automated release pipelines, defining enterprise monitoring standards, and ensuring model reliability in mission-critical clinical environments.
Location: Remote (US). New hires are required to travel to a designated company location for on-site onboarding.
Salary: $142,800 - $196,350
Company
is a healthcare company focused on creating breakthrough solutions at the intersection of health, material, and data science to improve patient lives.
What you will do
- Architect and govern enterprise release processes, including automated approval gates and deployment readiness standards.
- Oversee the enterprise model registry to ensure seamless CI/CD integration and version control traceability.
- Define and enforce monitoring standards, SLAs, and SLOs across the AI ecosystem.
- Establish incident management frameworks, including triage workflows, postmortems, and rollback mechanisms.
- Partner with platform teams to provide compliance support and ensure operational security.
- Guide junior engineers in maintaining operational runbooks and reliable deployment pipelines.
Requirements
- Must be legally authorized to work in the US without sponsorship.
- Bachelor's degree or higher in Computer Science, Software Engineering, or a related field.
- 10+ years of software engineering experience, with 6+ years in deploying and maintaining large-scale production ML systems.
- Expert-level experience with cloud providers (AWS/GCP/Azure) and orchestration tools like Kubernetes, Kubeflow, or Airflow.
- Proficiency in Python and Java/Go, with deep knowledge of backend frameworks and system design.
- Expert knowledge of observability stacks (Prometheus, Grafana, Datadog) and enterprise SLA/SLO management.
Nice to have
- Master’s or PhD in Computer Science or Software Engineering.
- Deep understanding of cybersecurity best practices and ATO processes in regulated industries like Healthcare or Finance.
- Proven ability to design systems for massive concurrency and distributed data processing.
Culture & Benefits
- Competitive compensation and benefits package including medical, dental, and vision insurance.
- Retirement benefits and flexible spending accounts.
- Commitment to professional integrity and a collaborative, empathy-driven work environment.
- Regular benchmarking to ensure competitive pay and comprehensive well-being programs.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →