Company hidden
6 hours ago

Senior Data Engineer (AI/ML)

$142,000 - $162,500
Work format
remote (USA only)
Employment type
full-time
Grade
senior
English
B2
Country
US
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Senior Data Engineer (AI/ML): Architecting and building scalable data pipelines for feature extraction, training data generation, and model monitoring, with an emphasis on data lakehouses, feature stores, and MLOps practices. The role focuses on optimizing Spark jobs and SQL queries for massive datasets, detecting data drift, implementing automated recovery, and ensuring reproducibility for LLM fine-tuning and inference.

Location: United States - Remote

Salary: $142,000 - $162,500 USD (Zone 1, National Average)

Company

hirify.global is the all-in-one EHR+ platform built for independent healthcare practices, connecting EHR software, billing, telehealth, and marketing to streamline operations and reduce burnout.

What you will do

  • Architect and write software for scalable pipelines handling feature extraction, training data, and model monitoring logs.
  • Own large systems such as the Feature Store or Data Lakehouse and serve as their subject-matter expert (SME), ensuring data availability for experimentation and production.
  • Monitor production pipelines, detect data drift or quality issues, and build automated recovery systems.
  • Lead engineering design reviews, explaining architecture decisions like batch vs. real-time processing.
  • Develop extensible frameworks for data quality checks and schema validation to prevent training-serving skew.
  • Collaborate with ML engineers on MLOps, including data versioning, lineage tracking, and reproducibility.

Requirements

  • 5+ years of professional software development experience.
  • Deep expertise in big data processing, distributed systems, and data modeling.
  • 3+ years of hands-on data engineering experience supporting analytics or data science.
  • Advanced proficiency in Python and SQL for production-grade data transformation and orchestration.
  • Experience with modern data infrastructure: Spark, Airflow, Kafka, Vector Databases, Data Lake/Lakehouse (Databricks, Snowflake, Delta Lake).
  • Familiarity with MLOps concepts like data leakage prevention, CI/CD, monitoring, and alerting.
  • Excellent technical communication and product mindset.

Nice to have

  • Background in healthcare software or structured business data.
  • Experience with Feature Store (Feast, Tecton) or Data Versioning (DVC, LakeFS).
  • Experience working with RAG pipelines or vector search.
  • Published research or open-source contributions in data engineering or ML.

Culture & Benefits

  • Competitive compensation based on geo-zone, plus variable pay and robust benefits package.
  • Work-from-home discounts (Dell), Gympass for fitness, Telus EAP for mental health resources.
  • Values: Start with the Customer, Keep It Simple, Stay Entrepreneurial, Better Together, Celebrate Success.
  • Equal opportunity employer committed to diversity and fair hiring.
