Company hidden
4 days ago

Data Engineer (AI)

Work format
remote (Brazil only)
Employment type
fulltime
Grade
middle
English
b2
Country
Brazil
Vacancy from Hirify Global, a list of international tech companies

Job description

TL;DR

Data Engineer (AI/ML): Building and maintaining robust data pipelines that transform raw industrial data into high-quality datasets for AI model training, with an emphasis on ETL/ELT optimization and data governance. Focus on developing labeling infrastructure, integrating diverse data sources, and ensuring dataset readiness for ML consumption.

Location: Remote (Must be based in Brazil)

Company

hirify.global transforms the industrial world by fusing cutting-edge hardware with innovative software to empower frontline maintenance workers.

What you will do

  • Design and maintain data pipelines for ingestion from APIs, documents, websites, and raw sensor data.
  • Integrate and optimize ETL/ELT processes to improve performance, reliability, and maintainability.
  • Own the full dataset lifecycle from raw ingestion through cleaning and validation for AI consumption.
  • Define and enforce data quality standards and governance practices across the Data Foundry team.
  • Build and maintain labeling pipeline infrastructure for ML applications.
  • Participate in architectural decisions, code reviews, and technical mentorship.

Requirements

  • 3+ years of experience in data engineering.
  • Degree in Computer Science, Data Engineering, or a related technical field.
  • Proficiency in Python and strong SQL skills for data modeling and query optimization.
  • Experience with workflow orchestration tools such as Temporal, Airflow, Prefect, or Dagster.
  • Experience with distributed data processing (Spark, Databricks) and streaming systems (Kafka, Kinesis).
  • Understanding of the ML training lifecycle and layered data architecture (e.g., Medallion Architecture).

Nice to have

  • Experience with Go for high-performance pipeline components.
  • Knowledge of dbt, Delta Lake, Apache Iceberg, or Hudi.
  • Experience with data quality frameworks like Great Expectations or Soda.
  • Cloud experience, preferably with OCI, AWS, GCP, or Azure.
  • Ability to prototype with Streamlit and use LLMs/GenAI for internal tooling.

Culture & Benefits

  • Competitive salary and stock options.
  • 30 days of paid annual leave.
  • Education and courses stipend.
  • Travel reward: a trip anywhere in the world every 4 years.
  • Meal allowance (R$1,035/month) and sports incentive (R$300/month).
  • Health and dental insurance with national coverage.
