Company hidden
1 day ago

Senior Data Engineer (Python, AWS)

Work format
remote (Europe only)
Employment type
full-time
Grade
senior
English
B2
Country
Serbia, Ukraine, Poland, Armenia, Romania, Cyprus, Latvia, Kazakhstan, Georgia, Bulgaria
Vacancy from Hirify RU Global, a list of companies with Eastern European roots

Job description

TL;DR

Senior Data Engineer (Python, AWS): Design, build, and scale robust data pipelines that collect, transform, and structure large volumes of legal and financial data, with an emphasis on AI-driven enrichment and real-time data processing. The role focuses on developing and optimizing ETL/ELT workflows using Python, Apache Spark, and SQL, and on orchestrating scalable data workloads on AWS.

Location: Remote work available from Bulgaria, Georgia, Kazakhstan, Poland, Serbia, Romania, Ukraine, Cyprus, Latvia, Armenia, and specific cities within Eastern Europe and CIS (Almaty, Astana, Belgrade, Cluj-Napoca, Dnipro, Kharkiv, Krakow, Kyiv, Larnaca, Lodz, Lublin, Lviv, Odesa, Riga, Sofia, Tbilisi, Varna, Warsaw, Wroclaw, Yerevan)

Company

Our client is a leading legal recruiting company building a data-driven platform designed for lawyers and law firms, aggregating data from hundreds of public sources to create a unified ecosystem of structured and interconnected legal data.

What you will do

  • Design and implement data ingestion pipelines to collect and process structured and unstructured data from multiple online sources.
  • Develop and optimize ETL/ELT workflows using Python, Apache Spark, and SQL.
  • Build and orchestrate scalable data workflows leveraging AWS services such as EMR, Batch, S3, and SageMaker.
  • Develop and deploy internal data APIs and utilities supporting platform data access and manipulation.
  • Implement robust text extraction and parsing logic to handle diverse data formats.
  • Ensure data quality through validation, deduplication, normalization, and lineage tracking across data layers.

Requirements

  • Proven expertise in Python programming.
  • Solid understanding of the AWS ecosystem.
  • Practical experience with Docker and containerized development workflows.
  • Experience with web scraping, text extraction, or other data ingestion techniques from diverse online sources.
  • Strong analytical mindset, communication skills, and ability to collaborate across multiple teams.

Nice to have

  • Hands-on experience with Apache Spark and SQL for distributed data processing.
  • Experience with EMR, SageMaker.

Culture & Benefits

  • Vacation as per the laws of your country, with encouragement for proper rest.
  • Health insurance support for you and your loved ones.
  • 10 days sick pay without a doctor's note.
  • Time off for state holidays, regardless of the client’s schedule.
  • Pleasant environment with corporate parties and get-togethers.
  • Comfort service for technical and everyday problems at work.
  • Opportunity to work on global projects and grow your career in a supportive, flexible, and innovative tech environment.
  • Help covering the cost of IT certifications, plus access to top-tier courses and learning platforms.

Vacancy text reproduced without changes
