TL;DR
Senior Data Engineer (Python, AWS): Designing, building, and scaling robust data pipelines for collecting, transforming, and structuring large volumes of legal and financial data with an accent on AI-driven enrichment and real-time data processing. Focus on developing and optimizing ETL/ELT workflows using Python, Apache Spark, and SQL, and orchestrating scalable data workloads on AWS.
Location: Remote work available from Bulgaria, Georgia, Kazakhstan, Poland, Serbia, Romania, Ukraine, Cyprus, Latvia, Armenia, and specific cities within Eastern Europe and CIS (Almaty, Astana, Belgrade, Cluj-Napoca, Dnipro, Kharkiv, Krakow, Kyiv, Larnaca, Lodz, Lublin, Lviv, Odesa, Riga, Sofia, Tbilisi, Varna, Warsaw, Wroclaw, Yerevan)
Company
Our client is a leading legal recruiting company building a data-driven platform designed for lawyers and law firms, aggregating data from hundreds of public sources to create a unified ecosystem of structured and interconnected legal data.
What you will do
- Design and implement data ingestion pipelines to collect and process structured and unstructured data from multiple online sources.
- Develop and optimize ETL/ELT workflows using Python, Apache Spark, and SQL.
- Build and orchestrate scalable data workflows leveraging AWS services such as EMR, Batch, S3, and SageMaker.
- Develop and deploy internal data APIs and utilities supporting platform data access and manipulation.
- Implement robust text extraction and parsing logic to handle diverse data formats.
- Ensure data quality through validation, deduplication, normalization, and lineage tracking across data layers.
Requirements
- Proven expertise in Python programming.
- Solid understanding of the AWS ecosystem.
- Practical experience with Docker and containerized development workflows.
- Experience with web scraping, text extraction, or other data ingestion techniques from diverse online sources.
- Strong analytical mindset, communication skills, and ability to collaborate across multiple teams.
Nice to have
- Hands-on experience with Apache Spark and SQL for distributed data processing.
- Experience with EMR, SageMaker.
Culture & Benefits
- Vacation as per the laws of your country, with encouragement for proper rest.
- Health insurance support for you and your loved ones.
- 10 days sick pay without a doctor's note.
- Time off for state holidays, regardless of the client’s schedule.
- Pleasant environment with corporate parties and get-togethers.
- Comfort service for technical and everyday problems at work.
- Opportunity to work on global projects, grow your career in a supportive, flexible, and innovative tech environment.
- Help to cover the cost of IT certifications and provide access to top-tier courses and learning platforms.
Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →