TL;DR
Senior Data Engineer: Build scalable data infrastructure for an online fashion & beauty retailer by migrating data pipelines from GA4 to an internal Customer Behavior Source. The role centers on adapting Spark-based ETL pipelines, ensuring data accuracy, and collaborating with analytics teams on validation.
Location: Remote within Central Europe
Company
hirify.global is setting up a new Data Engineering team within a leading European online fashion & beauty retailer. The team will take ownership of key pipelines that fuel the company's ML and data-driven services.
What you will do
- Migrate existing data pipelines from GA4 (Google Analytics 4) to an internal Customer Behavior Source.
- Refactor and adapt Spark-based ETL pipelines to match a new behavioral event schema.
- Ensure data accuracy, consistency, and reliability during and after the migration.
- Work with large-scale datasets using PySpark and SQL.
- Collaborate with analytics and product teams to validate migrated data and resolve discrepancies.
- Document pipeline logic, data models, and migration decisions.
Requirements
- Strong hands-on experience with PySpark and Apache Spark.
- Proficiency with Airflow for orchestrating data pipelines.
- Solid SQL skills for data transformation and validation.
- Experience working with Databricks in a production environment.
Nice to have
- Familiarity with data modeling concepts.
Culture & Benefits
- Paid Vacation.
- Sick Days.
- Floating Holidays.
- Sport/Insurance Compensation.
- English Classes.
- Charity.
- Training Compensation.