Назад
1 день назад

Lead Data Engineer (GCP/PySpark/Scala)

Формат работы
hybrid
Тип работы
fulltime
Грейд
lead
Английский
b2
Страна
Poland
vacancy_detail.hirify_telegram_tooltipВакансия из Telegram канала -

Мэтч & Сопровод

Покажет вашу совместимость и напишет письмо

Описание вакансии

🚀 We’re Hiring: Lead Data Engineer (GCP / PySpark / Scala)
🌍 Location: Hybrid — Kraków, Poland (STRICTLY Poland only — no relocation provided)
📅 Start: ASAP

We are looking for an experienced Lead Data Engineer to join a collaborative Agile engineering team delivering enterprise-scale cloud and data solutions for an international banking and financial services company. You’ll work on large-scale data migration, cloud modernization initiatives, and scalable data processing pipelines in a distributed engineering environment.

⚠️ Important: Only candidates currently based in Poland will be considered. Relocation is not available.

🛠 Tech Stack

Cloud: Google Cloud Platform (GCP)
Big Data: Apache Hadoop, Spark, PySpark, Hive, YARN
Languages: Python, Scala
Data Engineering: ETL Frameworks, Data Pipelines, MapReduce
Databases & APIs: SQL, RESTful APIs
Workflow Orchestration: Airflow
DevOps & Tools: Git, GitHub, Jenkins, Ansible, Jira
Platforms: Unix / Linux
Nice to Have: Elasticsearch, Java APIs, AI-driven engineering tools

📋 What You’ll Be Doing

Designing and building scalable cloud-based data pipelines
Developing and optimizing large-scale data processing solutions using Spark, PySpark, Hive, and GCP services
Contributing to data migration and platform modernization initiatives
Collaborating closely with Engineers, Data Analysts, and Business Analysts
Driving engineering best practices and contributing to architecture decisions
Leading technical activities within distributed Agile teams
Supporting and mentoring engineers within the team
Building and maintaining ETL and workflow orchestration processes
Troubleshooting, debugging, and communicating technical findings effectively
Supporting continuous improvement through automation and AI-driven engineering approaches

🎯 What We’re Looking For

10+ years of experience in software engineering
Strong hands-on expertise with GCP and PySpark or Scala
Experience designing and building cloud-based data pipelines
Strong practical knowledge of Google Cloud Platform
Experience leading technical teams or mentoring engineers
Hands-on experience with Apache Hadoop ecosystem technologies
Strong experience with Spark, Hive, YARN, ETL frameworks, and SQL
Strong Unix/Linux platform knowledge
Experience working with RESTful APIs
Familiarity with Git/GitHub, Jenkins, Ansible, and Jira
Experience with workflow orchestration tools such as Airflow
Ability to communicate effectively with both technical and non-technical stakeholders
Strong English communication skills

💡 Nice to Have

Experience with Elasticsearch
Experience developing Java-based APIs
Knowledge of data ingestion frameworks and processes
Familiarity with Agile and DevOps methodologies (Scrum / Kanban)
Interest in AI-driven tools and engineering automation

🧠 Soft Skills

Strong problem-solving and analytical mindset
Ownership and accountability
Ability to work independently in distributed international teams
Proactive communication style
Leadership and mentoring capabilities
Continuous improvement mindset

📍 Work Format

Hybrid model
Location: Kraków, Poland

💬 Interested or know someone who fits?
Reach out directly:

Будьте осторожны: если работодатель просит войти в их систему, используя iCloud/Google, прислать код/пароль, запустить код/ПО, не делайте этого - это мошенники. Обязательно жмите "Пожаловаться" или пишите в поддержку. Подробнее в гайде →

Текст вакансии взят без изменений

Источник -