Data Engineer
We are looking for a Data Engineer to build and operate scalable, reliable data pipelines that power analytics, reporting, and downstream data products. This role focuses on distributed data processing, orchestration, and strong engineering fundamentals.
Your responsibilities:
Design, build, and operate scalable batch and streaming data pipelines.
Implement distributed data processing using Spark, Apache Beam, or similar frameworks.
Orchestrate workflows using Apache Airflow.
Develop and maintain transformations using SQL and dbt.
Ensure data quality, observability, and performance across pipelines.
Collaborate closely with Analytics Engineers and Platform Engineers.
Build reliable, well‑modelled datasets in the data warehouse.
We are looking for you, if you have:
Strong experience with Google Cloud Platform (GCP) data services, or 5+ years on another major cloud provider.
Strong Python skills for data pipeline development.
Strong SQL skills and hands‑on experience with dbt.
Hands‑on experience with distributed data processing frameworks (Spark, Beam, Flink, etc.).
Strong experience with Apache Airflow.
Some experience with streaming systems (Pub/Sub, Kafka, etc.).
Mandatory experience using AI‑assisted coding tools in day‑to‑day development.
Experience with AI‑driven data workflows (e.g. ML pipelines, feature generation, LLM‑adjacent pipelines).
Experience supporting analytics or BI‑driven use cases.
Exposure to data quality or governance tooling.
We offer:
Participation in interesting and demanding projects.
Flexible working hours.
A great, non-corporate atmosphere.
Possibility to work remote or hybrid (2 days per week from the office).
Opportunities for development and promotion.
Attractive package of benefits.
We reserve the right to contact the selected candidates.
Data Engineer
Data Engineer
GR8 Tech
Warszawa
Praca w pełni zdalna
Zdalnie