CDP Data Engineer (hybrid work in Warsaw)
We are a team that develops and maintains analytical solutions for digital services. We are building a modern data ecosystem based on a wide range of technologies and the AWS cloud.
We are looking for a person to join our team and help us with the implementation of a Customer Data Platform (CDP) – from installation and configuration, through development, to integration with our analytical and Data Governance tools, and finally, maintenance.
- Implementation and maintenance of the CDP environment (collector, enrich, loaders)
- Integration of event data from various sources
- Configuration and development of the CDP (schema repository)
- Building and maintaining data pipelines in the cloud (AWS)
- Creating and developing analytical models and integration with the data catalog
- Participating in the development and implementation of user identification mechanisms (ID stitching, cross-device, cross-site, consent logic)
- Documentation and collaboration with Data Engineering, Data Management, Data Governance, IT, Privacy, and Analytics teams
- Knowledge of AWS (S3, EMR, Lambda, Kinesis, Glue, Snowflake)
- Experience in implementing or administering open-source systems (Snowplow, Kafka, Spark, Flink, Airflow)
- Experience with CI/CD and GitHub
- Python – for integration, automation, and data processing (essential)
- SQL – for building data models and analysis
- Scala – welcome, if you want to develop the core of Snowplow/OpenSnowcat
- JS – welcome, when working with web trackers
- Independence in problem-solving and readiness to work with new technologies
- Good knowledge of English (communication in documentation and community)
- Experience working with the Snowplow/OpenSnowCat solution
- Experience in working with event-driven architecture
- Contributions to open-source projects
- Interest in Data Governance, Metadata Management, and Data Quality (DQ) topics