
Data Flow Engineer- Varsovia - ROC20
- On-site
- Varsovia, Dolnośląskie, Poland
Job description
Data Flow Engineer
Frontex Headquarters - On-site service provision | Initial duration: 12 months | Total possible
duration: 48 months
SEIDOR is looking for Data Flow Engineers to support complex data flow, data integration and streaming initiatives
in a Big Data environment.
The role focuses on designing, implementing and maintaining advanced data flows in Cloudera DataFlow / Apache
NiFi, integrating systems through APIs, CDC pipelines, Kafka and modern data governance capabilities.
Key Responsibilities
Design, implement, test and maintain complex data flows in Cloudera DataFlow / Apache NiFi, including ingest,
transform, enrich, route and egress.
Build and optimise real-time and near-real-time CDC pipelines using NiFi, Kafka, Debezium or SQL CDC connectors.
Integrate external systems through REST API, JDBC, Kafka and other protocols.
Manage data schemas with Avro and maintain metadata and lineage in Apache Atlas.
Configure security and governance using Apache Ranger policies for data flows.
Monitor, alert and troubleshoot performance and reliability of data pipelines.
Collaborate with data engineers, architects and business stakeholders on requirements and data flow architecture.
Create and maintain SOPs, runbooks and technical documentation; participate in CDP, NiFi and Kafka upgrades and
migrations.
Job requirements
Requirements
Minimum 8 years of IT-relevant professional experience and at least 6 years in a similar position.
Minimum education level: Level 6; English language skills: B2 or above.
Expert knowledge in defining, designing, implementing and maintaining complex data flows in Apache NiFi /
Cloudera DataFlow.
Advanced Python programming skills for data processing, NiFi custom logic, flow automation and integrations.
Advanced experience building REST API integrations, including endpoint calls, OAuth/JWT authentication, rate limiting
and error recovery.
Hands-on experience building CDC-based data flows using native NiFi processors/connectors and SQL Builder.
Good knowledge of Apache Iceberg, including tables, schema evolution and partitioning.
Technical Knowledge
Data governance and catalogue in CDP: Apache Atlas for metadata, lineage and tagging; Apache Ranger for security
policies and authorization.
Apache Kafka as a message broker, including topics, producers/consumers, schema registry and NiFi integration.
Apache Avro as a serialization standard, including schema evolution and compatibility.
Practical experience with NiFi in a CDP environment for design, deployment, monitoring and troubleshooting of
advanced flows.
Experience implementing at least one major integration project using NiFi as the central tool for API calls, database
integrations, transformations, routing and delivery.
Certification
At least 1 relevant certification such as Cloudera Certified Developer for Apache NiFi, Cloudera Data Flow / CFM
certification or an internationally recognised equivalent.
or
All done!
Your application has been successfully submitted!
You've already applied for this job
We appreciate your interest in this position. Unfortunately, you have already applied for this job.