Data Flow Engineer - Warsaw - ROC20

  • On-site
    • Warsaw, Mazowieckie, Poland

Job description

Data Flow Engineer

Frontex Headquarters - On-site service provision | Initial duration: 12 months | Total possible duration: 48 months

SEIDOR is looking for Data Flow Engineers to support complex data flow, data integration and streaming initiatives in a Big Data environment.

The role focuses on designing, implementing and maintaining advanced data flows in Cloudera DataFlow / Apache NiFi, integrating systems through APIs, CDC pipelines, Kafka and modern data governance capabilities.

Key Responsibilities

  • Design, implement, test and maintain complex data flows in Cloudera DataFlow / Apache NiFi, covering ingestion, transformation, enrichment, routing and egress.

  • Build and optimise real-time and near-real-time CDC pipelines using NiFi, Kafka, Debezium or SQL CDC connectors.

  • Integrate external systems through REST API, JDBC, Kafka and other protocols.

  • Manage data schemas with Avro and maintain metadata and lineage in Apache Atlas.

  • Configure security and governance using Apache Ranger policies for data flows.

  • Monitor, alert and troubleshoot performance and reliability of data pipelines.

  • Collaborate with data engineers, architects and business stakeholders on requirements and data flow architecture.

  • Create and maintain SOPs, runbooks and technical documentation; participate in CDP, NiFi and Kafka upgrades and migrations.

Job requirements

Requirements

  • Minimum 8 years of IT-relevant professional experience and at least 6 years in a similar position.

  • Minimum education level: Level 6; English language skills: B2 or above.

  • Expert knowledge in defining, designing, implementing and maintaining complex data flows in Apache NiFi / Cloudera DataFlow.

  • Advanced Python programming skills for data processing, NiFi custom logic, flow automation and integrations.

  • Advanced experience building REST API integrations, including endpoint calls, OAuth/JWT authentication, rate limiting and error recovery.

  • Hands-on experience building CDC-based data flows using native NiFi processors/connectors and SQL Builder.

  • Good knowledge of Apache Iceberg, including tables, schema evolution and partitioning.

Technical Knowledge

  • Data governance and catalogue in CDP: Apache Atlas for metadata, lineage and tagging; Apache Ranger for security policies and authorization.

  • Apache Kafka as a message broker, including topics, producers/consumers, schema registry and NiFi integration.

  • Apache Avro as a serialization standard, including schema evolution and compatibility.

  • Practical experience with NiFi in a CDP environment for design, deployment, monitoring and troubleshooting of advanced flows.

  • Experience implementing at least one major integration project using NiFi as the central tool for API calls, database integrations, transformations, routing and delivery.

Certification

  • At least 1 relevant certification such as Cloudera Certified Developer for Apache NiFi, Cloudera Data Flow / CFM certification or an internationally recognised equivalent.

or