About this role
We are looking for Data Engineer (Python) to architect ClickHouse ETL pipelines (Airflow/Airbyte, ERPNext), deliver BI dashboards (Superset/Power BI), and oversee on-prem server infrastructure.
Responsibilities
- Design and maintain a ClickHouse data warehouse and Apache Iceberg lakehouse, building ETL/ELT pipelines via Airbyte or custom Python scripts.
- Write custom CDC connectors, transformation scripts, and pipeline utilities to integrate MariaDB, Oracle, and API data sources.
- Build and maintain Apache Superset dashboards with Jinja templating for data visualization and reporting.
- Manage infrastructure using Docker and Linux server administration to ensure reliable, scalable pipeline operations.
- 3–4 years of hands-on experience with ClickHouse, Python, Airbyte/Airflow, Superset, and related data engineering tools required.