I will build a production-ready ETL data pipeline using AWS, Airflow, and PySpark


Pakistan

I speak English

Data Engineer, AWS, Apache Airflow, Spark, PostgreSQL, ETL

I am a Data Engineer and final-year Computer Science student with hands-on professional experience building scalable ETL pipelines and data architectures. I have worked at Cognetix.io on enterprise-gr...
About this service

Are you drowning in raw data with no reliable way to process it?

I build production-grade data pipelines that run automatically, scale with your data, and never break silently. No spaghetti scripts. No manual steps. Just clean, reliable data exactly where you need it.


What I Build

  • ETL pipelines using Python and PySpark: extract, transform, load, done
  • Apache Airflow DAGs for fully automated, scheduled workflows
  • Medallion Architecture pipelines (Bronze → Silver → Gold) with data quality at every layer
  • AWS data platforms: S3 data lake, Glue, EMR on EKS, IAM, Terraform
  • Cloud ingestion pipelines from any source into PostgreSQL, MySQL, ClickHouse, or Supabase
  • Fully containerised setups with Docker and Docker Compose
  • One-command deployments with CI/CD: no manual SSH, no runbooks
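
To give a rough idea of what the Bronze, Silver, Gold layering in the list above means in practice, here is a minimal sketch in plain Python rather than PySpark, so it runs without a cluster. The sample records, field names, and the functions `to_bronze`, `to_silver`, and `to_gold` are hypothetical and exist only for illustration. A real pipeline would read from actual sources and would likely express the same steps as Spark transformations.

```python
from collections import defaultdict

# Hypothetical raw records standing in for whatever a real source delivers.
RAW_ORDERS = [
    {"order_id": "1", "country": "PK", "amount": "120.50"},
    {"order_id": "2", "country": "pk", "amount": "79.50"},
    {"order_id": "2", "country": "pk", "amount": "79.50"},         # duplicate
    {"order_id": "3", "country": "IT", "amount": "not-a-number"},  # bad amount
    {"order_id": "4", "country": "IT", "amount": "40.00"},
]


def to_bronze(raw):
    """Bronze: keep records as received, only tagging each with its source row."""
    return [dict(rec, _row=i) for i, rec in enumerate(raw)]


def to_silver(bronze):
    """Silver: validate, normalise, and de-duplicate.

    Rejected rows are returned alongside the clean ones so that nothing
    is dropped silently.
    """
    clean, rejected, seen = [], [], set()
    for rec in bronze:
        try:
            amount = float(rec["amount"])
        except ValueError:
            rejected.append(rec)      # quality check: amount must be numeric
            continue
        if rec["order_id"] in seen:
            rejected.append(rec)      # quality check: order_id must be unique
            continue
        seen.add(rec["order_id"])
        clean.append({
            "order_id": int(rec["order_id"]),
            "country": rec["country"].upper(),
            "amount": amount,
        })
    return clean, rejected


def to_gold(silver):
    """Gold: aggregate into a reporting-ready shape (revenue per country)."""
    totals = defaultdict(float)
    for rec in silver:
        totals[rec["country"]] += rec["amount"]
    return dict(totals)


bronze = to_bronze(RAW_ORDERS)
silver, rejected = to_silver(bronze)
gold = to_gold(silver)
print(gold)
print(f"{len(rejected)} rows rejected at the Silver layer")
```

Each layer only reads from the one before it, so a failed quality check can be traced back to the untouched Bronze copy of the record.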

Expertise:

Big data

Data extraction

Data flow

Technology:

Amazon Redshift

Apache Kafka

Apache Spark

Python

SQL

My portfolio