Arfa Zulfiqar
@arfazulfiqar
I build scalable data pipelines, specializing in CDC, Medallion Architecture, and PySpark on AWS/Azure.
What I'm looking for
I’m a Data Engineer focused on building scalable, high-performance data pipelines across AWS and Azure ecosystems. I specialize in full-load and Change Data Capture (CDC) processing, PySpark optimization, and end-to-end Medallion Architecture to support trustworthy downstream analytics.
In my recent roles at ADDO AI, I’ve architected ingestion pipelines for enterprise onboarding and engineered robust data flows using Azure Data Factory and Databricks. I’ve improved dashboard reliability and reporting accuracy by debugging complex datasets, enhancing existing logic, and enforcing strict SLAs for marketing domain datasets.
Previously, I modernized AWS data lakes by developing end-to-end Medallion Architecture ETL/ELT pipelines on AWS S3, automating AWS Glue jobs, and orchestrating event-driven workflows with AWS Lambda. I also built high-volume telecom pipelines (CBS to BRM) with PySpark and optimized SQL on Hive/YARN, delivered zero-defect data through end-to-end validation, and supported integration through Source-to-Target Mapping (SMX) documentation.
Experience
Work history, roles, and key accomplishments
Data Engineer
ADDO AI
Oct 2025 - Present (8 months)
Architected and deployed scalable ingestion pipelines for newly onboarded enterprise datasets, supporting full-load and Change Data Capture (CDC) patterns. Improved dashboard reliability and reporting accuracy by debugging complex datasets and strengthening data refresh logic while meeting marketing SLAs.
Data Engineer
ADDO AI
Jan 2025 - Present (1 year 5 months)
Built end-to-end ETL/ELT pipelines using Medallion Architecture (Bronze/Silver/Gold) on AWS S3 and automated processing with AWS Glue. Implemented event-driven orchestration with Lambda, metadata tracking in DynamoDB, and proactive monitoring in CloudWatch to support multi-source CDC and full-load delivery.
Associate Data Engineer
ADDO AI
Aug 2023 - Dec 2024 (1 year 4 months)
Developed high-volume telecom data pipelines across Raw, Curated, and Serving layers for CDR/EDR datasets, integrating prepaid and postpaid sources into unified views. Automated large-scale extraction/transformation with PySpark and optimized SQL on Hive/YARN, and ensured zero-defect delivery through end-to-end data validation and quality checks.
Assistant Researcher
SBASSE Lab (LUMS)
Jun 2022 - Jul 2022 (1 month)
Contributed to research projects by developing deep learning models for route optimization using CNN, RNN, and GNN architectures. Applied TensorFlow and Python for predictive modeling and supported implementation using mapping data via Google Maps API.
Education
Degrees, certifications, and relevant coursework
University of Engineering and Technology, Lahore
Bachelor of Science, Electrical Engineering (Computer Major)
2018 - 2022
Grade: GPA: 3.701/4.000
Earned a BSc in Electrical Engineering (Computer Major) from UET Lahore, graduating in 2022 with a GPA of 3.701/4.000.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Arfa?
You can contact Arfa and 90k+ other talented remote workers on Himalayas.
Message ArfaFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
