mahwish anjum
@mahwishanjum
I’m a Data Engineer building scalable, cloud-native AWS pipelines.
What I'm looking for
I’m a Data Engineer with overall 4.5 years of experience (4 years as data engineer), specializing in building scalable, cloud-native architectures. I focus on delivering real-world data impact through automation, performance, and reliable pipelines, and I’m AWS Certified Solutions Architect.
At Zielotech Software, I designed and deployed end-to-end serverless, event-driven data pipelines using AWS S3, Glue, Lambda, Athena, QuickSight, and EventBridge. I delivered measurable outcomes—30% faster ETL performance, ~40% ETL efficiency improvements, and a 15% cost reduction—by implementing incremental job bookmarking, optimized partitioning, and Parquet-based datasets.
I also reduced Athena query cost by ~60% by transforming raw CSV into columnar, partitioned Parquet. I automate infrastructure provisioning with Terraform and extend ingestion using AWS Kinesis, while ensuring reliability through CloudWatch monitoring and serverless triggers. Previously, I implemented SCD2 (including evolving business logic) and data exchange frameworks between MySQL and Eloqua at GSPANN, and I’ve also worked on machine learning (95% accuracy) during my ML project associate role.
Experience
Work history, roles, and key accomplishments
Contract Data Engineer
Zielotech Software
Aug 2024 - Present (1 year 8 months)
Designed and deployed an end-to-end serverless AWS data pipeline using S3, Glue, Lambda, Athena, QuickSight, and EventBridge for automated ingestion and transformation. Improved ETL efficiency by ~40%, reduced Athena query costs by ~60% using partitioned Parquet, and provisioned infrastructure via Terraform.
Senior Software Engineer
GSPANN Technologies
Oct 2022 - Dec 2023 (1 year 2 months)
Implemented Slowly Changing Dimensions (SCD2) logic to maintain historical data accuracy and built data ingestion into delta tables. Developed a framework to exchange data between MySQL and Eloqua and performed join/filter-based data processing to improve data quality.
Data Engineer
Zielotech Software
Oct 2021 - Oct 2022 (1 year)
Built end-to-end data pipelines using AWS S3, Glue, and Redshift and performed transformations to generate insights for decision-making. Used PySpark to implement ETL logic supporting downstream analytics.
Project Associate (ML)
Indian Institute of Science
Aug 2018 - Dec 2018 (4 months)
Developed a machine learning-based solution for Complete Blood Counts (CBC) using image processing to achieve 95% accuracy in blood cell classification. Installed and configured a VMware Linux server to support project computational requirements reliably.
Intern (Big Data)
Xavient Information Systems
Feb 2018 - Jul 2018 (5 months)
Built foundational Big Data knowledge through structured knowledge-transfer sessions and gained proficiency in Java to support project contributions.
Associate Software Engineer
Crisp Analytics
Jun 2017 - Sep 2017 (3 months)
Developed an automated database backup script to improve data integrity and recovery by scheduling periodic backups. Authored and maintained cron jobs to automate system tasks and enhance operational reliability.
Education
Degrees, certifications, and relevant coursework
International Institute of Information Technology (IIIT) Bhubaneswar
Master of Technology, Computer Science
2015 - 2017
Completed an M.Tech in Computer Science at IIIT Bhubaneswar from 2015 to 2017.
Uttar Pradesh Technical University, Lucknow
Bachelor of Technology, Computer Science
2009 - 2013
Completed a B.Tech in Computer Science at Uttar Pradesh Technical University, Lucknow from 2009 to 2013.
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring mahwish?
You can contact mahwish and 90k+ other talented remote workers on Himalayas.
Message mahwishFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
