Looking for a job

Mahwish Anjum

@mahwishanjum

I’m a Data Engineer building scalable, cloud-native AWS pipelines.

India

What I'm looking for

I’m looking to build scalable, event-driven ETL and data platforms in AWS—prioritizing automation, performance, and measurable cost improvements—while partnering with product teams to turn data into near real-time business decisions.

I’m a Data Engineer with 4.5 years of overall experience (4 years in data engineering), specializing in building scalable, cloud-native architectures. I focus on delivering real-world data impact through automation, performance, and reliable pipelines, and I’m an AWS Certified Solutions Architect.

At Zielotech Software, I designed and deployed end-to-end serverless, event-driven data pipelines using AWS S3, Glue, Lambda, Athena, QuickSight, and EventBridge. I delivered measurable outcomes—30% faster ETL performance, ~40% ETL efficiency improvements, and a 15% cost reduction—by implementing incremental job bookmarking, optimized partitioning, and Parquet-based datasets.

I also reduced Athena query cost by ~60% by transforming raw CSV into columnar, partitioned Parquet. I automate infrastructure provisioning with Terraform and extend ingestion using AWS Kinesis, while ensuring reliability through CloudWatch monitoring and serverless triggers. Previously, at GSPANN, I implemented SCD2 (including evolving business logic) and a data exchange framework between MySQL and Eloqua. Earlier, as an ML Project Associate, I built a blood cell classifier that reached 95% accuracy.

Experience

Work history, roles, and key accomplishments

Current

Contract Data Engineer

Zielotech Software

Aug 2024 - Present (1 year 11 months)

Designed and deployed an end-to-end serverless AWS data pipeline using S3, Glue, Lambda, Athena, QuickSight, and EventBridge for automated ingestion and transformation. Improved ETL efficiency by ~40%, reduced Athena query costs by ~60% using partitioned Parquet, and provisioned infrastructure via Terraform.

AWS S3, AWS Glue, AWS Lambda, Amazon Athena, Amazon QuickSight, Amazon EventBridge, Amazon Kinesis, Terraform, Apache Parquet

Senior Software Engineer

GSPANN Technologies

Oct 2022 - Dec 2023 (1 year 2 months)

Implemented Slowly Changing Dimension Type 2 (SCD2) logic to maintain historical data accuracy and built data ingestion into Delta tables. Developed a framework to exchange data between MySQL and Eloqua and performed join- and filter-based data processing to improve data quality.

SCD2, Data Ingestion, Data Modeling, MySQL, Eloqua, Data Processing, Filters, Data Quality

Data Engineer

Zielotech Software

Oct 2021 - Oct 2022 (1 year)

Built end-to-end data pipelines using AWS S3, Glue, and Redshift and performed transformations to generate insights for decision-making. Used PySpark to implement ETL logic supporting downstream analytics.

AWS S3, AWS Glue, Amazon Redshift, PySpark, ETL, SQL, Data Analytics, AWS IAM

Project Associate (ML)

Indian Institute of Science

Aug 2018 - Dec 2018 (4 months)

Developed a machine learning-based solution for Complete Blood Count (CBC) analysis, using image processing to achieve 95% accuracy in blood cell classification. Installed and configured a VMware Linux server to meet the project's computational requirements.

Machine Learning, Image Processing, Computer Vision, Blood Cell Classification, VMware, Linux Administration, Data Preparation, Performance Tuning, Experimentation

Intern (Big Data)

Xavient Information Systems

Feb 2018 - Jul 2018 (5 months)

Built foundational Big Data knowledge through structured knowledge-transfer sessions and gained proficiency in Java to support project contributions.

Java, Software Development, Debugging, Data Processing

Associate Software Engineer

Crisp Analytics

Jun 2017 - Sep 2017 (3 months)

Developed an automated database backup script that runs scheduled, periodic backups to improve data integrity and recovery. Authored and maintained cron jobs to automate system tasks and improve operational reliability.

Automated Database Backups, Scheduling, Data Integrity, Disaster Recovery, Linux Systems, Scripting, Reliability Engineering