Manasa Reddy
@manasareddy
Senior Data Engineer specializing in scalable Big Data, Spark, and cloud-native data pipelines.
What I'm looking for
I am a Senior Data Engineer with about 12 years of experience designing, developing, testing, and deploying Big Data, Spark, and Hadoop solutions across cloud and on-premises environments. I build production-ready Spark applications on Databricks and EMR, leverage Delta Lake for ACID-compliant lakehouses, and design scalable ETL and streaming pipelines for analytics and ML consumption.
My background includes end-to-end implementation of enterprise data lakes and lakehouses using AWS services (S3, Glue, EMR, Athena, Redshift, Lambda, Kinesis) and Snowflake, plus orchestration with Airflow, Step Functions, and CI/CD automation. I have a strong track record optimizing Spark jobs (broadcasting, caching, shuffle tuning), implementing CDC with Snowflake Streams & Tasks, and integrating real-time pipelines with Kafka and Kinesis.
I am an innovative self-starter and collaborative team player who mentors junior engineers, drives reusable frameworks, and partners with stakeholders to translate business needs into reliable, cost-efficient data solutions. I focus on data quality, observability, and performance tuning to meet SLAs and accelerate data-driven decision making.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
HEB
Apr 2024 - Present (1 year 5 months)
Designed and implemented an enterprise data lake on AWS and built scalable ETL and real-time Spark pipelines, improving data availability and query performance for analytics and reporting.
Data Engineer / Data Analyst
American Express
May 2018 - Aug 2020 (2 years 3 months)
Developed Spark-based streaming and batch pipelines on AWS EMR and Kinesis, redesigned legacy workflows, and delivered analytics-ready datasets and visualizations to stakeholders.
Education
Degrees, certifications, and relevant coursework
Wright State University
Master of Science, Computer Science
2013 - 2015
Completed a Master of Science in Computer Science focusing on advanced topics in distributed systems and data engineering from August 2013 to June 2015.
Tech stack
Software and tools used professionally
Apache Spark
AWS Glue
Talend
Amazon CloudWatch
Amazon S3
AWS Step Functions
GitHub
Bitbucket
AWS CodeCommit
Kubernetes
Jenkins
GitHub Actions
Bitbucket Pipelines
NumPy
Pandas
PySpark
DB
Sqoop
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Rollout
Databricks
Terraform
Kafka Manager
JSON
XML
Logstash
MLflow
Kafka
Amazon SQS
Apache NiFi
Kibana
Zookeeper
Linux
Elasticsearch
Avro
AWS Lambda
Serverless
pytest
Airflow
Time Analytics
SQL
Delta Lake
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Manasa?
You can contact Manasa and 90k+ other talented remote workers on Himalayas.
Message ManasaFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
