Akshay K
@akshayk
Senior AWS Data Engineer specializing in scalable data pipelines and cloud migrations.
What I'm looking for
I am a senior data engineer with 8+ years of experience building scalable, data-driven solutions across AWS, Hadoop, Spark, and modern cloud data warehouses. I have led migrations from on-premises Hadoop to cloud platforms, designed ETL pipelines in PySpark and Scala, and optimized data warehouses in Snowflake and BigQuery to improve performance and lower costs.
I deliver automated, production-ready systems: CI/CD for data infrastructure, serverless architectures with Lambda and API Gateway, and real-time streaming with Kafka and Kinesis. I focus on reliability, performance tuning, and actionable analytics to help teams turn large, heterogeneous data into business value.
Experience
Work history, roles, and key accomplishments
AWS Data Engineer
Samsung Research America
Jan 2025 - Present (9 months)
Built and optimized data pipelines on AWS and GCP, improving query performance by 40% through BigQuery and PySpark optimizations and enabling real-time ingestion via Kafka and Kinesis. Automated ingestion and ETL with Airflow, Spark, and Snowpipe, and cut processing time by ~40% through PySpark tuning.
AWS Data Engineer
Genuine Auto Parts
Aug 2021 - Dec 2024 (3 years 4 months)
Developed real-time Spark Streaming applications and automated ETL/validation processes, reducing data issues and enabling continuous loads into Snowflake and Hive via Snowpipe and Airflow. Implemented AWS Lambda and EMR orchestration and improved testing and data quality through automated Python scripts.
AWS Data Engineer
Shell
Mar 2019 - Jul 2021 (2 years 4 months)
Built and maintained Hadoop and AWS-based ETL pipelines using Spark, Hive, and Sqoop, migrated data warehouses to Snowflake, and integrated Kafka and Elasticsearch for real-time analytics and search. Automated S3 workflows and infrastructure via CloudFormation and improved data processing with PySpark on EMR.
Cloud Data Engineer
Silicon Valley Bank
May 2017 - Feb 2019 (1 year 9 months)
Implemented ETL and Spark solutions in Python and Scala, managed AWS data services (Glue, Redshift, Lambda), designed star and snowflake schemas, and built Redshift and Snowflake data models and pipelines. Automated monitoring with CloudWatch and deployed Spark clusters for large-scale analytics.
Education
Degrees, certifications, and relevant coursework
Trine University
Master of Science, Information Systems
Master's degree in Information Systems from Trine University, Detroit, MI.
Tech stack
Software and tools used professionally
Splunk
Apache Spark
AWS Glue
Talend
SonarQube
Kubernetes
Jenkins
NumPy
Pandas
PySpark
AWS Data Pipeline
dbt
DB
Sqoop
PostgreSQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Node.js
Django
Yarn
Databricks
Terraform
AWS CloudFormation
jQuery
JavaScript
HTML5
Java
JSON
Apache Flume
scikit-learn
Kafka
RabbitMQ
Zookeeper
Ubuntu
CentOS
Linux
Windows
Elasticsearch
Apache Hive
Avro
AWS Lambda
Serverless
pytest
Apache Tomcat
JBoss
Airflow
Send it
SQL
Google Kubernetes Engine
Cosmos
