Keval Dainik
@kevaldainik
Senior Data Engineer specializing in scalable ETL and real-time analytics across GCP and AWS.
What I'm looking for
I’m a Senior Data Engineer with more than 6 years of IT experience delivering end-to-end data platforms across diverse industries. I’ve worked hands-on with Cloudera and Hortonworks, and I’m proficient across the Hadoop ecosystem (Spark, MapReduce, Hive, Kafka, HBase, Impala) with cluster management via Ambari.
At Deloitte, I designed and implemented scalable ETL pipelines using Scala with Apache Spark on Dataproc, and built real-time and batch processing with Cloud Dataflow, Apache Beam, Cloud Pub/Sub, and Cloud Composer. I led the migration of enterprise workflows from Teradata to GCP, refactoring legacy stored procedures into modular BigQuery SQL tuned with partitioning, clustering, and materialized views. I also implemented data validation, restart capabilities, and security controls (including row-level security).
Earlier, at Ford and Capgemini, I implemented AWS-based solutions (EC2, S3, RDS, VPC, EMR), optimized MapReduce and ETL pipelines, and supported data warehouse development with SQL Server/SSIS and automation for reliable data operations. I enjoy troubleshooting with Root Cause Analysis, collaborating through Agile sprints, and turning complex data work into dependable, production-ready pipelines.
Experience
Work history, roles, and key accomplishments
Designed and implemented scalable ETL pipelines with Scala/Spark on GCP (Dataproc, Dataflow, Pub/Sub) and migrated enterprise workflows from Teradata to BigQuery. Refactored stored procedures into modular BigQuery SQL tuned with partitioning, clustering, and materialized views, improving product model performance by 50%. Built secure, role-based data access and Looker dashboards.
Built AWS-based ETL and data pipelines using EC2, S3, RDS, EMR, and Lambda, integrating AWS sources and APIs into Redshift and HDFS for downstream analytics. Automated infrastructure with Terraform and CI/CD, and improved processing reliability using SnapLogic and Python-based Spark/MapReduce jobs.
Created and optimized SQL Server database objects and stored procedures to support reporting and application performance, including automated maintenance routines with SSIS. Managed security roles and data imports from multiple sources into a centralized SQL Server environment, and led physical-to-virtual and virtual-to-virtual server migrations with follow-up stability monitoring.
Education
Degrees, certifications, and relevant coursework
Charotar University of Science and Technology (CHARUSAT)
Bachelor's in Computer Science
Bachelor’s in Computer Science from Charotar University of Science and Technology (CHARUSAT), completed in 2020.
Tech stack
Software and tools used professionally
Apache Spark
Talend
SAS
Amazon EC2
Amazon S3
Google Cloud Storage
GitHub
Jenkins
NumPy
Pandas
PySpark
dbt
DB
Sqoop
MySQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Spring MVC
YARN
Google Analytics
Databricks
Terraform
AWS CloudFormation
Visual Studio
PyCharm
Jira
Apache Ant
jQuery
JavaScript
HTML5
Java
JSON
XML
Apache Flume
Log4j
Kafka
Ambari
Ubuntu
Linux
Windows
AWS Lambda
Amazon RDS
JUnit
Notepad++
Amazon VPC
Apache Tomcat
Airflow
Apache Beam
Root Cause Analysis
Amazon EMR
SQL
SciPy
Transform
Enhance
Phase
Dynamic
Middleware
SnapLogic
Task
Factory
