Tarsh Patel
@tarshpatel
Senior Data Engineer specializing in cloud-scale ETL, real-time pipelines, and data platform optimization.
What I'm looking for
I am a Senior Data Engineer with over 7 years' experience designing and optimizing large-scale, high-performance data systems across AWS, Azure, and GCP, delivering real-time and batch pipelines, lakehouse architectures, and ETL/ELT frameworks that produce measurable business value.
I have a proven record of improving query and pipeline performance, reducing latency and costs, and deploying ML-driven solutions for trade surveillance and anomaly detection. I mentor junior engineers, lead cross-functional initiatives, and automate CI/CD and infrastructure to ensure reliable, scalable analytics platforms.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
JPMorgan Chase & Co
Oct 2022 - Present (3 years)
Optimized Snowflake queries and built real-time Kafka and Spark pipelines, cutting trading analytics latency from 10 minutes to under 30 seconds and trade settlement latency to under 500 ms; standardized 20+ KPIs via dbt, improving report consistency by 40%.
Architected a petabyte-scale Lakehouse with Azure Synapse, ADLS Gen2, and Databricks, boosting query performance by 65% and cutting compute costs through T-SQL optimization; automated CI/CD and saved 25+ hours/week through PySpark ETL automation.
Built GCP-based pipelines with Cloud Composer, Dataflow, and BigQuery to integrate 10+ sources; migrated legacy Hadoop workloads to Dataproc/Spark, reducing job runtimes by 50%, and lowered processing time by 35% with serverless Beam.
Education
Degrees, certifications, and relevant coursework
Northwestern Polytechnic University
Master of Science, Computer Science
Completed a Master’s in Computer Science focusing on advanced data engineering and cloud-based analytics.
Jawaharlal Nehru Technological University, Hyderabad
Bachelor of Technology, Information Technology
Completed a Bachelor’s in Information Technology covering software development, databases, and systems engineering.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
Apache Spark
Apache Hive
GitLab
Kubernetes
Jenkins
GitLab CI
NumPy
Pandas
PySpark
dbt
MySQL
PostgreSQL
Hadoop
Gmail
Databricks
Microsoft Teams
Terraform
Azure DevOps
Java
TensorFlow
PyTorch
scikit-learn
Kafka
Serverless
Microsoft Excel
Kafka Streams
Airflow
Apache Beam
s3-lambda
SQL
XGBoost
Google Cloud Dataproc
Delta Lake
Transform
