Ketan Boro
@ketanboro
Senior Data Engineer delivering scalable, compliant cloud data platforms and real-time pipelines.
What I'm looking for
I am a Senior Data Engineer with 6+ years of experience building secure, scalable data platforms across healthcare, automotive, and retail. I design and deploy ETL/ELT and streaming solutions on AWS, Azure, and GCP, implement lakehouse architectures with Delta Lake and Databricks, and automate CI/CD to reduce deployment time and costs.
I lead cross-functional initiatives, mentor engineers, and drive self-service analytics adoption while ensuring governance and compliance (HIPAA, GDPR, SOX). My work has delivered real-time analytics, reduced latency from hours to seconds, and produced substantial cloud cost savings through monitoring and optimization.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Pfizer
Jun 2022 - Present (3 years 3 months)
Led migration of legacy pipelines to AWS, built streaming solutions with Kafka and Spark to enable real-time marketing features, and implemented Databricks Lakehouse and Delta Lake to improve governance and reduce ETL latency across retail and inventory data.
Data Engineer
BofA
Aug 2018 - May 2022 (3 years 9 months)
Built scalable pipelines processing 3B+ sensor records/month using Azure Data Factory, Databricks, and Kafka; implemented a medallion architecture that improved data freshness and reduced processing time by 40%.
ETL Developer / Data Engineer
Novartis
May 2017 - Jul 2018 (1 year 2 months)
Developed Informatica and SSIS ETL workflows for clinical and commercial analytics, and created reusable Mapplets and S3-to-Redshift connectors to standardize ingestion and improve ETL cycle speed by 40%.
Education
Degrees, certifications, and relevant coursework
Texas State University
Master of Science, Engineering Management
Program focused on technical leadership and engineering operations.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
AWS Step Functions
GitHub
Kubernetes
Jenkins
GitHub Actions
PySpark
DBeaver
dbt
PostgreSQL
MongoDB
Cassandra
Hadoop
Gmail
Rollout
Databricks
Terraform
AWS CloudFormation
Jira
JSON
XML
MLflow
Kafka
PagerDuty
Grafana
Linux
macOS
Windows
Avro
AWS Lambda
Serverless
pytest
Airflow
Time Analytics
SQL
ServiceNow
AWS KMS
Delta Lake
Great Expectations
