Shiv Hari Baral
@shivharibaral
I am a Senior Data Engineer delivering secure, scalable cloud data platforms and self-service analytics.
What I'm looking for
I am a Senior Data Engineer with 6+ years building secure, scalable data platforms across AWS, Azure, and GCP to enable executive-ready analytics and compliant data access.
I have delivered both real-time and batch ETL/ELT pipelines, tuned data warehouses to reduce query times by up to 60%, modularized transformations with dbt to cut query time by 45%, and built centralized data lakes for multi-domain analytics.
I prioritize data quality, governance, and security—implementing Great Expectations, Lake Formation, KMS, IAM, and vaulting solutions—while shipping reproducible infrastructure with Terraform, CI/CD, and automated monitoring to reduce MTTR and SLA breaches.
I am an Agile collaborator and mentor who partners with product and ML teams to produce feature-ready datasets, deploy RAG and predictive models, and drive trustworthy, production-grade analytics across regulated industries.
Experience
Work history, roles, and key accomplishments
Designed and deployed a centralized data lake and AWS-native ETL platform, tuning Redshift and introducing dbt to reduce weekly sales report execution time by 60% and query time by 45%. Implemented data quality frameworks and Terraform-driven infrastructure, reducing reconciliation effort by 30% and maintaining 100% HIPAA/compliance adherence.
Data Analytics Engineer
Lockheed Martin
Jan 2021 - Apr 2023 (2 years 3 months)
Built multi-cloud real-time and batch ingestion pipelines into S3, ADLS Gen2, and GCS and containerized secure processing on EKS, standardizing schema-on-read patterns for analytics-ready datasets. Established unified metadata and lineage across Glue Catalog and Azure Purview, and delivered executive dashboards and monitoring to support governance and anomaly detection.
Data Engineer
Berkshire Hathaway
Aug 2018 - Dec 2020 (2 years 4 months)
Migrated legacy clinical ETL to AWS Glue, Step Functions, and Lambda, reducing end-to-end batch processing time by 60% and cutting operational dashboard lag by 35%. Implemented HIPAA-compliant data lake and reproducible Terraform deployments, reducing manual environment setup time by 70% and improving auditability with dbt lineage.
Education
Degrees, certifications, and relevant coursework
Lamar University
Master of Science, Computer Science
Master of Science in Computer Science from Lamar University.
Tech stack
Software and tools used professionally
Amazon API Gateway
Amazon Redshift
Fivetran
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
AtScale
Data Studio
Amazon Quicksight
AWS IAM
Google Cloud Platform
Amazon CloudWatch
Amazon S3
Google Cloud Storage
AWS Step Functions
GitHub
Bitbucket
Kubernetes
AWS CodePipeline
Jenkins
GitHub Actions
Jupyter
PySpark
AWS Glue DataBrew
AWS Data Pipeline
dbt
HBase
Gmail
Google Analytics
Databricks
Adobe Analytics
Dist
Terraform
AWS CloudFormation
Azure DevOps
Jira
Java
JSON
XML
TensorFlow
MLflow
scikit-learn
Kafka
Amazon SNS
Prometheus
Azure Monitor
Windows
Datadog
AWS X-Ray
Amazon Kinesis
Amazon Kinesis Firehose
Avro
AWS Lambda
Serverless
Airflow
s3-lambda
Google BigQuery
Amazon EMR
SQL
ServiceNow
AWS KMS
Mode Analytics
LangChain
Ray
Delta Lake
Availability
Location
Authorized to work in
Website
shivharibaral.comJob categories
Skills
Interested in hiring Shiv Hari?
You can contact Shiv Hari and 90k+ other talented remote workers on Himalayas.
Message Shiv HariFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
