Diwas Rai
@diwasrai
Senior Data Engineer specializing in cloud data platforms, Lakehouse architectures, and AI-ready pipelines.
What I'm looking for
I am a Senior Data Engineer with 5+ years of experience building and scaling cloud data platforms, enterprise data warehouses, and AI-ready ecosystems across healthcare, finance, insurance, and retail. I design batch, real-time, and event-driven pipelines on Azure, AWS, and GCP using Databricks, Snowflake, Delta Lake, and Lakehouse architectures while enforcing data governance and compliance (HIPAA, SOX).
I have delivered large-scale projects, including integrating 50M+ records, processing 10TB+ of daily transactional data, and enabling predictive analytics, while optimizing performance, automating CI/CD, and mentoring junior engineers. I bring hands-on expertise in Python, PySpark, SQL, ML/LLM integrations, RAG pipelines, Infrastructure-as-Code, and cost- and governance-minded cloud architecture.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
State Farm Insurance
Jan 2023 - Present (3 years 2 months)
Led a data modernization project, building Azure Data Factory pipelines that integrated 50M+ records from 12+ sources and reduced report generation time by 40%. Enabled Snowflake and Databricks solutions that processed 10TB+ of data daily for predictive analytics.
Data Engineer
TIAA
May 2020 - Dec 2022 (2 years 7 months)
Designed and developed end-to-end Azure Data Factory and Databricks pipelines, improving ETL automation by 40% and boosting Snowflake query performance by 25%. Migrated on-prem systems to Azure, reducing costs by 20%.
ETL Developer
Pfizer
Aug 2019 - Apr 2020 (8 months)
Developed Informatica and SSIS ETL processes integrating 20M+ records into the data warehouse, reducing errors by 35% and improving ETL processing times by 30% through SQL and PL/SQL optimizations.
Education
Degrees, certifications, and relevant coursework
Southeast Missouri State University
Master of Science, Technology Management
Master of Science in Technology Management emphasizing technology leadership and management practices.
Tech stack
Software and tools used professionally
Azure Synapse
Apache Spark
AWS Glue
Azure RBAC
AWS Step Functions
GitHub
Jenkins
GitHub Actions
NumPy
Pandas
PySpark
MongoDB
Microsoft SQL Server
Gmail
Databricks
Terraform
AWS CloudFormation
Azure DevOps
JavaScript
Java
MATLAB
Azure Machine Learning
TensorFlow
Azure Monitor
Ansible
AWS Lambda
Azure SQL Database
OAuth2
Airflow
Time Analytics
s3-lambda
SQL
Microsoft Visio
Azure Blob Storage
Hugging Face
Apache Iceberg
LangChain
LlamaIndex
Weaviate
Pinecone
Delta Lake
Great Expectations
Azure Logic Apps
Bicep
Bash
Farm
Factory
Jan
Movement
