Venkat Samir
@venkatsamir
Senior Data Engineer specializing in Big Data, Azure, and Scala.
What I'm looking for
I am a Senior Data Engineer with over six years of experience designing, developing, and optimizing large-scale data pipelines and distributed systems using Hadoop, Spark, Scala, Python, and cloud platforms like Azure and AWS.
I have delivered end-to-end data solutions across healthcare, automotive, and financial domains — building Azure-based data lakes, Databricks pipelines, real-time Kafka streaming, and ML-enabled processing while ensuring security and regulatory compliance such as HIPAA and GDPR.
I focus on performance tuning, CI/CD automation, and production-grade architecture, and I bring hands-on experience with tooling including Azure Data Factory, Synapse, Databricks, Spark MLlib, Akka, Solr, Splunk, and Microsoft Fabric to drive reliable, scalable analytics and operational systems.
Experience
Work history, roles, and key accomplishments
Sr. Data Engineer
Amaris Consulting
Mar 2024 - Present (2 years)
Built a centralized Azure data lake and end-to-end data pipelines using Databricks, Synapse, ADF and Functions to enable FHIR/HL7 integrations and improve analytics; implemented CI/CD, data security (HIPAA/GDPR) and optimized Scala/Akka data processing for production workloads.
Data Engineer
Toyota Motors
Sep 2022 - Feb 2024 (1 year 5 months)
Designed and maintained Big Data pipelines on Hadoop and Azure, implemented real-time Spark streaming from Kafka, and built ETL/ADF/Databricks solutions and data warehouses to support analytics and fraud detection use cases.
Education
Degrees, certifications, and relevant coursework
Indus University
Bachelor of Technology, Information Technology
Completed a Bachelor of Technology in Information Technology, graduating in April 2020 from Indus University in Ahmedabad, Gujarat.
Tech stack
Software and tools used professionally
Splunk
Azure Synapse
Apache Spark
Talend
D3.js
Azure Storage
AWS Step Functions
GitHub
GitLab
Kubernetes
GitHub Actions
GitLab CI
Jupyter
Pandas
PySpark
DB
Sqoop
MySQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Yarn
Databricks
AWS CloudFormation
PyCharm
Spyder
Java
JSON
PowerShell
XML
Apache Flume
Kafka
Ambari
SQLAlchemy
Zookeeper
CentOS
Linux
macOS
Windows
Solr
Avro
Serverless
Azure Functions
Microsoft Excel
Azure SQL Database
Apache Storm
pytest
Airflow
Time Analytics
SQL
Azure Blob Storage
Akka
Cosmos
Transform
Enhance
Microsoft Fabric
Dynamic
Factory
Unify
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Venkat?
You can contact Venkat and 90k+ other talented remote workers on Himalayas.
Message VenkatFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
