Sarthak Vinchurkar
@sarthakvinchurkar
Data Engineer passionate about innovative data solutions and analytics.
What I'm looking for
I am a dedicated Data Engineer with a strong foundation in data architecture and analytics, currently working at Bajaj Finserv Health. My journey in data engineering has been marked by significant achievements, including the development of a highly scalable PII tokenization solution that not only secured data flows but also saved the company substantial licensing costs. I take pride in my ability to lead large-scale data migrations, ensuring data integrity and efficiency while mentoring interns to foster a collaborative learning environment.
Throughout my career, I have consistently focused on optimizing data processes and reducing costs. For instance, I successfully reduced ETL job runtimes by over 50% and implemented innovative solutions that cut processing costs by up to 70%. My technical expertise spans various tools and technologies, including Azure Databricks, Spark, and Delta Lake, which I leverage to build robust ETL pipelines and data lakes. I am passionate about driving data-driven decision-making and continuously improving data workflows.
Experience
Work history, roles, and key accomplishments
Data Engineer
Bajaj Finserv Health
Nov 2024 - Present (8 months)
Architected Privacy Vault, a highly scalable PII tokenization solution using Cassandra DB and UUID, which helped secure data flows and comply with privacy laws entirely in-house, saving approximately ₹30 lakh/year in licensing costs. Engineered a meticulous Data Discovery & PII detection framework in-house achieving 94% F1-score with RegEx and NER-based tagging, eliminating external tool costs and
Associate Data Engineer
Bajaj Finserv Health
Jul 2023 - Oct 2024 (1 year 3 months)
Reduced ETL job runtimes by over 50% by moving from Azure Synapse pipelines to parallelized Airflow DAGs. Reduced processing costs and query execution times for critical business functions by up to 70% by implementing trigger-based CDC, data partitioning, and Z-ordering in Delta Lake tables.
Data Engineer Apprentice
Bajaj Finserv Health
Jul 2022 - Jun 2023 (11 months)
Co-developed a modern Data Lakehouse system processing terabytes of data from diverse SQL and NoSQL sources, leveraging Delta Lake, Databricks, and Trino. Designed and implemented robust ETL pipelines integrating data from Salesforce, Oracle, MongoDB, PostgreSQL and other sources using Azure Data Factory, Airflow and ADLS Gen2.
Education
Degrees, certifications, and relevant coursework
Rajiv Gandhi Proudyogiki Vishwavidhyalaya
B.Tech (Computer Science), Computer Science
Grade: 8.66 CGPA
Completed a Bachelor of Technology in Computer Science, achieving First Division with Honours and an 8.66 CGPA. The curriculum covered core computer science principles and practical applications.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Social media
Job categories
Interested in hiring Sarthak?
You can contact Sarthak and 90k+ other talented remote workers on Himalayas.
Message SarthakFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
