Data Analyst, Data Engineer, Analytics Engineer, LLMs and Databases
Saket Kumar
@saketkumar
AI Data Engineer with 4+ years of experience building scalable ETL pipelines & data models using Python, PySpark, MySQL, Databricks, AWS & Azure.
What I'm looking for
AI Data Engineer
Experience
Work history, roles, and key accomplishments
Optimized high-volume pipelines (1M+ payments/week) and built Customer 360 entity resolution using PySpark, Databricks, MySQL & Snowflake, improving KYC accuracy and credit scoring while reducing duplicates. Designed Snowflake data models that cut query latency by 45%. Engineered ML feature scaling on 10M+ records and contributed to Agile SDLC practices.
Built and optimized real-time Spark pipelines on Databricks (Azure) using Python, PySpark & Kafka, reducing ETL time by 23.6%. Ingested ~5 TB of data (CSV/JSON/Parquet) from S3 and MySQL into Spark and Redshift via AWS Glue & Lambda. Wrote pytest-based unit tests for 10+ scripts per month, improving code quality and reliability.
Education
Degrees, certifications, and relevant coursework
Galgotias University
Bachelor of Technology, Computer Science
2019 - 2023
Grade: 8.14/10
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Portfolio
https://github.com/SaketKr-On-Git
Salary expectations
Social media
Job categories
