sahil jainSJ
Open to opportunities

sahil jain

@sahiljain

Experienced Senior Data Engineer with expertise in cloud platforms and data solutions.

India
Message

I am a Senior Data Engineer with over 5 years of experience in the Retail and Entertainment sectors. I have a strong background in designing and implementing scalable, secure, and cost-effective data solutions on cloud platforms like AWS and GCP. My expertise lies in AWS Data Engineering and services such as S3, RDS, EMR, EC2, and Dynamo DB. I also have hands-on experience with ETL tools like Databricks Workflows and Apache Airflow.

As a practitioner in the field, I have successfully cleaned and performed exploratory data analysis on large datasets. I have built ETL pipelines utilizing various data engineering tools. I am skilled in data mining, visualization, stakeholder management, and business intelligence. I am also proficient in predictive modeling, data cleaning, data structuring, and cross-functional coordination.

I am well-versed in agile practices and have experience in Hadoop, data pipelines, deployment, and framework maintenance. I have a strong background in data modeling, scheduling, and workflows. My technical skills include Python, BigData, PySpark, Apache Spark, MySQL, Elastic Store, Mongo DB, RDBMS, Hadoop, Sqoop, Pandas, Numpy, Excel, Power BI, Matplotlib, Seaborn, Airflow, Databricks, Linux, and Bitbucket.

Experience

KL

Senior Data Engineer

Koantek Cloud and AI Pvt Ltd

Refined data quality checks by implementing Python & SQL in AWS S3 & DynamoDB, optimizing data validation, deduplication, transformation processes. Boosted data migration efficiency by 50% through PySpark & Spark SQL, & AWS Glue, to execute intricate data transformations, consolidating & linking multiple datasets. Enhanced data accuracy by 20% through migrating client data from GCP BigQuery to Dat

DU

Consultant

Deloitte USI

Enhanced data accuracy by 20% through the implementation of a new algorithm using Python & the Fuzzy Wuzzy library. Enhanced data accuracy by 30% by developing a keyword specific model on Amazon using fuzzy-wuzzy algo & pandas for retrieving information. Improved data-driven decision-making through managing advanced analytics, visualization, & structured databases in Power BI. Implemented real-tim

TS

System Engineer

Tata Consultancy Services

Led raw data analysis using Power BI & SQL to identify cost-saving opportunities, market trends, & drive strategic decision-making. Developed Python logic using Pandas & Boto3 to extract 500GB of data in snappy format & seamlessly upload it to an S3 bucket. Implemented Python & Selenium automation framework, boosting productivity by 40% & populating data into MongoDB, CSVs, & database tables.

Tech stack

Learn about the tools and technologies that sahil likes to use.

Find your dream job

Sign up now and join thousands of other remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan