Looking for a job

Dipankar Kalita

@dipankarkalita

Data Engineer | Expert in Big Data, AWS, Spark, Kafka, Airflow | Building Scalable Data Pipelines

India

What I'm looking for

I’m seeking a challenging role in data engineering that allows me to work with large-scale systems, contribute to building efficient data pipelines, and continue developing my skills in cloud technologies while driving business impact.

I am a Data Engineer with 4 years of experience in designing and implementing scalable, end-to-end data pipelines for both real-time and batch processing systems. My core expertise lies in Python, SQL, PySpark, Apache Spark, and Kafka, with hands-on experience in cloud platforms like AWS (Glue, EMR, Redshift, S3).

I’ve led data engineering efforts on e-commerce and banking projects, leveraging tools like Apache Airflow, Apache NiFi, and AWS services to build robust ETL frameworks. I focus on improving data quality, optimizing performance, and ensuring secure data workflows.

I am passionate about working with cutting-edge big data technologies, contributing to open-source projects, and continuously learning.

Let’s connect if you’re looking for a reliable and enthusiastic data engineer who thrives in dynamic, fast-paced environments.

Experience

Work history, roles, and key accomplishments

Current

Data Engineer

Smartbeings Software Innovation

Oct 2022 - Present (3 years 3 months)

Delivered a fully automated and reliable data platform by designing end-to-end ETL pipelines with Apache Airflow, AWS Glue, and PySpark on EMR. Ingested large-scale data from RDBMS and FTP sources into S3 and Redshift; improved data quality through cleaning, validation, and fraud detection; and tuned performance using CloudWatch and EMR, reducing costs while maintaining security and compliance.


Trainee Engineer

Smartbeings Software Innovation

Apr 2022 - Oct 2022 (6 months)

As a trainee engineer, contributed to building a high-throughput system for real-time financial data by developing scalable pipelines with Kafka and Spark on HDFS. Assisted with PySpark jobs for data cleaning and enrichment, optimized storage in Parquet files and databases, and supported secure, low-latency data workflows over SFTP.

Education

Degrees, certifications, and relevant coursework


Gauhati University

Bachelor of Commerce, Management

2015 - 2018

Completed a Bachelor of Commerce (Management) program focusing on management and commerce fundamentals from 2015 to 2018.
