Skip to main content
AB
Open to opportunities

Abin Bijo

@abinbijo

I’m a data engineer building real-time streaming pipelines and cloud analytics on AWS/Azure.

India
Message

What I'm looking for

I’m looking to build and optimize cloud-native data platforms—real-time + batch pipelines, strong governance/lineage, and self-service analytics—where I can reduce costs, improve latency, and enable BI/semantic layers for teams.

I’m a Data Engineer with 6+ years of experience designing and implementing real-time streaming systems, distributed data pipelines, and cloud-native analytics platforms on AWS and Azure. I build cost-efficient ETL/ELT pipelines and scalable ingestion frameworks that power analytics for millions of users.

Most recently, I engineered a batch ingestion platform using dlt to sync data from 6+ source systems into a centralized data lake, cutting onboarding time by ~75%, and built DBT models delivering 10+ analytics-ready datasets. I’ve also implemented semantic layers with Cube Core atop AWS Athena to unify REST/GraphQL APIs and reduce ad-hoc SQL requests by ~90%, deployed OpenMetadata for governance/lineage across 20+ environments, and orchestrated end-to-end workflows in Airflow across 20+ DAGs.

Experience

Work history, roles, and key accomplishments

SL
Current

Member of Technical Staff III

SMC Global Securities Ltd

Feb 2026 - Present (4 months)

Engineered a dlt-based batch ingestion platform syncing 6+ source systems into a centralized data lake, reducing onboarding time for new data sources by ~75%. Built DBT transformations, a Cube Core semantic layer over AWS Athena with REST/GraphQL APIs, and Airflow workflows across 20+ DAGs, alongside OpenMetadata governance and lineage for 20+ environments.

PL

Software Engineer 2

Pluralsight

Sep 2024 - Feb 2026 (1 year 5 months)

Designed real-time and batch pipelines with Kafka and Spark Streaming on AWS to process ~2M events/day for Pluralsight learning analytics. Migrated Materialize workloads to Spark Structured Streaming, reducing infrastructure costs by ~70% while maintaining sub-second latency, and standardized schemas via automated DBT models.

EM

Software Engineer

Embibe

Sep 2022 - Sep 2024 (2 years)

Built ETL pipelines ingesting 7+ data sources (Google Sheets, Excel, MongoDB, PostgreSQL, Kafka, EventHub) into a unified analytics layer. Delivered an EventHub + ADX real-time analytics platform with sub-second insights, and built 10+ dashboards in Power BI/Superset/ADX while cutting average query latency by ~80%.

EM

Associate Program Manager

Embibe

Sep 2020 - Sep 2022 (2 years)

Led Content Tech operations overseeing digitization of 410+ books with 595K+ learning objects, delivering the Aug 2021 milestone on schedule. Partnered with engineering and design teams on product requirements and delivery timelines, ran daily scrums, and resolved cross-team blockers while maintaining quality standards.

Education

Degrees, certifications, and relevant coursework

GT

Government Engineering College, Thrissur

B.Tech, Electrical & Electronics Engineering

2012 - 2016

Earned a B.Tech in Electrical & Electronics Engineering from Government Engineering College, Thrissur (2012–2016).

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan