Harish Ravi
@harishravi
Data engineer with 8+ years building production ETL/ELT pipelines for analytics and ML.
What I'm looking for
I’m a data engineer with 8+ years of experience building production-grade data platforms and analytics-ready data products. I turn fragmented source systems and high-volume operational data into trusted, well-modeled outputs that support reporting, automation, machine learning, and business decision-making.
I’m at my best when the problem is messy—sources are inconsistent, requirements move fast, and the business still expects the data to be accurate, fast, and dependable. I don’t see data engineering as “moving data”; I design for downstream usability, long-term maintainability, and production reliability as analytics, AI, and operational demands grow.
In my recent work at Accenture (Senior Software Engineer / Data Engineer), I designed and maintained Python- and SQL-based ETL/ELT workflows feeding analytics, automation, and AI-oriented use cases. I built cloud-native batch, incremental, and near-real-time pipelines with schema-change resilience, monitoring/recovery logic, and strong data quality checks (freshness, completeness, duplicates, null-rate shifts, schema drift, and anomalies).
At IBM (Senior Software Engineer / Data Platform Engineer) and Crescent Financial Systems (Software Engineer / Data Engineer), I strengthened warehouse-friendly modeling (star/snowflake-style dimensional structures), improved performance through tuning and refactoring, and supported ML by preparing model-ready datasets and reliable data access patterns. I consistently focus on observability, lineage awareness, and operational resilience—because analytics and ML are only as good as the consistency of the data behind them.
Experience
Work history, roles, and key accomplishments
Designed and maintained ETL/ELT workflows in Python and SQL to transform multi-source operational data into analytics-ready, curated datasets. Built cloud-native batch/incremental/near-real-time pipelines with schema-evolution handling, stronger data quality/observability, and support for ML feature and model-ready dataset preparation.
Built and improved Python- and SQL-based data pipelines and transformation workflows to normalize inconsistent multi-source data for enterprise analytics and operational reporting. Contributed to streaming/event-driven ingestion, curated warehouse-friendly models, and AI/ML dataset preparation while improving pipeline health checks, failure visibility, and performance.
Software Engineer / Data Engineer
Crescent Financial Systems
Jul 2018 - Mar 2020 (1 year 8 months)
Built and maintained SQL- and Python-based reporting and analytics data workflows for a fintech environment where data accuracy and timeliness were critical. Implemented validation, exception handling, and performance improvements for batch pipelines while shaping transaction and operational datasets into consistent, dashboard-ready outputs.
Education
Degrees, certifications, and relevant coursework
The University of Texas at Austin
Master of Science, Computer Science
2020 - 2024
Earned a Master of Science in Computer Science at The University of Texas at Austin from 2020 to 2024.
University of Maryland, College Park
Bachelor of Science, Economics (Quantitative Track)
2014 - 2018
Earned a Bachelor of Science with a major in Economics (Quantitative Track) and minors in Computer Science and Statistics at the University of Maryland, College Park from 2014 to 2018.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Harish?
You can contact Harish and 90k+ other talented remote workers on Himalayas.
Message HarishFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
