Miss Irungu
@graceirungu96
Data Engineer building AWS-based pipelines, CDC systems, and SaaS data infrastructure for real-time analytics.
What I'm looking for
Data Engineer specializing in cloud data engineering, real-time data processing, and SaaS style data systems. I design and build end-to-end data pipelines that move operational and streaming data into reliable analytics platforms.
My experience includes building Change Data Capture pipelines using AWS DMS and Snowflake for near real-time data ingestion, designing lakehouse architectures using Databricks Medallion framework, and implementing event-driven pipelines using AWS Lambda and Kinesis.
I have worked on large-scale data systems involving data validation, schema enforcement, and PII masking to ensure data quality and governance in production environments.
I am focused on roles involving data infrastructure, scalable pipelines, and systems that support product analytics, operational intelligence, and business decision making.
Experience
Work history, roles, and key accomplishments
Data Engineering Consultant
Freelance
Jan 2024 - Present (2 years 5 months)
Built a real-time Change Data Capture pipeline using AWS DMS streaming from RDS into Snowflake, reducing latency from 24 hours to under 5 minutes. Implemented Databricks lakehouse architecture, plus AWS Lambda and Kinesis event pipelines with schema validation and PII masking for governed ingestion.
AI Infrastructure Instructor
ISACA Kenya Chapter
Jan 2025 - Dec 2025 (11 months)
Designed and delivered data engineering training for 50+ professionals covering Apache Airflow, Python orchestration, and cloud pipelines. Built AWS-based training environments (S3, RDS, IAM) to simulate production data systems and guide end-to-end pipeline development.
Head of Data Partnerships
Adamur
Sep 2024 - Jan 2025 (4 months)
Designed cross-system data integration frameworks to support structured data exchange between partner organizations. Built Python-based automated reporting pipelines and defined data architecture standards to ensure consistency, traceability, and governance compliance.
Airflow Data Orchestration Lead
Scratch and Script Limited
Aug 2024 - Dec 2024 (4 months)
Built Airflow-based orchestration pipelines for automated data movement and transformation workflows, standardizing reusable pipeline design patterns across teams. Simulated production-grade AWS environments and mentored engineers on ingestion and orchestration workflow design.
Project Manager & Technical Lead
Smartone.ai
Jul 2024 - Sep 2024 (2 months)
Managed data preparation pipelines for AI model training, ensuring dataset quality and consistency requirements. Coordinated engineering and labeling stakeholders, improved data cleaning to enhance training accuracy, and documented lineage, transformation logic, and validation processes for auditability.
Data Governance & ML Infrastructure
Samasource
Aug 2017 - Jan 2024 (6 years 5 months)
Built large-scale data validation pipelines for machine learning datasets to meet production quality standards. Developed Python ETL frameworks and automated filtering to prevent corrupted data from entering training pipelines while optimizing cloud workflows with structured logging and performance-oriented batching/indexing.
Education
Degrees, certifications, and relevant coursework
University of the People
Bachelor of Science in Computer Science, Computer Science
Bachelor's degree in Computer Science from the University of the People.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Portfolio
grace-irungu-portfolio.netlify.appSocial media
Job categories
Skills
Interested in hiring Miss?
You can contact Miss and 90k+ other talented remote workers on Himalayas.
Message MissFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
