I am looking for a fully remote software or data engineering role where the technical work is complex and challenging.
Fawwaz Atha
@fawwaz
Software Data Engineer with production experience building data pipelines and web scraping infrastructure for financial platforms.
What I'm looking for
I am a Software Data Engineer with hands-on experience building production-grade data engineering and scraping infrastructure. At Supertype AI, I built end-to-end scraping pipelines for Sectors, a financial data platform covering IDX- and SGX-listed companies. These ranged from lightweight HTTP pipelines with LLM-based preprocessing to full async browser automation with Playwright against heavily bot-protected web applications.
Experience
Work history, roles, and key accomplishments
Data Scientist
Supertype AI
Jan 2026 - Present (4 months)
Built and maintained end-to-end data scraping pipelines for IDX and SGX, including financial news and filings, using checkpoint-resume patterns and scheduled GitHub Actions jobs. Implemented an LLM-based news preprocessing pipeline and improved Sectors API v2 efficiency by 10%. Also developed WhatsApp workflow automation and Playwright scraping with human-behavior simulation.
AI Engineer Intern
Zettabyte Pte Ltd
Dec 2024 - May 2025 (5 months)
Improved chatbot data analysis workflows using LangChain agents, raising data retrieval accuracy by 10%. Built a RAG system with semantic document upload and retrieval (context quality +15%), and implemented AstraDB-backed job recommendation matching plus Flask cron automation that reduced manual effort by 20%.
Business Analyst Intern
DXYARY
Oct 2024 - Dec 2024 (2 months)
Conducted data analysis and implemented forecasting models, contributing to a 10% improvement in management decision-making. Built a Streamlit-based data management app integrated with MySQL (+15% operational efficiency), and integrated Gemini Vertex AI so users could query database insights via natural language prompts.
Data Science Analyst Intern
Imperial Healthtech
Jul 2024 - Sep 2024 (2 months)
Developed Streamlit dashboards integrating NoSQL data, improving user decision-making speed by 20%. Improved data integrity by 15%, built an internal hospital analytics dashboard that increased operational efficiency by 15%, and executed data transformations that reduced reporting errors by 10%.
Education
Degrees, certifications, and relevant coursework
Universitas Dian Nuswantoro
Bachelor of Computer Science, Information Systems
2021 - 2025
Grade: 3.84/4.0
Activities and societies: Relevant coursework: Data Science, Data Analytics, Machine Learning, Deep Learning. Finalist (Top 10): FindIT UGM 2024, Isfest UMN 2024, MCF ITB 2024. FindIT UGM 2025 (Top 7 Kaggle leaderboard). Global AI Hackathon '25 (Elucidata, Top 26 Kaggle leaderboard).
Bangkit Academy
Machine Learning Cohort Study, Machine Learning
2024 - 2024
Grade: Top 10% distinction
Activities and societies: Bangkit Tribe member
Completed the Machine Learning cohort study at Bangkit Academy, graduating with a top 10% distinction after finishing 12 courses covering data analytics, machine learning, and deep learning.
