Tayiba R
@tayibar
Data Scientist with 4+ years building ML, NLP, and Generative AI solutions for fraud, risk, and RAG pipelines.
What I'm looking for
I’m a Data Scientist focused on delivering end-to-end machine learning, NLP, and Generative AI systems that solve real business problems. I’ve worked across research, financial services, and enterprise environments, turning messy data into reliable models and production-ready pipelines.
At SCORE Lab, I investigate predictive modeling and classification pipelines using LangGraph, LangChain, and Large Language Models, and I’ve used Python-based statistical workflows (including hypothesis testing, correlation analysis, and regression modeling) to process 50K+ records. I also built Retrieval-Augmented Generation pipelines with LangChain and AWS Lambda to support document intelligence research and reduce manual extraction overhead.
Previously at Northern Trust, I designed credit risk analytics and fraud detection systems using LightGBM, PySpark, and feature engineering, as well as real-time AML pipelines with Apache Kafka, Structured Streaming, and Delta Lake. I deployed customer segmentation models on AWS SageMaker, automated portfolio analytics with Azure Data Factory and dbt, and engineered RAG architectures with Pinecone Vector Database and semantic search to cut analyst research time.
Earlier, as an Associate Data Scientist at Symtrax, I built ETL pipelines with Python, Apache Airflow, and AWS Glue, developed KPI dashboards with Pandas and SQL, and delivered interpretable churn forecasting models using scikit-learn with SHAP. I’m currently pursuing a Ph.D. in Information Science (Data Science), and I bring a research-minded, experiment-driven approach to every model I ship.
Experience
Work history, roles, and key accomplishments
Data Scientist
SCORE Lab
Aug 2024 - Present (1 year 11 months)
Worked on predictive modeling and classification pipelines using LangChain/LangGraph and large language models on research datasets. Conducted statistical analysis and implemented RAG pipelines for document intelligence using LangChain and AWS Lambda.
Built credit risk analytics and fraud detection pipelines using machine learning and streaming technologies for financial datasets. Deployed customer segmentation and document-focused RAG architectures to reduce analyst and monitoring time.
Associate Data Scientist
Symtrax
Sep 2020 - Aug 2022 (1 year 11 months)
Built ETL pipelines and analytical reporting workflows using Python and cloud/data tools, including dashboards for business stakeholders. Developed predictive models for churn and applied NLP techniques with containerized classification scripts.
Education
Degrees, certifications, and relevant coursework
University of North Texas
Ph.D. in Information Science (Data Science), Information Science (Data Science)
2025 -
Ph.D. program in Information Science (Data Science) at the University of North Texas starting in 2025. Currently enrolled.
University of North Texas
Master of Science in Data Science, Data Science
2022 - 2024
Master of Science in Data Science at the University of North Texas from 2022 to 2024.
Malnad College of Engineering
Bachelor of Engineering in Electronics & Instrumentation Engineering, Electronics & Instrumentation Engineering
2016 - 2020
Bachelor of Engineering in Electronics & Instrumentation Engineering at Malnad College of Engineering from 2016 to 2020.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Tayiba?
You can contact Tayiba and 90k+ other talented remote workers on Himalayas.
Message TayibaGet matched with your dream remote job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
