Pedro Soares
@pedrosoares
Data Scientist focused on AI models and data mining.
What I'm looking for
As a dedicated Data Scientist, I specialize in the development of AI models and data mining, with a robust background in managing the entire lifecycle of data science projects. My experience includes creating over 50 large-scale crawlers for the healthcare industry, collaborating with international teams to deliver impactful solutions. I am proficient in utilizing advanced technologies such as Apache Airflow and GCP to run and monitor scrapers efficiently.
Throughout my career, I have designed custom data pipelines and scrapers for Brazil's largest hospital chains, leveraging tools like AWS, Apache Kafka, and various Python frameworks. My commitment to innovation is evident in my independent development and deployment of LLMs and chatbots, which have significantly automated client systems. I hold a Bachelor's degree in Statistics from the Federal University of Minas Gerais, where I honed my analytical skills and technical expertise.
Experience
Work history, roles, and key accomplishments
Data Engineer
Engineer Access
Sep 2024 - Present (9 months)
- Developed and applied NLP models for advanced data filtering, grouping, and categorization in large-scale unstructured data
- Implemented Apache Airflow, Docker, and Elasticsearch + Kibana to orchestrate and monitor 300+ web scrapers
- Built and optimized 50+ web scrapers tailored to the healthcare industry, using tools such as Scrapy, Selenium, Selectolax, BS4, Playwright, and Golang Colly
Junior Systems Development Analyst
Arkmeds
Dec 2023 - Sep 2024 (9 months)
Designed custom data pipelines and scrapers using AWS, Apache Kafka, and SQL for Brazil’s largest hospital chains. Developed and deployed LLMs and machine learning models to integrate and automate client systems.
Software Developer Intern
Pampulha Valley
May 2023 - Dec 2023 (7 months)
Assisted in the development of data processing pipelines and log analysis systems with Elasticsearch and Grafana. Maintained and created PostgreSQL and Redis databases, and developed scripts for data mining and scraping with Python.
Education
Degrees, certifications, and relevant coursework
Federal University of Minas Gerais
Bachelor in Statistics, Statistics
2024 -
Bachelor in Statistics with a focus on data processing and analysis. Involved in the development of log analysis systems and data pipelines.
Centro Federal de Educação Tecnológica de Minas Gerais
IT Technical Course, Information Technology
2020 - 2022
IT Technical Course focusing on data mining and scraping techniques using Python and Shell Script.
Website
pedrosrs.dev
