Pedro SoaresPS
Open to opportunities

Pedro Soares

@pedrosoares

Data Scientist focused on AI models and data mining.

Brazil

What I'm looking for

I am seeking a role that fosters innovation and collaboration, where I can leverage my data science skills to drive impactful projects and contribute to a dynamic team.

As a dedicated Data Scientist, I specialize in the development of AI models and data mining, with a robust background in managing the entire lifecycle of data science projects. My experience includes creating over 50 large-scale crawlers for the healthcare industry, collaborating with international teams to deliver impactful solutions. I am proficient in utilizing advanced technologies such as Apache Airflow and GCP to run and monitor scrapers efficiently.

Throughout my career, I have designed custom data pipelines and scrapers for Brazil's largest hospital chains, leveraging tools like AWS, Apache Kafka, and various Python frameworks. My commitment to innovation is evident in my independent development and deployment of LLMs and chatbots, which have significantly automated client systems. I hold a Bachelor's degree in Statistics from the Federal University of Minas Gerais, where I honed my analytical skills and technical expertise.

Experience

Work history, roles, and key accomplishments

EA
Current

Data Engineer

Engineer Access

Sep 2024 - Present (9 months)

-Developed and applied NLP models for advanced data filtering, grouping, and categorization in large-scale unstructured data
-Implemented Apache Airflow, Docker and Elasticsearch + Kibana to orchestrate and monitor 300+ web scrapers
-Built and optimized 50+ web scrapers tailored to the Healthcare industry, using several tools such as Scrapy, Selenium, Selectolax, BS4, Playwright and Golang Colly

AR

System Development Analyst Junior

Arkmeds

Dec 2023 - Sep 2024 (9 months)

Designed custom data pipelines and scrapers using AWS, Apache Kafka, and SQL for Brazil’s largest hospital chains. Developed and deployed LLMs and machine learning models to integrate and automate client systems.

PV

Software Developer Intern

Pampulha Valley

May 2023 - Dec 2023 (7 months)

Assisted in the development of data processing pipelines and log analysis systems with Elasticsearch and Grafana. Maintained and created PostgreSQL and Redis databases, and developed scripts for data mining and scraping with Python.

Education

Degrees, certifications, and relevant coursework

Federal University of Minas Gerais logoFG

Federal University of Minas Gerais

Bachelor in Statistics, Statistics

2024 -

Bachelor in Statistics with a focus on data processing and analysis. Involved in the development of log analysis systems and data pipelines.

Centro Federal de Educação Tecnológica de Minas Gerais logoCG

Centro Federal de Educação Tecnológica de Minas Gerais

IT Technical Course, Information Technology

2020 - 2022

IT Technical Course focusing on data mining and scraping techniques using Python and Shell Script.

Find your dream job

Sign up now and join over 85,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Pedro Soares - Data Engineer - Engineer Access | Himalayas