Andrey ChauzovAC
Looking for a job

Andrey Chauzov

@andrewchauzov

Data Scientist and Machine Learning Engineer | NLP

France
Message

What I'm looking for

I'm seeking a role at an early-stage data-driven startup that combines Data Scientist and ML Engineer responsibilities, allowing me to work on data analysis and machine learning model development.

- I have over 10 years of experience in Data Science & Analytics, with 7 years dedicated to Python & Machine Learning and 5 years to Deep Learning & Engineering.
- My recent focus has been on NLP, where I have honed my skills in BERT, Transformers, Transfer Learning, and LLM APIs.
- I have a strong foundation in machine learning methodologies and statistics.
- My expertise includes SQL, managing large datasets, API development, and deploying solutions on GCP/AWS.
- I am proficient in communicating complex data insights and enjoy mentoring others.

Skills & Expertise:
- Proficient in machine learning, specializing in predictive modeling, Scikit-Learn, gradient boosting, model selection, validation, permutation importance, SHAP values, and optimization using Hyperopt and Optuna.
- Experienced in data science tasks, including regression, classification, clustering, time series analysis, dimensionality reduction (PCA, UMAP), and anomaly detection.
- Skilled in developing and deploying models, optimizing for scalability, and proficient in tools like Docker, FastAPI, Flask, and MLflow.
- Familiar with statistics, with expertise in SciPy, Bayesian statistics, hypothesis testing, probability theory, and statistical modeling.
- Experienced in utilizing Python libraries such as NumPy and Pandas for data manipulation and mining and Selenium for web scraping.
- Knowledgeable in natural language processing (NLP) techniques, including BERT, Transformers, Transfer Learning, and LLM APIs.
- Expertise in deep learning frameworks, including TensorFlow, PyTorch, and Keras, for developing neural networks.
- Comfortable with cloud computing platforms like GCP and AWS and proficiency in Linux (Ubuntu, Debian).
- Knowledgeable in software development practices and familiar with agile methodologies such as Scrum.
Proficient in using Git for version control and Jupyter Notebook for interactive data analysis.
- Skilled in data visualization using tools like Plotly, Seaborn, and Matplotlib.
- Familiar with BigQuery, PL/SQL, and Redis databases.
- Competent in stakeholder interaction and reporting.
- Experienced in network analysis and graph theory.

Experience

Toptal logoTO
Current

Freelance Artificial Intelligence Engineer

Mar 2020 - Present (4 years 2 months)

- Formulated a BERT-based approach for Instagram profile categorization, achieving over 80% accuracy across 50+ groups enhancing ad campaigns.
- Engineered over 15 predictive models for 100+ million IDs via GCP using PL/SQL and regex, ensuring scalability and efficient logging.

UP

Data Scientist | Python | Machine Learning | Deep Learning | Programming

Jan 2017 - Mar 2020 (3 years 2 months)

- Applied NumPy to custom sale volumes clustering algorithm, analyzing over 10k time series, resulting in up to a 15% increase in sales effectiveness.
- Trained a CatBoost model for employee churn prediction task with 85% accuracy, providing HR with actionable insights using the LIME library.

Cred Investments logoCI

Data Scientist | LightGBM | API Development | Data Visualization | Regression

Nov 2020 - Oct 2023 (2 years 11 months)

- Combined classification analysis and tree embeddings for in-game location data, identifying player roles with ~90% accuracy and improving clustering.
- Assembled a 'team profile' algorithm using gradient boosting and statistics, achieving 85% accuracy in pinpointing team weaknesses within a league.

BBDO Group logoBG

Junior Data Analyst | SQL | Cluster Analysis | Regression Analysis

Jul 2011 - Feb 2012 (7 months)

- Architected and implemented aggregation logic using SQL and VBA, incorporating correlation analysis, resulting in a 2.5x data expansion.
- Initiated cluster analysis to estimate pre-campaign efficiency, leading to more effective budget allocation across over 10 campaigns.
- Revised ROI prediction regression model using Excel/VBA, achieving a 25% increase in campaign efficiency.

Association 'Non-Profit Market Council' logoAC

(Senior) Data Analyst | Time Series Analysis | Regression Analysis | Mentorship

Feb 2012 - Dec 2016 (4 years 10 months)

- Conducted ARIMA-based time series analysis for anomaly detection, leading to a 50% increase in prediction accuracy and improved data quality.
- Enhanced power price/volume forecasting models using feature engineering, achieving up to a 2.5x reduction in error rates for targeted regions.

Find your dream job

Sign up now and join thousands of other remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan