Truelogic is a leading provider of nearshore staff augmentation services, located in New York. Our team of 500 tech talents is driving digital disruption from Latin America to the top projects in U.S. companies. Truelogic has been helping companies of all sizes to achieve their digital transformation goals.
Would you like to make innovation happen? Have you ever dreamed of building Products that impact millions of users? Nice! Then we have a seat for you on our team!
Our Client is a well known US cloud-based provider network optimization management and analytics solution company.
What are you going to do?
We are seeking an experienced Web Scraping Manager to lead our web data extraction initiatives. In this role, you will be responsible for managing a team of web scrapers, ensuring efficient and accurate data extraction from various sources across the internet.
- Track hours and manage time cards for each scraper
- Ensure team members meet productivity goals
- Prioritize and assign scraping tasks based on product requirements
- Ensure scrapes with errors are addressed promptly and effectively
What will help you succeed
- Handle escalations and provide technical guidance to/from the scraping team
- Manage the code review queue, ensuring code submissions are correct, efficient, and adhere to best practices
- Oversee the movement of scraped outputs from staging to production databases
- Utilize CSS and XPath selectors to navigate and extract data from complex HTML structures
- Ensure data quality by implementing validation and cleansing processes
- Mentor and guide new team members in web scraping techniques and coding practices
- Minimum of 5 years of experience in web scraping and data extraction using Python
- Proficient in CSS and XPath selectors for data extraction from HTML/XML documents
- Experience with ScraPy, Git version control and pull request (PR) workflows