At LegitScript, we are passionate about making the internet and payment ecosystems safer and more transparent. We help companies of all sizes keep their services legal and safe for consumers. To do this, LegitScript combines big data with the world’s leading team of experts skilled in highly regulated and complex sectors, including transaction laundering detection, pharmaceuticals, online gambling, and more.
The result? Unmatched accuracy and deep risk analysis that identifies which commercial entities play by the rules, and which do not. Our diverse industry partnerships provide unique insights that keep businesses and governments at the forefront of emerging trends. That’s why LegitScript is trusted by the world's largest search engines, internet platforms, payment companies, and regulatory agencies.
Overview:
We're an innovative technology incubator seeking an experienced and forward-thinking Sr Data Engineer specializing in Generative AI to join our team. In this role, you'll spearhead the development and implementation of cutting-edge AI solutions, with a primary focus on creating a sophisticated risk detection algorithm using large language models, Generative AI techniques, and traditional machine learning methods within our SaaS environment.
What You'll Do:
- Design, build, and maintain scalable data pipelines to ingest data from disparate sources into our data warehouse/lake.
- Research and develop high-performance machine learning models to solve complex business problems.
- Wrap models into production-ready APIs and integrate them into our core product.
- Implement automated workflows for data validation, model training, and continuous deployment (CI/CD for ML).
- Monitor pipeline latency and model drift, ensuring that the system remains performant and accurate as data evolves.
What You'll Bring:
- 5–8+ years in a Data Engineering or Data Science role, with a proven track record of shipping models to production.
- Advanced proficiency in Structured Query Language for complex data transformation and analysis.
- Hands-on experience with cloud-based data platforms such as Databricks or Snowflake.
- Experience with ETL and ELT tools or frameworks such as Lakeflow Declarative Pipelines, Databricks Autoloader, Informatica, Talend, or dbt.
- Strong proficiency in Python, Spark/PySpark, and DABs/Terraform for data processing and pipeline development.
- Strong understanding of data modeling, database design principles, and building curated datasets for analytics and operational use cases.
- Experience with DevOps practices including IAC, CI/CD, Git-based development, branching strategies, and code reviews.
- Proven history implementing continuous integration and continuous deployment for data pipelines and managing deployments across environments.
- Familiarity with orchestration and workflow tools such as Databricks Workflows or Airflow is preferred.
- Previous experience working with containerization technologies such as Docker
- Proficiency with ML experiment tracking tools like MLFlow or Weights & Biases
- Design ML models that do the heavy lifting—prioritizing tasks and automating risk assessment to make our operations smarter.
- Ensure every prediction is explainable, turning "black box" code into actionable "reason codes" for our end users.
- Partner directly with the teams using your tools to refine features and improve model relevance based on their feedback.
- Own the success of your models by measuring their real-world efficacy, focusing on business ROI.
In addition to competitive salaries, full-time employees enjoy a great benefits package:
- Multiple Medical, Dental & Vision plans
- 401k with company match and immediate vesting
- Generous paid time off package and 11 paid holidays
- And much more!
If you got to this point, we hope you're feeling excited about the job description you just read. Even if you don't feel that you meet every single requirement, we still encourage you to apply. We're eager to meet people that believe in LegitScript’s mission and can contribute to our team in a variety of ways.
This job description is not designed to cover or contain a comprehensive listing of all activities, duties or responsibilities that are required of the employee. Duties, responsibilities and activities may change or new ones may be assigned at any time with or without notice.
Please note that visa sponsorship is not available for this position. We cannot support international remote work.
**We do not accept unsolicited applications from third-party recruiters or agencies for this job posting. Any candidate submission without a prior agreement will be considered the property of our company, and we will not be responsible for any fees or obligations related to such submissions. We encourage interested candidates to apply directly through our official channels.**
