Machinify is a leading healthcare intelligence company that transforms raw external data into powerful, trusted datasets. As a Data Engineer, you'll design and implement production-grade pipelines using Python, Spark SQL, and Airflow, working closely with product managers, data scientists, and engineers to build, scale, and refine them.
Requirements
- 4+ years of experience as a Data Engineer (or equivalent)
- Strong expertise in Python, Spark SQL, and Airflow
- Experience processing large-scale file-based datasets
- Experience mapping and standardizing raw external data into canonical models
- Familiarity with AWS (or any cloud), including file storage and distributed compute concepts
- Experience onboarding new customers and integrating external customer data in non-standard formats
- Ability to work across teams, manage priorities, and own complex data workflows with minimal supervision
- Strong written and verbal communication skills
Benefits
- Flexible work arrangements
- Opportunities for professional growth and development
- Collaborative and diverse work environment
- Competitive salary and benefits package
