As a Senior Software Engineer specializing in Audio and Digital Signal Processing for Voice AI, you will be instrumental in establishing the foundation for real-time audio processing and advanced conversational capabilities, particularly tailored for the restaurant industry.
You will design and evaluate algorithms to enhance speech and audio performance in our products. Working closely with product and engineering teams, you will develop and assess new solutions, devising innovative ways to integrate them. Additionally, you will prototype and research various software/hardware solutions involving audio, speech, speakers, and sensors for phone and drive-thru interfaces.
This role offers the opportunity to create new products from the ground up, significantly influence product direction and the engineering roadmap, participate in building our voice team culture, and ultimately shape the future of restaurant technology. Your efforts will enhance customer interactions, ensuring a seamless and enjoyable experience.
● Algorithm Development: Create, design, and troubleshoot real-time audio algorithms.
- Collaborative Engineering: Work within a multidisciplinary team to make essential architectural decisions for efficient software delivery.
- Software Creation: Develop software for various consumer devices that utilize real-time audio inputs and outputs.
- Code Quality: Write clean, readable code, efficiently debug complex issues, and prioritize tasks to maintain high productivity.
- Continuous Learning: Engage in ongoing learning, explore new technologies, and stay updated on the latest advancements in audio and digital signal processing.
- AI/ML Integration: Utilize AI/ML techniques for speech enhancement; experience in this area is a plus.
- Prototyping: Rapidly prototype new software and hardware solutions to contribute to product ideation and development.
Requirements
- Higher education (e.g., a Master's or PhD) or multiple years of industry experience in computer science or engineering.
- Experience in evaluating speech quality/recognition and building scalable systems for real-time audio input/output across multiple concurrent channels.
- Good understanding of voice activity detection (VAD), beamforming, source separation, and speaker diarization/separation.
- Interest in prototyping hardware solutions ranging from hobby to professional levels, and a strong sense of future-proofing when it comes to hardware decisions.
- Good problem-solving skills and ability to work independently and in a team.
- Understanding of conversational English and experience with voice programs/products.
- Strong communication skills for explaining technical ideas to various audiences.
- Ability to manage product specifications from concept to production.
- Extensive experience ordering food at fast food and quick service restaurants