Software Engineer, Text-To-Speech (C++) at SoundHound Inc.

SoundHound Inc. builds voice AI products that make it easier for people to engage with the world around them. We do this by creating custom voice assistants that make it possible for companies to extend their brand in new and meaningful ways. Today, our customized voice AI solutions allow people to talk to phones, cars, smart speakers, mobile apps, coffee machines, and every other part of the emerging ‘voice-first’ world.

We are looking for a software engineer to join our core speech technology team, someone who thrives in an algorithmically intense, hands-on environment. You will collaborate with world-class deep learning researchers to research and implement production-ready TTS solutions. Team projects include sophisticated deep learning models, front-end Text-to-Phoneme, and Embedded TTS.

Our team is distributed across our offices in Europe and North America. We enjoy flexible working hours and a high level of ownership across our projects.

In this role you will:

Productionize deep learning models for our Houndify Voice AI Platform
Collaborate with seasoned machine learning engineers/scientists to develop a faster and more accurate Text-to-Speech system
Spend much of your time coding, testing, and scripting in a Linux environment

We would love to hear from you if:

You graduated with a Bachelor’s degree in Computer Science
You are well-versed in writing production-level C++
You are adept at algorithms and data structures
You do your best work when you are in a creative and diverse environment

We’d be especially excited if you have experience with TTS engines. Be sure to let us know how comfortable you are using deep learning frameworks like TensorFlow and PyTorch. Familiarity with real-time audio and signal processing is also a plus!