Senior Machine Learning Engineer - Speech
Who We Are
Kensho is a 100-person AI and machine learning company focused on providing cutting-edge solutions to meet the challenges of some of the largest and most successful businesses and institutions. Our toolkit illuminates insights by helping the world better understand, process, and leverage messy data. Specifically, our solutions largely involve natural language processing (NLP) and include speech recognition (ASR), entity linking (Named Entity Disambiguation), structured document extraction, automated database linking, text classification, and more. We are continuously expanding our portfolio and are looking for passionate researchers to help us build and deploy state-of-the-art models across a variety of domains!
At Kensho, we take a flexibility-first approach and give our employees the opportunity to work from wherever they feel most productive and engaged (must be in the United States). We also value in-person collaboration, so there may be times when travel to one of our Kensho hubs (NY/DC/MA) will be required for team meetings or company events.
About The Role
Kensho is looking for a Senior ML Engineer to join the Kensho Scribe team. The Scribe team builds world-class automatic speech recognition (ASR) solutions for business audio. The ideal candidate has deep industry or research experience in the ASR domain.
What You’ll Do
- Work with hundreds of thousands of hours of audio and transcription data
- Work on cutting-edge machine learning models through all phases of development, from design through training, evaluation, deployment, and monitoring
- Design new custom metrics to evaluate model performance
- Contribute toward building machine learning pipelines that automate processes of training and evaluation
- Work closely with a team of excellent machine learning and software engineers who are passionate about the product
- Contribute to Kensho’s open-source packages and datasets (see pyctcdecode and SPGISpeech)
- Collaborate with NVIDIA, HuggingFace, and other partners
- Share your work through presentations, papers, and blog posts
Who You Are
- An expert in the automatic speech recognition (ASR) domain
- Have 3+ years of significant, hands-on experience with ML/NLP/Speech in an industry or research capacity
- Have expertise in Python and Python-based ML frameworks (e.g., PyTorch or TensorFlow)
- Experienced in working with messy, real-world audio and familiar with the nuances of spoken language
- Have strong skills in experiment design
- Able and excited to lead initiatives from the ideation stage to implementation
- Have strong communication skills and can effectively explain even complicated methods and results to a broad, often non-technical audience
- (Optional) Have published in top-tier journals and conferences in the ASR domain or a related ML domain
Technologies We Love
- ML: PyTorch, NVIDIA NeMo, Weights & Biases
- Deployment: Airflow, Docker, AWS