hero

We invest in people who change the way the world works.

Interested in working with them?
65
companies
783
Jobs

Speech Modeling Practitioner, Voice AI Innovation

ASAPP

ASAPP

Software Engineering, Data Science
Argentina
Posted on Nov 12, 2024
About Us
At ASAPP, we're reimagining how voice and AI work together in customer experience. Our GenerativeAgent platform goes beyond traditional speech recognition, tackling the unique challenges of voice-first AI interactions. We understand that speech is fundamentally different from text - it's not just about transcription or latency, but about creating truly natural, fluid conversations between humans and AI.
Your Impact
As a Speech Modeling Intern, you'll help pioneer new approaches to voice-based AI interactions. You'll work at the intersection of speech science and large language models, helping to solve the unique challenges that arise when building conversational AI systems that truly understand and respond to human speech.

What You'll Do

  • Research and develop novel approaches to voice-first AI interactions
  • Explore the unique characteristics of speech that differentiate it from text-based interactions
  • Help design and implement speech processing systems that work seamlessly with (speech-) large language models
  • Use unique data to develop validate capabilities
  • Contribute to building more natural and effective voice interfaces
  • Participate in research discussions and experiments around the future of voice AI
  • Learn from experienced researchers and potentially contribute to research publications

What You Bring

  • Currently pursuing or recently completed a graduate degree (MS/PhD) in Computer Science, Electrical Engineering, or related field
  • Understanding of speech processing fundamentals
  • Experience with machine learning frameworks such as PyTorch or TensorFlow
  • Programming skills in Python
  • Curiosity about what makes voice interactions unique and challenging

What Will Help You Succeed

  • Background in conversation analysis or dialogue systems
  • Experience with speech recognition, text-to-speech, or other speech-related work
  • Familiarity with large language models (LLMs) and their application to speech tasks
  • Interest in human-AI interaction
  • Ability to think creatively about unsolved problems
  • Strong communication skills and enthusiasm for learning

What We Offer

  • Opportunity to work on fundamental challenges in voice AI
  • Mentorship from leading researchers in speech technology and AI
  • Chance to shape the future of voice-based customer experience
  • Competitive internship compensation
  • Access to cutting-edge computing resources
  • Flexible work arrangements
  • Learning and development opportunities
  • Collaborative, inclusive work environment

Duration

  • 3-6 months, with potential for extension based on project needs and performance
ASAPP welcomes and encourages applications from people of all backgrounds. We're committed to building a diverse team where everyone can do their best work. If you need any accommodations during the application process, please let us know at careers@asapp.com. #LI-VR1 #LI-Remote