Find a career with Emergence Capital Partners companies

Explore career opportunities across the Emergence Capital portfolio.

Audio AI Engineer

Zoom

Software Engineering, Data Science
San Jose, CA, USA
USD 127,700-255,400 / year + Equity
Posted on Nov 1, 2025

Engineering (EN)

  • San Jose, California, United States
  • Full time

What you can expect

As an Audio AI Engineer, you will research and develop algorithms for accent conversion, voice conversion, speech synthesis, and speech recognition on low-latency streaming architectures. You’ll prototype and refine end-to-end audio models that enhance intelligibility and naturalness while maintaining speaker identity. Working closely with product and platform teams, you’ll help bring these models into real-time communication systems. You will also evaluate and optimize model performance across dimensions such as quality, latency, and scalability. Staying current with advances in speech processing, you’ll contribute to innovation through patents and internal knowledge sharing.


About the Team

Zoom's Audio team develops real-time audio features based on AI algorithms. Team members are spread worldwide, including in the U.S., China, and Singapore.

Responsibilities

  • Researching, designing, and developing algorithms for accent conversion, voice conversion, speech synthesis, and automatic speech recognition, focusing on low-latency streaming architectures.

  • Prototyping end-to-end audio models that enhance intelligibility and naturalness while preserving speaker identity and expressiveness.

  • Collaborating closely with product and platform teams to integrate models into real-time video and audio communication systems.

  • Analyzing and optimizing model performance across speech quality, latency, robustness, and scalability dimensions.

  • Staying current with the latest developments in speech processing research, and contributing to the community through patents and internal knowledge sharing.

What we’re looking for

  • Hold a PhD, or have equivalent experience, in a relevant field such as streaming, voice conversion, TTS, or ASR.

  • Show proficiency in deep learning frameworks like PyTorch or TensorFlow.

  • Demonstrate effective programming skills in Python, C/C++, or similar languages.

  • Have an understanding of sequence modeling architectures (Transformers, RNNs, diffusion models, or conformers).

  • Demonstrate experience developing and deploying low-latency, real-time speech or audio models with streaming architectures and optimized pipelines.

  • Show familiarity with model compression and acceleration techniques, including quantization, pruning, and distillation.

  • Exhibit experience working with real-time audio systems in networked communication environments.

  • Have published in top-tier conferences such as ICASSP, INTERSPEECH, NeurIPS, or ICLR.

  • Be fluent in Mandarin.

Salary Range or On Target Earnings:

Minimum: $127,700.00
Maximum: $255,400.00

In addition to the base salary and/or OTE listed, Zoom has a Total Direct Compensation philosophy that takes into consideration base salary, bonus, and equity value.

Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.

We also have a location-based compensation structure; there may be a different range for candidates in this and other locations.

At Zoom, we offer a window of at least 5 days for you to apply because we believe in giving you every opportunity. Below is the potential closing date, just in case you want to mark it on your calendar. We look forward to receiving your application!

Anticipated Position Close Date:

11/06/25

Ways of Working
Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role (Hybrid, Remote, or In-Person) is indicated in the job description/posting.

Benefits
As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways.

About Us
Zoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars.
We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind. Find room to grow with opportunities to stretch your skills and advance your career in a collaborative, growth-focused environment.


Our Commitment

At Zoom, we believe great work happens when people feel supported and empowered. We’re committed to fair hiring practices that ensure every candidate is evaluated based on skills, experience, and potential. If you require an accommodation during the hiring process, let us know—we’re here to support you at every step.


If you need assistance navigating the interview process due to a medical disability, please submit an Accommodations Request Form and someone from our team will reach out soon. This form is solely for applicants who require an accommodation due to a qualifying medical disability. Non-accommodation-related requests, such as application follow-ups or technical issues, will not be addressed.

Fraudulent Employment Offers

Zoom is aware of scams that involve fake Zoom job listings posted on third-party sites. Responding applicants are contacted primarily over email, InMail and/or chat applications by people impersonating Zoom employees. Eventually a fake offer letter is sent in exchange for personal identification information as part of a fake new-hire screening process.

Please be advised that these offers, communications and impersonations are illegitimate and fraudulent. All communication with Zoom employees comes from an “@zoom.us” email address. Zoom job applicants complete an interview process including in-person (on Zoom) meetings and phone calls. Our process also requires you to create an account with our applicant tracking system, Workday. If you have already completed an application, you can access it here.

Zoom will never ask for your personally identifying information during the interview process or ask you to pay money or purchase equipment. If you have received a message from Zoom that appears suspicious, please contact careers@zoom.us.