ZeroUtil

Extract Audio from Video

Pull the audio track out of MP4, WebM, MOV and MKV. Output as MP3, WAV, AAC or Opus. Files auto-deleted after 15 minutes.

Maintained by

How to Extract Audio from Video

  1. Drop your video file onto the upload area. MP4, WebM, MOV and MKV up to 500 MB are accepted.
  2. Pick the output format: MP3 (universal, 192 kbps), WAV (uncompressed PCM), AAC (192 kbps), or Opus (128 kbps).
  3. Press Extract audio. The video stream is dropped and the audio is re-encoded into your chosen format.
  4. Download the audio file. Both the original video and the extracted audio are auto-deleted from our servers within 15 minutes.

Which output format to pick

  • MP3 at 192 kbps is the default. Plays on every device, every player, every messenger. Pick it unless you have a specific reason not to.
  • WAV is uncompressed PCM, sample-perfect copy of the audio waveform. Files are 10-15x larger than MP3 but you lose nothing. Use for editing in a DAW or for archival.
  • AAC is what YouTube, iTunes and modern streaming use. Slightly better quality than MP3 at the same bitrate but less universal.
  • Opus is the best lossy codec available - higher quality than MP3 at half the bitrate. Great for podcasts and voice if your players support it (most modern browsers and Android do; Apple devices struggle).

Common use cases

  • Extracting the audio track from a podcast YouTube video for offline listening.
  • Pulling the music out of a wedding/birthday video to identify a song.
  • Salvaging an interview where only the video file survived.
  • Creating a ringtone source from a video clip (combine with a trimmer).

Frequently Asked Questions

Does extracting audio reduce quality?

Lossy formats (MP3, AAC, Opus) re-encode the audio, which means some detail is discarded. For most material the difference is inaudible at 192 kbps. For perfect quality choose WAV - it copies the PCM waveform exactly without re-encoding loss. Note that the source audio in a video is usually already compressed (AAC inside MP4), so the absolute ceiling is set by the source.

Why is the MP3 quality lower than the original?

YouTube and most video sources store audio as AAC at roughly 128-192 kbps. Re-encoding from AAC to MP3 introduces a second lossy compression pass on top, called transcoding loss. To minimize the loss use the same bitrate or higher (192 kbps MP3 from 128 kbps AAC is fine), or pick WAV to skip the second compression entirely.

Can I extract audio without re-encoding?

In theory yes (FFmpeg with `-c:a copy` writes the audio stream as-is), but the output container has to match the codec. AAC inside MP4 can be copied as M4A; MP3 inside MKV can be copied as MP3. We default to re-encoding because it produces a predictable, universally playable file. If you need a stream copy ask for it as a feature request.

What is the difference between MP3 and AAC?

AAC is the newer, more efficient codec. At the same bitrate AAC sounds noticeably cleaner than MP3, especially on cymbals, applause and other high-frequency material. The downside is compatibility - very old devices and some legacy players only handle MP3. For 2026 use cases AAC is usually the better technical choice; MP3 wins only on universal compatibility.

How big will the audio file be?

At 192 kbps a 60-minute MP3 is around 80-90 MB; at 320 kbps it is 130-140 MB. WAV at 44.1 kHz stereo is ~10 MB per minute, so a one-hour file is ~600 MB. Opus at 128 kbps is the smallest of the lossy options at ~55 MB per hour.

Why is the output file larger than expected?

Most likely because the source video had higher-bitrate audio than the output target (e.g. a music video with 320 kbps source extracted to 192 kbps MP3 - the MP3 is smaller). Or you picked WAV which stores raw uncompressed samples. To make a smaller file pick MP3 or Opus at a lower bitrate.

Are my files private?

Files travel over HTTPS to api.zeroutil.com (EU server). FFmpeg processes them locally on the server, returns a signed download URL, and both the input video and the extracted audio are auto-deleted after 15 minutes. We do not log content, do not transcribe audio, and do not retain anything beyond the deletion window.

More Video & Audio