Audio

Audio Formats for Podcasting: Recording to Distribution

Published Mar 19, 2026 8 min read By ChangeThisFile Team

Quick Answer

Record podcasts in WAV or FLAC at 48 kHz / 24-bit. Edit in the same format. Export for distribution as MP3 at 128 kbps mono (speech-only) or 192 kbps stereo (with music). Normalize loudness to -16 LUFS for Apple Podcasts and -14 LUFS for Spotify. MP3 is the universal distribution format — every podcast app supports it without exception.

Podcast audio passes through three distinct phases, and each phase has different format requirements. Recording demands the highest quality capture possible. Editing requires a lossless working format that preserves full fidelity through cuts, crossfades, and processing. Distribution demands small files in a universally compatible format that sounds good on phone speakers and earbuds.

Getting this wrong in either direction costs you. Recording in MP3 means your raw footage already has compression artifacts — every edit and re-export degrades further. Distributing in WAV means 60-minute episodes at 500+ MB, which hosting platforms won't accept and listeners won't download.

This guide covers the complete format pipeline: what to record in, what to edit in, and what to distribute in — with exact specifications for Apple Podcasts, Spotify, and other platforms.

Recording: WAV at 48 kHz / 24-bit

Record your raw audio in WAV at 48 kHz sample rate, 24-bit depth. Here's why each parameter matters:

WAV: Uncompressed, no encoding artifacts, universal DAW support. Your raw recording should never be lossy — you can't un-compress audio later. FLAC is an acceptable alternative (same quality, smaller files), but WAV is more universally supported by audio interfaces and portable recorders.
48 kHz: The broadcast/video standard. If your podcast ever includes video content (YouTube, clips), having audio at 48 kHz avoids sample rate conversion. 44.1 kHz is fine too — the difference is inaudible for speech — but 48 kHz is the more future-proof choice.
24-bit: Gives 144 dB of dynamic range versus 96 dB for 16-bit. This headroom means you can set conservative recording levels (-18 to -12 dBFS) without worrying about noise floor. If a guest suddenly shouts, 24-bit captures it cleanly; 16-bit at the same recording level might clip or produce quantization noise on quiet passages.

File size at these settings: ~16.6 MB per minute, stereo. A 60-minute recording session: ~1 GB. That's a lot, but storage is cheap and your raw recording is irreplaceable.

Do not record in MP3. Even at 320 kbps, MP3 introduces artifacts that compound when you edit, process, and re-export. One lossy encode (at the final export step) is acceptable. Two is not.

Editing: Stay Lossless Throughout

Your DAW or audio editor should work with the same format you recorded in — WAV at 48 kHz / 24-bit. Every cut, crossfade, EQ adjustment, compression, and noise reduction happens in this format. When you export for review or share works-in-progress with co-hosts, use WAV or FLAC.

Key editing considerations:

Processing in 32-bit float: Most DAWs (Audacity, Reaper, Logic, Pro Tools) process audio internally in 32-bit float regardless of the source format. This gives unlimited headroom for effects processing. The 24-bit precision is maintained; float just prevents clipping during intermediate stages.
Mono vs stereo for speech: If you're recording with a single microphone, your source is mono. Keep it mono throughout editing. A mono podcast exported as stereo doubles the file size with zero benefit — both channels contain identical audio. Multi-mic setups (two hosts, interviews) can be stereo, but for a single-voice show, mono is correct and halves your file size.
Don't export and re-import lossy: If you need to send a work-in-progress for review, use FLAC (smaller than WAV, still lossless). Exporting as MP3 and re-importing for further editing introduces a generation of loss.

Distribution: MP3 Is the Standard

For podcast distribution via RSS feeds, MP3 is the universal standard. Not because it's the best codec — it isn't — but because it works in every podcast app, on every device, without exception. Apple Podcasts, Spotify, Google Podcasts, Overcast, Pocket Casts, generic RSS readers — all handle MP3 reliably.

Recommended export settings:

Content Type	Format	Bitrate	Channels	Size per Hour
Speech only (solo/interview)	MP3	128 kbps CBR	Mono	~57 MB
Speech with music segments	MP3	192 kbps CBR	Stereo	~86 MB
Music-heavy podcast	MP3	192-256 kbps CBR	Stereo	~86-115 MB

Why CBR over VBR for podcasts: some podcast apps and hosting platforms have issues with VBR MP3s — specifically, duration calculation and seeking can be inaccurate because the file header doesn't contain exact timing information. CBR avoids these edge cases. The quality difference between CBR and VBR at 128+ kbps for speech is negligible.

Why Not AAC for Distribution?

AAC at 96 kbps sounds as good as MP3 at 128 kbps for speech — smaller files at the same quality. Apple Podcasts and Spotify both support AAC. But a non-trivial number of podcast apps, older devices, and RSS feed parsers don't handle AAC correctly. The file size savings (~30%) isn't worth the compatibility risk. Use MP3 for distribution unless you're distributing exclusively through a platform that guarantees AAC support.

Loudness Normalization: -16 LUFS and Why It Matters

Loudness normalization ensures your podcast plays at a consistent volume alongside other podcasts. Without it, listeners constantly adjust their volume knob between shows — a terrible experience.

The standard metric is LUFS (Loudness Units Full Scale), an integrated measurement that accounts for how humans perceive loudness across frequencies. The targets:

Apple Podcasts: -16 LUFS (their Sound Check feature targets this)
Spotify: -14 LUFS (they normalize to this for all audio)
YouTube: -14 LUFS
General podcast standard: -16 LUFS (most hosting platforms recommend this)

If you master to -16 LUFS, Apple plays it as-is and Spotify boosts it by 2 dB (negligible). If you master to -14 LUFS, Spotify plays it as-is and Apple reduces it by 2 dB. Either target works. -16 LUFS is the safer default because platforms that normalize downward (turning it quieter) never introduce artifacts, while platforms that normalize upward might slightly increase noise floor.

Loudness measurement tools: Youlean Loudness Meter (free plugin), iZotope Insight, NUGEN VisLM. Most DAWs also have built-in loudness meters. Aim for -16 LUFS integrated loudness with peaks no higher than -1 dBTP (true peak, measured with intersample peak detection).

Platform-Specific Requirements

Platform	Accepted Formats	Recommended	Max File Size	Loudness
Apple Podcasts	MP3, AAC, FLAC, WAV	MP3 128-192 kbps	No strict limit (practical: 200 MB)	-16 LUFS (Sound Check)
Spotify	MP3, AAC, OGG, FLAC	MP3 128-192 kbps	200 MB	-14 LUFS (normalized)
YouTube Music/Podcasts	MP3, AAC	MP3 192 kbps (or AAC 128)	Varies by video upload	-14 LUFS
RSS (generic)	MP3 (safest)	MP3 128 kbps mono CBR	Depends on host	-16 LUFS

Apple technically accepts FLAC and WAV, but file sizes make this impractical for distribution. The RSS <enclosure> tag includes the file's byte size and MIME type — podcast apps use this to estimate download times. A 500 MB WAV episode will deter listeners on cellular data.

Chapter Markers and ID3 Tags

MP3 supports chapter markers through ID3v2 CHAP frames. Chapters let listeners jump to specific segments — invaluable for long-form podcasts. Apps that support chapters: Apple Podcasts, Overcast, Pocket Casts, Castro.

Chapter metadata includes: start time, end time, title, optional URL, optional image. Tools for adding chapters: Forecast (Mac, free, by Overcast developer Marco Arment), Hindenburg Journalist, chapter marker plugins for various DAWs.

Essential ID3 tags for podcast MP3s:

Title (TIT2): Episode title
Artist (TPE1): Podcast name or host name
Album (TALB): Podcast name
Year (TDRC): Release year
Genre (TCON): "Podcast"
Cover Art (APIC): Episode or show artwork (3000x3000 recommended by Apple, JPEG, under 500 KB)
Comment (COMM): Episode description / show notes

Common Podcast Audio Mistakes

Recording in MP3: Your raw recording should always be WAV or FLAC. MP3 artifacts compound with every edit and re-export. Record lossless, encode lossy once at the end.
Exporting stereo for a single-mic show: A mono podcast in stereo wastes 50% of the file size. Both channels are identical. Export mono.
No loudness normalization: Your podcast plays alongside others. Without normalization to -16 LUFS, listeners adjust volume constantly. This is the number one reason podcast audio sounds "unprofessional."
Too high a bitrate for speech: 320 kbps MP3 for a speech podcast is pure waste. Human speech occupies a narrow frequency range. 128 kbps mono captures it perfectly. The extra 192 kbps preserves ultrasonic information that doesn't exist in speech.
Forgetting metadata: Podcast apps display episode titles, artwork, and descriptions from ID3 tags. Missing tags mean blank fields in the listener's app. Always tag your files.
VBR encoding: While VBR is superior for music, some podcast apps and hosting platforms mishandle VBR MP3s (wrong duration display, broken seeking). CBR is the safer choice for podcast distribution.

Converting Audio for Podcast Use

Common podcast conversion workflows: extracting audio from video interviews, converting guest recordings from various formats, preparing clips from existing episodes.

The podcast audio pipeline is straightforward: capture at maximum quality (WAV 48/24), edit losslessly, encode once to MP3 at the end. The most impactful quality improvements aren't format choices — they're loudness normalization, proper mono export, and complete metadata. Get those right and your podcast sounds professional regardless of whether you're recording on a $50 USB mic or a $500 dynamic.

Need to prepare audio for your podcast? Convert WAV to MP3, extract audio from MP4, or convert M4A to MP3 — free at ChangeThisFile.

Key Takeaways

Record in WAV at 48 kHz / 24-bit. Never record in MP3 — artifacts compound with editing and re-encoding.
Distribute as MP3 at 128 kbps mono (speech) or 192 kbps stereo (with music). CBR, not VBR.
Normalize loudness to -16 LUFS integrated with peaks below -1 dBTP.
Export mono for single-mic shows — stereo doubles file size with zero benefit for speech.
Always include ID3 tags: title, artist, album, cover art. Podcast apps display these directly.
Chapter markers (ID3v2 CHAP frames) help listeners navigate long episodes. Use them.

Frequently Asked Questions

What is the best audio format for podcast distribution?

MP3 at 128 kbps CBR mono for speech-only podcasts, 192 kbps stereo for shows with music. MP3 is universally compatible with every podcast app and RSS reader. AAC is technically more efficient but has edge-case compatibility issues with some apps and platforms.

Should I record my podcast in WAV or MP3?

Always WAV (or FLAC). Recording in MP3 means your raw audio already has compression artifacts that can't be removed. Every subsequent edit and re-export compounds this loss. Record lossless, do all editing in lossless, and encode to MP3 once at the very end as your final distribution step.

What loudness level should my podcast be?

Target -16 LUFS integrated loudness with true peaks below -1 dBTP. This is the Apple Podcasts recommendation and the general industry standard. Spotify normalizes to -14 LUFS, so mastering to -16 means Spotify will boost your audio by 2 dB (negligible impact). Use a loudness meter plugin during mastering to hit this target.

Should I export my podcast in mono or stereo?

Mono for single-microphone shows. A mono show exported as stereo contains identical audio in both channels — doubling the file size for zero benefit. Export as mono to cut file sizes in half. Use stereo only if you have multiple speakers panned to different positions, music segments, or spatial audio elements.

What sample rate should I use for podcast recording?

48 kHz is the broadcast standard and recommended for podcasts. It matches video production standards (useful if you also publish video versions) and is the default for most audio interfaces and recorders. 44.1 kHz is also fine — the difference is inaudible for speech. Avoid 96 kHz or higher; it doubles storage requirements with no benefit for speech.

Why do podcast files need to be CBR instead of VBR?

Some podcast apps and hosting platforms miscalculate duration and have broken seeking with VBR MP3 files, because VBR files don't have a fixed relationship between byte position and playback time. CBR files have this relationship, making duration and seeking calculations trivial. The quality difference between CBR and VBR at 128 kbps for speech is negligible.

How big should a podcast episode file be?

At 128 kbps MP3 mono, a 60-minute episode is about 57 MB. At 192 kbps stereo, it's about 86 MB. Most hosting platforms recommend keeping episodes under 200 MB. If your files are significantly larger, you're likely using too high a bitrate or exporting stereo when mono would suffice.

Can I use Opus for podcast distribution?

Not yet for broad distribution. Opus at 64 kbps sounds as good as MP3 at 128 kbps for speech — half the file size. But podcast RSS feeds rely on universal format support, and many podcast apps, especially older ones and web-based players, don't support Opus. Until Opus support is as universal as MP3 in the podcast ecosystem, stick with MP3 for your RSS enclosures.

Ready to convert your files?

Use ChangeThisFile to convert between 600+ formats — free, fast, and private.

Start Converting