MP3 vs WAV vs FLAC: Audio Format Comparison
Compare MP3, WAV, and FLAC audio formats. Understand the trade-offs between file size, audio quality, and compatibility.
Convert-To Editorial Team
Editorial PolicyA three-minute song stored as WAV takes up roughly 31 MB of disk space. The same song as a 128 kbps MP3 occupies about 2.9 MB — a 91% reduction. As FLAC, it lands somewhere in between at around 19 MB, yet sounds identical to the WAV when played back. These three formats represent three fundamentally different approaches to storing audio: uncompressed, lossy compressed, and lossless compressed. The format you choose determines how much storage you use, what quality your listeners hear, and which devices and platforms can actually play your files.
How Audio Compression Differs from Image Compression
Audio and image compression share the same core principle — removing data humans won't notice — but they target different senses. Image compression exploits the fact that human vision is less sensitive to color detail than brightness detail. Audio compression exploits psychoacoustic masking: when a loud sound at one frequency is playing, quieter sounds at nearby frequencies become inaudible.
The key difference is temporal. Images are static, so compression works on spatial redundancy (neighboring pixels). Audio is a continuous stream, so compression works on spectral redundancy across time windows, typically 576 or 1152 samples per frame in MP3. This means audio compression artifacts sound different from image artifacts — you hear them as metallic ringing, pre-echo before transients, or a subtle "underwater" quality rather than the blocky pixels you see in over-compressed JPGs.
| Compression Aspect | Image (JPG) | Audio (MP3) |
|---|---|---|
| Perceptual model | Human visual system (HVS) | Psychoacoustic model |
| What's discarded | High-frequency spatial detail | Masked frequencies, sub-threshold signals |
| Artifact type | Block artifacts, banding | Pre-echo, metallic ringing, stereo imaging loss |
| Quality control | Quality slider (1-100) | Bitrate (kbps) |
| Typical compression | 10:1 to 50:1 | 4:1 to 12:1 |
MP3: The Universal Compromise
MP3 (MPEG-1 Audio Layer III) was standardized in 1993 and became the format that defined digital music distribution. It uses lossy compression, meaning it permanently removes audio data during encoding. The MP3 encoder analyzes audio in short frames, applies a psychoacoustic model to determine which frequency components are inaudible, and discards them.
How MP3 Quality Works
MP3 quality is controlled by bitrate — the number of bits used per second of audio. Higher bitrate means more data is preserved and fewer artifacts are introduced.
| Bitrate | File Size (3 min) | Quality Description | Typical Use |
|---|---|---|---|
| 64 kbps | 1.4 MB | Noticeable artifacts, thin sound | Voice-only recordings, talk radio |
| 128 kbps | 2.9 MB | Acceptable for casual listening | Streaming, podcasts |
| 192 kbps | 4.3 MB | Good quality, artifacts rare | Music streaming (standard tier) |
| 256 kbps | 5.8 MB | Very good, near-transparent | High-quality streaming |
| 320 kbps | 7.2 MB | Best MP3 quality available | Archival-quality MP3, DJ use |
Variable bitrate (VBR) encoding adjusts the bitrate dynamically — using more bits for complex passages (cymbal crashes, dense orchestration) and fewer bits for simple passages (silence, single instruments). A VBR MP3 targeting quality level V2 (roughly equivalent to 190 kbps average) often sounds better than a constant 192 kbps MP3 while being a similar size.
MP3 Limitations
MP3 has a hard ceiling: it cannot store audio above 16 kHz at lower bitrates, and even at 320 kbps, the frequency response rolls off around 20 kHz. This matters less than it sounds — most adults can't hear frequencies above 16-17 kHz — but it means MP3 is fundamentally incapable of bit-perfect audio reproduction regardless of bitrate settings.
MP3 also uses joint stereo encoding at lower bitrates, which merges the left and right channels for frequencies where stereo separation is less perceptible. This can subtly alter the stereo imaging of a recording, particularly noticeable on headphones with acoustic music.
WAV: Uncompressed Fidelity at a Cost
WAV (Waveform Audio File Format) stores raw PCM (pulse-code modulation) audio data with no compression whatsoever. Every sample from every channel is stored as a numerical value, typically at 16-bit or 24-bit depth.
The Numbers
CD-quality WAV uses 44,100 samples per second (44.1 kHz), 16 bits per sample, across 2 channels (stereo). The math is straightforward:
44,100 samples/sec x 16 bits/sample x 2 channels = 1,411,200 bits/sec = 176.4 KB/sec
For a 3-minute song: 176.4 KB/sec x 180 seconds = 31.7 MB
Professional studio recordings often use 24-bit depth at 48 kHz or 96 kHz, pushing file sizes even higher:
| Sample Rate | Bit Depth | Channels | Size per Minute | 60-min Album |
|---|---|---|---|---|
| 44.1 kHz | 16-bit | Stereo | 10.6 MB | 635 MB |
| 48 kHz | 24-bit | Stereo | 17.3 MB | 1,037 MB |
| 96 kHz | 24-bit | Stereo | 34.6 MB | 2,074 MB |
| 192 kHz | 32-bit | Stereo | 92.2 MB | 5,530 MB |
WAV Strengths and Weaknesses
WAV's strength is simplicity and universality. Every audio application, operating system, and hardware player supports WAV. There's no decoding overhead, no compatibility concerns, and no quality questions — the audio is stored exactly as captured.
The weaknesses are substantial: enormous file sizes, no built-in metadata support (no reliable way to store artist, album, or track information), and no compression. WAV also has a 4 GB file size limit due to its 32-bit header, which restricts recordings to roughly 6.75 hours at CD quality. Professional environments typically use BWF (Broadcast WAV Format) or RF64 to work around this limit.
FLAC: Lossless Compression Without the Bulk
FLAC (Free Lossless Audio Codec) was released in 2001 as an open-source alternative to proprietary lossless codecs. It applies lossless compression to audio data — like ZIP for sound files — reducing file size by 30-60% compared to WAV while preserving every sample perfectly.
How FLAC Compression Works
FLAC uses linear prediction to model the audio signal. For each block of samples, the encoder predicts what each sample should be based on preceding samples, then stores only the difference (residual) between the prediction and the actual value. Since audio signals are continuous and predictable, these residuals are small numbers that compress efficiently with Rice coding.
The result: a FLAC file decoded back to WAV produces output that is bit-for-bit identical to the original WAV. This isn't "nearly identical" or "perceptually identical" — it is mathematically the same data, verifiable by comparing checksums.
| Feature | FLAC | WAV | MP3 (320 kbps) |
|---|---|---|---|
| Compression | Lossless (30-60% reduction) | None | Lossy (78% reduction) |
| Quality | Bit-perfect | Bit-perfect | Perceptual approximation |
| File size (3 min, CD quality) | ~19 MB | ~31.7 MB | ~7.2 MB |
| Metadata support | Full (Vorbis comments) | Minimal | ID3 tags |
| Streaming support | Yes | Limited | Yes |
| Hardware support | Most modern devices | Universal | Universal |
| Open source | Yes | No (Microsoft format) | No (patented, though patents expired) |
FLAC Compression Levels
FLAC offers compression levels 0-8, but this only affects encoding speed, not audio quality (since all levels are lossless). Higher levels try harder to find patterns, producing slightly smaller files at the cost of longer encoding time.
| FLAC Level | Encoding Speed | File Size (typical) | Decoding Speed |
|---|---|---|---|
| 0 (fastest) | Very fast | ~62% of WAV | Fast |
| 5 (default) | Moderate | ~58% of WAV | Fast |
| 8 (best compression) | Slow | ~56% of WAV | Fast |
The difference between level 0 and level 8 is typically only 3-6% of the WAV size. Since decoding speed is virtually identical across all levels, most users stick with the default level 5.
Bitrate, Sample Rate, and Bit Depth: What the Numbers Mean
Audio quality specifications can be confusing because three independent parameters all affect quality and file size:
Sample rate (measured in Hz or kHz) determines how many times per second the audio signal is measured. CD quality is 44.1 kHz (44,100 measurements per second). By the Nyquist theorem, a sample rate captures frequencies up to half its value — so 44.1 kHz captures up to 22.05 kHz, comfortably above the ~20 kHz upper limit of human hearing.
Bit depth determines the precision of each measurement. 16-bit audio has 65,536 possible amplitude values per sample, providing approximately 96 dB of dynamic range. 24-bit audio has 16,777,216 values and approximately 144 dB of dynamic range — far exceeding what any listening environment can reproduce (a quiet room is roughly 30-40 dB, and the threshold of pain is about 120 dB).
Bitrate (measured in kbps) is the total data throughput. For uncompressed audio, bitrate is calculated from sample rate, bit depth, and channel count. For compressed formats like MP3, bitrate is a target set by the encoder that determines how aggressively data is discarded.
Format Comparison: Size, Quality, and Compatibility
| Criterion | MP3 | WAV | FLAC |
|---|---|---|---|
| File size (album) | ~100 MB | ~650 MB | ~380 MB |
| Quality | Good to very good | Perfect | Perfect |
| Editing suitability | Poor (re-encoding degrades) | Excellent | Good (decode first) |
| Mobile device support | Universal | Universal | Most modern devices |
| Streaming platform use | Yes (legacy) | Rarely | Tidal, Qobuz, Apple (ALAC) |
| Web browser playback | Universal | Universal | Limited |
| Cost | Free (patents expired 2017) | Free | Free (open source) |
When MP3 Artifacts Become Audible
MP3 compression artifacts are not equally audible across all content. Certain audio characteristics make artifacts more noticeable:
Castanets and cymbals: These instruments produce sharp transients with broad spectral content. MP3 encoders struggle with pre-echo — a faint "ghost" of the transient that appears a few milliseconds before the actual sound, caused by the encoder's windowing function.
Solo acoustic instruments: A solo violin or acoustic guitar in a quiet recording exposes artifacts that would be masked by a dense mix. The lack of competing sounds removes the psychoacoustic masking that MP3 relies on.
Applause and rain: Noise-like signals with no dominant frequency are difficult for the psychoacoustic model to optimize. At lower bitrates (128 kbps and below), applause recordings develop a characteristic "swirling" or "watery" quality.
At 256 kbps and above, these artifacts become extremely difficult to detect, even in controlled listening tests with high-end headphones. Multiple double-blind studies have found that most listeners cannot reliably distinguish 256 kbps MP3 from the lossless original.
If you're distributing audio that will be further processed (used in a video edit, mixed into a podcast, or played through a PA system), always provide WAV or FLAC. Each processing step that involves re-encoding MP3 adds another generation of lossy compression. Use our FLAC to MP3 converter only for the final distribution copy.
Choosing the Right Format for Your Workflow
Choose MP3 when: file size is the primary constraint, the audio will be consumed on mobile devices or web players, the content is spoken word (podcasts, audiobooks), or the destination platform requires MP3 specifically. Use 192 kbps or higher for music; 128 kbps is sufficient for speech.
Choose WAV when: you're recording, editing, or mixing audio in a DAW (Digital Audio Workstation); maximum compatibility with professional audio hardware is required; the files won't be stored long-term (WAV lacks robust metadata); or you need zero decoding latency for live performance or broadcast.
Choose FLAC when: you want lossless quality with smaller files than WAV; you're building a music archive and want both quality and metadata (artist, album, track number, album art); the destination supports FLAC (most modern music players, Android, many car systems); or you need a lossless master from which you can generate MP3, AAC, or other lossy formats on demand.
Real-World Scenario: Podcast Production Pipeline
A podcaster records a 45-minute interview. Here's how all three formats fit into the workflow:
-
Recording: The DAW records to WAV at 48 kHz, 24-bit mono. File size: approximately 247 MB per track, two tracks (host + guest) = 494 MB.
-
Editing: All editing (noise reduction, EQ, compression, normalization) happens in WAV. The edited master is exported as a single stereo WAV file: approximately 494 MB.
-
Archival: The stereo master is converted to FLAC for long-term storage. File size drops to approximately 290 MB — a 41% saving over WAV with zero quality loss. The FLAC file includes metadata (episode title, date, show name).
-
Distribution: The FLAC master is encoded to MP3 at 128 kbps mono for podcast distribution. Final file size: approximately 41 MB. This is what listeners download from the RSS feed.
-
Audiogram clips: Short 60-second clips are exported as WAV files for video editors who need uncompressed audio for social media audiograms.
Total storage for one episode: 290 MB (FLAC archive) + 41 MB (MP3 distribution) = 331 MB, compared to 494 MB if archiving as WAV. Over 200 episodes, FLAC archival saves approximately 32 GB compared to WAV.
Audio files can contain metadata beyond what you'd expect — recording device information, GPS coordinates (if recorded on a phone), timestamps, and software version strings. When converting audio files online, this metadata travels with the upload. When you convert a file on Convert-To.co, it is processed by CloudConvert, a GDPR-compliant and ISO 27001 certified service. All files are automatically deleted within 15 minutes after conversion. Convert-To.co does not store your files on its own servers. For sensitive recordings (legal depositions, confidential interviews), consider offline tools like FFmpeg or Audacity for format conversion. Read more in our file conversion privacy guide.
Related Tools and Resources
- MP3 to WAV Converter — convert MP3 to uncompressed WAV for editing
- WAV to MP3 Converter — compress WAV files for distribution
- FLAC to MP3 Converter — create compact copies from lossless originals
- Image Compressor — similar compression concepts applied to images
- PDF format guide — understanding document format compression
- JPG format guide — lossy compression in images
- PNG format guide — lossless compression in images
- Lossy vs Lossless Compression — the underlying theory behind audio and image compression
- JPG vs PNG vs WebP — compression trade-offs in image formats
- Complete Guide to File Formats — overview of all format families
Tags
Related Guides
HEIC vs JPG: Apple's Image Format Compared
Compare HEIC and JPG image formats. Learn why iPhones use HEIC and when to convert to JPG for compatibility.
ComparisonJPG vs PNG vs WebP: Choosing the Right Image Format
Compare JPG, PNG, and WebP image formats. Understand the differences in quality, file size, and transparency support.
ComparisonPDF vs JPG: When to Use Each Format
Compare PDF and JPG formats for documents and images. Learn when each format is the better choice for your needs.
ComparisonPDF vs Word: Which Document Format Should You Use?
Compare PDF and Word document formats. Learn when to use each format for editing, sharing, and archiving documents.
Try It Now
Ready to use MP3 to WAV? Convert your files for free with our online tool.
Use MP3 to WAV →Try It Now
Ready to use WAV to MP3? Convert your files for free with our online tool.
Use WAV to MP3 →Try It Now
Ready to use FLAC to MP3? Convert your files for free with our online tool.
Use FLAC to MP3 →