Comparison

MP3 vs WAV vs FLAC: Audio Format Comparison

Compare MP3, WAV, and FLAC audio formats. Understand the trade-offs between file size, audio quality, and compatibility.

February 22, 202613 min read

Convert-To Editorial Team

Editorial Policy

A three-minute song stored as WAV takes up roughly 31 MB of disk space. The same song as a 128 kbps MP3 occupies about 2.9 MB — a 91% reduction. As FLAC, it lands somewhere in between at around 19 MB, yet sounds identical to the WAV when played back. These three formats represent three fundamentally different approaches to storing audio: uncompressed, lossy compressed, and lossless compressed. The format you choose determines how much storage you use, what quality your listeners hear, and which devices and platforms can actually play your files.

How Audio Compression Differs from Image Compression

Audio and image compression share the same core principle — removing data humans won't notice — but they target different senses. Image compression exploits the fact that human vision is less sensitive to color detail than brightness detail. Audio compression exploits psychoacoustic masking: when a loud sound at one frequency is playing, quieter sounds at nearby frequencies become inaudible.

The key difference is temporal. Images are static, so compression works on spatial redundancy (neighboring pixels). Audio is a continuous stream, so compression works on spectral redundancy across time windows, typically 576 or 1152 samples per frame in MP3. This means audio compression artifacts sound different from image artifacts — you hear them as metallic ringing, pre-echo before transients, or a subtle "underwater" quality rather than the blocky pixels you see in over-compressed JPGs.

Compression AspectImage (JPG)Audio (MP3)
Perceptual modelHuman visual system (HVS)Psychoacoustic model
What's discardedHigh-frequency spatial detailMasked frequencies, sub-threshold signals
Artifact typeBlock artifacts, bandingPre-echo, metallic ringing, stereo imaging loss
Quality controlQuality slider (1-100)Bitrate (kbps)
Typical compression10:1 to 50:14:1 to 12:1

MP3: The Universal Compromise

MP3 (MPEG-1 Audio Layer III) was standardized in 1993 and became the format that defined digital music distribution. It uses lossy compression, meaning it permanently removes audio data during encoding. The MP3 encoder analyzes audio in short frames, applies a psychoacoustic model to determine which frequency components are inaudible, and discards them.

How MP3 Quality Works

MP3 quality is controlled by bitrate — the number of bits used per second of audio. Higher bitrate means more data is preserved and fewer artifacts are introduced.

BitrateFile Size (3 min)Quality DescriptionTypical Use
64 kbps1.4 MBNoticeable artifacts, thin soundVoice-only recordings, talk radio
128 kbps2.9 MBAcceptable for casual listeningStreaming, podcasts
192 kbps4.3 MBGood quality, artifacts rareMusic streaming (standard tier)
256 kbps5.8 MBVery good, near-transparentHigh-quality streaming
320 kbps7.2 MBBest MP3 quality availableArchival-quality MP3, DJ use

Variable bitrate (VBR) encoding adjusts the bitrate dynamically — using more bits for complex passages (cymbal crashes, dense orchestration) and fewer bits for simple passages (silence, single instruments). A VBR MP3 targeting quality level V2 (roughly equivalent to 190 kbps average) often sounds better than a constant 192 kbps MP3 while being a similar size.

MP3 Limitations

MP3 has a hard ceiling: it cannot store audio above 16 kHz at lower bitrates, and even at 320 kbps, the frequency response rolls off around 20 kHz. This matters less than it sounds — most adults can't hear frequencies above 16-17 kHz — but it means MP3 is fundamentally incapable of bit-perfect audio reproduction regardless of bitrate settings.

MP3 also uses joint stereo encoding at lower bitrates, which merges the left and right channels for frequencies where stereo separation is less perceptible. This can subtly alter the stereo imaging of a recording, particularly noticeable on headphones with acoustic music.

WAV: Uncompressed Fidelity at a Cost

WAV (Waveform Audio File Format) stores raw PCM (pulse-code modulation) audio data with no compression whatsoever. Every sample from every channel is stored as a numerical value, typically at 16-bit or 24-bit depth.

The Numbers

CD-quality WAV uses 44,100 samples per second (44.1 kHz), 16 bits per sample, across 2 channels (stereo). The math is straightforward:

44,100 samples/sec x 16 bits/sample x 2 channels = 1,411,200 bits/sec = 176.4 KB/sec

For a 3-minute song: 176.4 KB/sec x 180 seconds = 31.7 MB

Professional studio recordings often use 24-bit depth at 48 kHz or 96 kHz, pushing file sizes even higher:

Sample RateBit DepthChannelsSize per Minute60-min Album
44.1 kHz16-bitStereo10.6 MB635 MB
48 kHz24-bitStereo17.3 MB1,037 MB
96 kHz24-bitStereo34.6 MB2,074 MB
192 kHz32-bitStereo92.2 MB5,530 MB

WAV Strengths and Weaknesses

WAV's strength is simplicity and universality. Every audio application, operating system, and hardware player supports WAV. There's no decoding overhead, no compatibility concerns, and no quality questions — the audio is stored exactly as captured.

The weaknesses are substantial: enormous file sizes, no built-in metadata support (no reliable way to store artist, album, or track information), and no compression. WAV also has a 4 GB file size limit due to its 32-bit header, which restricts recordings to roughly 6.75 hours at CD quality. Professional environments typically use BWF (Broadcast WAV Format) or RF64 to work around this limit.

FLAC: Lossless Compression Without the Bulk

FLAC (Free Lossless Audio Codec) was released in 2001 as an open-source alternative to proprietary lossless codecs. It applies lossless compression to audio data — like ZIP for sound files — reducing file size by 30-60% compared to WAV while preserving every sample perfectly.

How FLAC Compression Works

FLAC uses linear prediction to model the audio signal. For each block of samples, the encoder predicts what each sample should be based on preceding samples, then stores only the difference (residual) between the prediction and the actual value. Since audio signals are continuous and predictable, these residuals are small numbers that compress efficiently with Rice coding.

The result: a FLAC file decoded back to WAV produces output that is bit-for-bit identical to the original WAV. This isn't "nearly identical" or "perceptually identical" — it is mathematically the same data, verifiable by comparing checksums.

FeatureFLACWAVMP3 (320 kbps)
CompressionLossless (30-60% reduction)NoneLossy (78% reduction)
QualityBit-perfectBit-perfectPerceptual approximation
File size (3 min, CD quality)~19 MB~31.7 MB~7.2 MB
Metadata supportFull (Vorbis comments)MinimalID3 tags
Streaming supportYesLimitedYes
Hardware supportMost modern devicesUniversalUniversal
Open sourceYesNo (Microsoft format)No (patented, though patents expired)

FLAC Compression Levels

FLAC offers compression levels 0-8, but this only affects encoding speed, not audio quality (since all levels are lossless). Higher levels try harder to find patterns, producing slightly smaller files at the cost of longer encoding time.

FLAC LevelEncoding SpeedFile Size (typical)Decoding Speed
0 (fastest)Very fast~62% of WAVFast
5 (default)Moderate~58% of WAVFast
8 (best compression)Slow~56% of WAVFast

The difference between level 0 and level 8 is typically only 3-6% of the WAV size. Since decoding speed is virtually identical across all levels, most users stick with the default level 5.

Bitrate, Sample Rate, and Bit Depth: What the Numbers Mean

Audio quality specifications can be confusing because three independent parameters all affect quality and file size:

Sample rate (measured in Hz or kHz) determines how many times per second the audio signal is measured. CD quality is 44.1 kHz (44,100 measurements per second). By the Nyquist theorem, a sample rate captures frequencies up to half its value — so 44.1 kHz captures up to 22.05 kHz, comfortably above the ~20 kHz upper limit of human hearing.

Bit depth determines the precision of each measurement. 16-bit audio has 65,536 possible amplitude values per sample, providing approximately 96 dB of dynamic range. 24-bit audio has 16,777,216 values and approximately 144 dB of dynamic range — far exceeding what any listening environment can reproduce (a quiet room is roughly 30-40 dB, and the threshold of pain is about 120 dB).

Bitrate (measured in kbps) is the total data throughput. For uncompressed audio, bitrate is calculated from sample rate, bit depth, and channel count. For compressed formats like MP3, bitrate is a target set by the encoder that determines how aggressively data is discarded.

Format Comparison: Size, Quality, and Compatibility

CriterionMP3WAVFLAC
File size (album)~100 MB~650 MB~380 MB
QualityGood to very goodPerfectPerfect
Editing suitabilityPoor (re-encoding degrades)ExcellentGood (decode first)
Mobile device supportUniversalUniversalMost modern devices
Streaming platform useYes (legacy)RarelyTidal, Qobuz, Apple (ALAC)
Web browser playbackUniversalUniversalLimited
CostFree (patents expired 2017)FreeFree (open source)

When MP3 Artifacts Become Audible

MP3 compression artifacts are not equally audible across all content. Certain audio characteristics make artifacts more noticeable:

Castanets and cymbals: These instruments produce sharp transients with broad spectral content. MP3 encoders struggle with pre-echo — a faint "ghost" of the transient that appears a few milliseconds before the actual sound, caused by the encoder's windowing function.

Solo acoustic instruments: A solo violin or acoustic guitar in a quiet recording exposes artifacts that would be masked by a dense mix. The lack of competing sounds removes the psychoacoustic masking that MP3 relies on.

Applause and rain: Noise-like signals with no dominant frequency are difficult for the psychoacoustic model to optimize. At lower bitrates (128 kbps and below), applause recordings develop a characteristic "swirling" or "watery" quality.

At 256 kbps and above, these artifacts become extremely difficult to detect, even in controlled listening tests with high-end headphones. Multiple double-blind studies have found that most listeners cannot reliably distinguish 256 kbps MP3 from the lossless original.

Convert-To Tip

If you're distributing audio that will be further processed (used in a video edit, mixed into a podcast, or played through a PA system), always provide WAV or FLAC. Each processing step that involves re-encoding MP3 adds another generation of lossy compression. Use our FLAC to MP3 converter only for the final distribution copy.

Choosing the Right Format for Your Workflow

Choose MP3 when: file size is the primary constraint, the audio will be consumed on mobile devices or web players, the content is spoken word (podcasts, audiobooks), or the destination platform requires MP3 specifically. Use 192 kbps or higher for music; 128 kbps is sufficient for speech.

Choose WAV when: you're recording, editing, or mixing audio in a DAW (Digital Audio Workstation); maximum compatibility with professional audio hardware is required; the files won't be stored long-term (WAV lacks robust metadata); or you need zero decoding latency for live performance or broadcast.

Choose FLAC when: you want lossless quality with smaller files than WAV; you're building a music archive and want both quality and metadata (artist, album, track number, album art); the destination supports FLAC (most modern music players, Android, many car systems); or you need a lossless master from which you can generate MP3, AAC, or other lossy formats on demand.

Real-World Scenario: Podcast Production Pipeline

A podcaster records a 45-minute interview. Here's how all three formats fit into the workflow:

  1. Recording: The DAW records to WAV at 48 kHz, 24-bit mono. File size: approximately 247 MB per track, two tracks (host + guest) = 494 MB.

  2. Editing: All editing (noise reduction, EQ, compression, normalization) happens in WAV. The edited master is exported as a single stereo WAV file: approximately 494 MB.

  3. Archival: The stereo master is converted to FLAC for long-term storage. File size drops to approximately 290 MB — a 41% saving over WAV with zero quality loss. The FLAC file includes metadata (episode title, date, show name).

  4. Distribution: The FLAC master is encoded to MP3 at 128 kbps mono for podcast distribution. Final file size: approximately 41 MB. This is what listeners download from the RSS feed.

  5. Audiogram clips: Short 60-second clips are exported as WAV files for video editors who need uncompressed audio for social media audiograms.

Total storage for one episode: 290 MB (FLAC archive) + 41 MB (MP3 distribution) = 331 MB, compared to 494 MB if archiving as WAV. Over 200 episodes, FLAC archival saves approximately 32 GB compared to WAV.

Privacy Note

Audio files can contain metadata beyond what you'd expect — recording device information, GPS coordinates (if recorded on a phone), timestamps, and software version strings. When converting audio files online, this metadata travels with the upload. When you convert a file on Convert-To.co, it is processed by CloudConvert, a GDPR-compliant and ISO 27001 certified service. All files are automatically deleted within 15 minutes after conversion. Convert-To.co does not store your files on its own servers. For sensitive recordings (legal depositions, confidential interviews), consider offline tools like FFmpeg or Audacity for format conversion. Read more in our file conversion privacy guide.

Tags

mp3wavflacaudio formatscomparison
Back to Blog
Updated 2/22/2026

Try It Now

Ready to use MP3 to WAV? Convert your files for free with our online tool.

Use MP3 to WAV

Try It Now

Ready to use WAV to MP3? Convert your files for free with our online tool.

Use WAV to MP3

Try It Now

Ready to use FLAC to MP3? Convert your files for free with our online tool.

Use FLAC to MP3