Audio File Size Calculation Formula Calculator

Estimate uncompressed and compressed audio file sizes instantly using sample rate, bit depth, channels, duration, and bitrate. Designed for podcasters, engineers, musicians, editors, archivists, and media teams.

Calculator

Duration

Duration Unit

Sample Rate

Bit Depth

Channels

Calculation Mode

Compressed Audio Bitrate (kbps)

Estimated file size

8.82 MB

Formula: sample rate × bit depth × channels × duration
5 minutes at 44.1 kHz, 16-bit, stereo produces about 8.82 MB of raw PCM audio
Switch to compressed mode to estimate MP3, AAC, or similar encoded sizes by bitrate

Visual Size Breakdown

Compare the estimated output in bytes, kilobytes, megabytes, and gigabytes, plus a direct comparison between uncompressed and compressed size.

Tip: Uncompressed PCM size is driven by sample rate, bit depth, channel count, and duration. Compressed formats are usually estimated from bitrate and time.

Expert Guide to the Audio File Size Calculation Formula

The audio file size calculation formula is one of the most practical concepts in recording, editing, media delivery, streaming preparation, and digital archiving. Whether you are producing a podcast, sending voiceovers to a client, capturing interviews in the field, preserving oral histories, or mastering music for release, file size affects workflow, storage costs, upload time, transfer reliability, and long term media management.

At a basic level, audio file size depends on how much digital information is stored for every second of sound. In uncompressed formats such as PCM WAV or AIFF, every sample is preserved directly, so the math is predictable. In compressed formats such as MP3 or AAC, the file size is typically estimated using bitrate, because the encoder decides how much data to allocate to the signal over time. That is why professionals often use two related formulas depending on the format they are working with.

Main uncompressed formula: File Size in bits = Sample Rate × Bit Depth × Channels × Duration in seconds
Convert to bytes: Divide by 8
Main compressed formula: File Size in bits = Bitrate × Duration in seconds

What each variable means

Sample rate: How many times per second the analog signal is sampled. Common values include 44,100 Hz for music and 48,000 Hz for video production.
Bit depth: The number of bits used to store each sample. Common values include 16-bit for consumer delivery and 24-bit for production and recording.
Channels: Mono uses 1 channel, stereo uses 2, while surround formats can use 6 or more.
Duration: The total running time of the audio in seconds, minutes, or hours.
Bitrate: For compressed formats, bitrate describes how many bits are used per second. For example, 128 kbps, 192 kbps, or 320 kbps.

The standard uncompressed audio size formula

For raw PCM audio, the formula is straightforward because the amount of data per second is fixed. If you know the sample rate, bit depth, and number of channels, then every second of audio will always occupy the same amount of storage.

Multiply sample rate by bit depth
Multiply that result by the number of channels
Multiply again by duration in seconds
Divide by 8 to convert bits into bytes
Optionally divide by 1,024 or 1,000 factors to convert into KB, MB, or GB

For example, suppose you record 5 minutes of stereo WAV audio at 44.1 kHz and 16-bit:

44,100 × 16 × 2 = 1,411,200 bits per second
5 minutes = 300 seconds
1,411,200 × 300 = 423,360,000 bits
423,360,000 ÷ 8 = 52,920,000 bytes

That works out to about 52.92 MB in decimal units, or about 50.47 MiB in binary units. Many calculators show MB using decimal conversion, while operating systems may display binary measurements. This is one reason the same file may appear to have slightly different sizes in different tools.

The compressed audio file size formula

Compressed formats are usually calculated from bitrate because the encoder determines what data to keep and what to reduce. The quick estimate is:

File Size in bytes = (Bitrate in bits per second × Duration in seconds) ÷ 8

If you encode a 5-minute spoken word recording at 128 kbps:

128 kbps = 128,000 bits per second
300 seconds × 128,000 = 38,400,000 bits
38,400,000 ÷ 8 = 4,800,000 bytes

That is about 4.8 MB in decimal units. This is dramatically smaller than an uncompressed WAV because lossy codecs remove data judged less important to human perception. In practical terms, compression makes streaming and downloading faster, but it may also reduce fidelity depending on the codec and bitrate chosen.

Why sample rate matters

Sample rate controls temporal detail. A higher sample rate means more snapshots of the waveform are captured each second. As sample rate doubles, storage needs also double, assuming bit depth, channels, and duration remain the same. This is why moving from 48 kHz to 96 kHz immediately doubles the raw data rate. Higher sample rates can be useful in some production contexts, especially during intensive post processing, but they also increase storage and transfer requirements substantially.

Why bit depth matters

Bit depth influences dynamic range and quantization precision. Increasing from 16-bit to 24-bit increases the amount of data per sample by 50 percent. That means a 24-bit recording is 1.5 times larger than a 16-bit recording at the same sample rate, channel count, and duration. In production, 24-bit is common because it provides more headroom and editing flexibility. For final consumer playback, 16-bit often remains adequate in many delivery workflows.

Why channels matter

Every additional channel adds another stream of audio data. Mono is half the size of stereo under otherwise identical settings. Surround formats can become much larger. For example, a 5.1 recording with 6 channels is three times larger than stereo, and a 7.1 recording with 8 channels is four times larger than stereo. This matters for film post production, game audio, and immersive media pipelines where storage planning can become a major budget factor.

Format Example	Core Settings	Approx Data Rate	Approx Size for 1 Hour
PCM WAV mono voice	16 kHz, 16-bit, mono	256 kbps	115.2 MB
CD quality stereo	44.1 kHz, 16-bit, stereo	1,411.2 kbps	635.04 MB
Video production stereo	48 kHz, 24-bit, stereo	2,304 kbps	1.0368 GB
High resolution stereo	96 kHz, 24-bit, stereo	4,608 kbps	2.0736 GB

Real world interpretation of these statistics

The table above reveals how quickly high quality production formats grow. A single hour of 48 kHz, 24-bit stereo PCM audio exceeds 1 GB. A multi track session with 24 simultaneous channels recorded at that rate can therefore reach tens of gigabytes very quickly. By contrast, spoken word capture for transcription or internal communication may use 16 kHz mono with far smaller storage requirements. The correct setting depends on the use case, not simply on the desire for the largest number.

Compressed bitrate comparisons

For encoded delivery, bitrate often becomes the dominant planning metric. The same duration can vary greatly in file size based on the selected bitrate. This is especially important for streaming catalogs, podcast hosting, mobile listening, language learning platforms, and lecture distribution where bandwidth costs matter.

Compressed Bitrate	Use Case	Approx Size for 10 Minutes	Approx Size for 1 Hour
64 kbps	Low bandwidth speech	4.8 MB	28.8 MB
128 kbps	General podcast and streaming	9.6 MB	57.6 MB
192 kbps	Higher quality stereo delivery	14.4 MB	86.4 MB
320 kbps	High bitrate MP3 delivery	24 MB	144 MB

Common mistakes when calculating audio file size

Confusing kilobits and kilobytes: 128 kbps does not mean 128 KB per second. You must divide bits by 8 to get bytes.
Ignoring channel count: Stereo doubles the storage of mono. Surround multiplies it further.
Mixing decimal and binary units: 1 MB can mean 1,000,000 bytes in decimal or 1,048,576 bytes in binary style reporting.
Assuming WAV headers are the entire difference: Container overhead exists, but the recording parameters are what drive size the most.
Forgetting variable bitrate behavior: Some formats use variable bitrate, so the final output may differ from a simple estimate.

When to use uncompressed vs compressed estimates

Use the uncompressed formula whenever you are dealing with PCM based formats such as WAV, AIFF, or raw digital audio in production systems. Use bitrate based estimates whenever you are delivering MP3, AAC, OGG, or other compressed files. In workflows that include both recording and final publishing, it is normal to calculate both. For example, a studio may need to know the raw session storage requirement for a 24-bit WAV recording and also the compressed distribution size for a podcast feed.

How professionals apply this formula

Broadcasters, media archivists, and post production teams often rely on exact audio size calculations before recording even begins. They estimate card capacity for field recorders, decide whether network transfer windows are realistic, and project storage growth over months or years. Educational institutions also need these calculations when storing lectures, oral histories, and research media. Public agencies managing digital preservation use data rate planning to ensure content remains maintainable and backed up correctly.

Helpful authoritative references

If you want deeper technical or preservation context, review these reliable resources:

Practical rule of thumb summary

For WAV or PCM, multiply sample rate × bit depth × channels × seconds, then divide by 8.
For MP3 or AAC style delivery, multiply bitrate × seconds, then divide by 8.
Use mono for speech when appropriate to reduce storage.
Use 48 kHz for video workflows and 44.1 kHz for many music delivery scenarios unless your project requires something else.
Use 24-bit for production and editing, but expect larger files.
Always account for transfer time, backup space, and future growth, not just the size of one file.

In short, the audio file size calculation formula gives you control. It lets you estimate storage before recording starts, choose efficient delivery settings, avoid failed uploads, and plan archive capacity intelligently. Once you understand how sample rate, bit depth, channels, duration, and bitrate interact, file size stops being a surprise and becomes a manageable engineering decision.