Skip to main content
ChapterPass

Audiobook Mastering vs Mixing: What's the Difference?

ChapterPass Editorial Team

If you've spent any time in audio production forums, you've seen "mixing" and "mastering" used interchangeably. Someone asks how to get their audiobook ready for ACX and gets told they need to "mix and master" their files. But mixing and mastering are distinct processes with different goals, and for audiobooks specifically, the distinction matters more than most people realise.

What Is Mixing?

Mixing is the process of combining and balancing multiple audio elements into a cohesive whole. In music, this means taking individual tracks (vocals, guitar, drums, bass) and adjusting their relative levels, panning, EQ, compression, reverb, and other effects to create a finished song.

The mixer makes hundreds of creative decisions: how loud the vocal should sit relative to instruments, whether the guitar pans left or centre, how much reverb on the snare. Two mixers working with the same tracks will produce different results, and both can be excellent.

How Does Mixing Apply to Audiobooks?

For audiobooks, "mixing" is rarely the right word because you're typically working with a single sound source: one narrator, one microphone, one channel. There's nothing to mix.

However, mixing-like work is needed when:

  • Multi-narrator projects: Full-cast audiobooks with multiple narrators need voices balanced for consistent volume and tone.
  • Sound effects and music: Some audiobooks include ambient soundscapes or chapter transition music. Blending these with narration is mixing.
  • Room tone matching: Chapters recorded in different sessions or environments may need room tone matching.
  • Pickup sessions: Re-recorded sections must match the original session's character.

For a standard single-narrator audiobook, what people call "mixing" is usually just editing: removing breaths, cutting mistakes, trimming pauses. Important work, but editorial, not mixing.

What Is Mastering?

Mastering is the final stage of audio production. It takes the finished mix (or finished edit) and processes it to meet delivery specifications.[1]

How Does Mastering Differ for Music vs Audiobooks?

Music mastering is creative. The mastering engineer might boost high frequencies for "air," compress the low end for punch, or widen the stereo image. These are artistic decisions informed by genre and taste.

Audiobook mastering is almost entirely technical. ACX defines exactly what the output must measure:[1]

  • RMS loudness between -23 and -18 dBFS
  • True peak below -3 dBFS
  • Noise floor below -60 dBFS
  • Sample rate of 44,100 Hz, mono
  • MP3 192 kbps CBR
  • 0.5 to 1 second head silence, 1 to 5 seconds tail silence

There's no creative interpretation. Either your file measures within spec or it doesn't. The mastering signal chain is a fixed sequence: format conversion, high-pass filtering, noise management, loudness adjustment, peak limiting, silence padding, and encoding. The audiobook mastering guide covers the full chain.

This predictability is what makes audiobook mastering automatable. Music mastering requires human judgment because the targets are subjective. Audiobook mastering has objective, measurable targets that a deterministic process can hit consistently.

What Are the Key Differences for Audiobooks?

AspectMixingMastering
InputMultiple tracks or elementsSingle finished file
GoalBalance and blend elementsMeet delivery specifications
DecisionsCreative and subjectiveTechnical and objective
When neededMulti-narrator, effects, musicEvery audiobook submission
ToolsDAW with multi-track editingProcessors, limiters, encoders
Can be automatedRarelyYes, for fixed specs
Required for ACXOnly if multi-elementAlways

The critical takeaway: every audiobook needs mastering. Not every audiobook needs mixing.

Which One Do You Need?

For the vast majority of audiobook projects, a single narrator reading a single text, you need editing and mastering. Not mixing.

You need mastering if:

  • You're submitting to ACX, Findaway, or any distributor
  • Your files must meet loudness, peak, and noise specifications
  • You want consistent audio quality across all chapters

You also need mixing if:

  • Multiple narrators recorded separately need to sound like they're in the same room
  • Your production includes background music or sound effects
  • Chapters have significantly different recording characteristics
  • You're blending pickup recordings with original sessions

What Does the Full Production Workflow Look Like?

Here's the complete audiobook production chain with mixing and mastering in their proper positions:

1. Recording. Capture narration in a treated space with consistent mic technique. Record at 44.1 kHz or higher, 24-bit, mono or stereo.[1]

2. Editing. Remove mistakes, re-takes, and unwanted sounds. Trim pauses. Handle breath reduction. This is where you spend the most time.

3. Mixing (if needed). If your project has multiple narrators, sound effects, or music, blend them here. If it's a solo narrator, skip this step entirely.

4. Mastering. Process each chapter file through the mastering signal chain. Standard compression ratio for spoken word is 2:1 to 3:1; above 4:1 destroys speech dynamics.[2] The audiobook mastering guide covers the full chain.

5. Verification. Check every file against every spec before submission.[1] RMS, peak, noise floor, format, silence padding. ACX-compliant files will pass requirements on every other distributor too.[3] This catches problems before the distributor does, saving a rejection cycle.

Mixing vs Mastering at a Glance

The table earlier captures the functional differences. Here's a more practical breakdown organised by what you're actually trying to decide.

QuestionAnswer
Is your source a single narrator, single mic?Skip mixing. Go straight to mastering.
Do you have multiple narrators who recorded separately?Mixing needed before mastering.
Does your audiobook include music or sound effects?Mixing needed to blend with narration.
Do your chapters sound noticeably different from each other?This is a recording consistency problem, not a mixing problem. Fix at source or re-record.
Are you submitting to ACX, Findaway, or any distributor?Mastering required. No exceptions.
Can you hear clicks, pops, or breath artefacts?This is an editing problem. Fix before mastering.
Is your RMS outside the -23 to -18 dBFS range?Mastering problem. Adjust gain or compression in the mastering chain.
Are your chapters consistent with each other in volume and tone?If no, re-examine your mastering chain settings file by file.

When Do You Need a Mixing Engineer vs a Mastering Tool?

This is the practical question most narrators and authors arrive at after understanding the distinction. The answer depends on what your project actually contains.

A mastering tool is sufficient when:

Your project is a single narrator, one microphone, one room, recorded consistently across all sessions. The audio is cleanly edited: no clicks, no breath noise, no significant background hum. What you need is a reliable, repeatable process to bring every chapter file to the same loudness, peak, and noise floor targets.

For this, a mastering tool like ChapterPass is the right choice. The entire signal chain (format conversion, high-pass filtering, compression, loudness normalisation, peak limiting, silence padding, MP3 encoding) can be applied identically to every file. The result is consistent, compliant audio without manual intervention on each chapter.

A mixing engineer is worth hiring when:

Multi-narrator or full-cast productions. If you have two or more narrators who recorded in different environments or on different microphones, a mixing engineer can do room tone matching, EQ matching, and level balancing that makes them sound like they're in the same space. A mastering tool applies the same processing chain to every file but doesn't solve the underlying tonal mismatch.

Productions with music or sound effects. Blending underscores, transitions, or ambient soundscapes with narration requires multi-track mixing. The relative levels, ducking behaviour, and stereo placement of music elements relative to narration are creative decisions a mixing engineer makes. A mastering tool operates on a finished stereo or mono file, not individual elements.

Severe recording problems. If a significant portion of your sessions have clicks, room echo, HVAC noise, or plosive distortion, a mixing engineer with access to iZotope RX or similar restoration tools may be able to rescue the recordings. Mastering cannot fix fundamental recording problems. It can only process what's there.

Pickup sessions with significant character mismatch. If you recorded half a book in one room and the other half in a noticeably different space, a mixing engineer can use EQ and room treatment tools to reduce the difference. A mastering tool will bring both to the same loudness but can't change their acoustic character.

The cost consideration

Hiring a mixing engineer for a full audiobook project adds significant cost per finished hour. For the majority of single-narrator projects, that cost is not justified. The mastering workflow, whether done manually in a DAW or through a tool like ChapterPass, handles everything an ACX submission needs.

The calculation changes for full-cast productions, productions with music, or projects where recording quality problems are severe enough that post-production rescue work is the only alternative to re-recording.

Why Does This Confusion Exist?

The mixing/mastering confusion comes from music terminology bleeding into a different domain.

In music, "mastering" has decades of mystique: expensive analogue gear, acoustically treated rooms, years of training. When audiobook producers hear "mastering," they import all those associations.

Audiobook mastering is important, but it's not that. It's a defined, repeatable technical process. The confusion leads to two common mistakes:

Over-engineering: Producers spend hours tweaking EQ curves and A/B testing limiters, treating the process like music mastering when the targets are fixed numbers. If your RMS is -20 dBFS, it doesn't matter which compressor got you there.

Under-preparing: Producers skip editing and jump straight to "mastering," expecting it to fix recording problems. Mastering can't remove clicks, fix echo, eliminate inconsistent narration, or repair a noisy recording. Those are editing problems.

Can You Master Without Mixing?

Yes, and for most audiobooks, you should. If your source is a cleanly edited single-narrator recording, mastering is the only post-editing step you need.

Can You Mix Without Mastering?

Technically yes, but you won't pass ACX submission. Mixing produces a creatively balanced file. Mastering produces a technically compliant file. Distributors check technical specs, not creative balance. A beautifully mixed audiobook that doesn't meet the loudness, peak, and noise requirements will be rejected.

What About "Mix and Master" Services?

Some audiobook production services advertise "mix and master" packages. For a standard single-narrator audiobook, this usually means editing and mastering. The "mix" part is the editing work, not actual multi-track mixing.

This isn't dishonest, it's imprecise terminology. If the service delivers files that pass ACX quality control with clean audio, the label doesn't matter much. But understand what you're paying for: editing (removing problems) and mastering (meeting specs).

The Bottom Line

For audiobook production, mastering is mandatory and mixing is situational. Every file you submit to ACX or other distributors must be mastered to meet their technical specifications. Mixing is only needed if you're blending multiple audio elements.

Don't conflate the two. Get your recording and editing right first. Then master your files to meet the specs. That's the workflow that gets audiobooks accepted on the first submission.