How to Format Your Manuscript for Audiobook Production
Most authors approach audiobook production by handing over their print manuscript and assuming the narrator will figure out the rest. This approach produces avoidable problems — mispronounced character names throughout an entire recording, awkward handling of print-specific formatting elements, confusion about what to read and what to skip, and back matter that doesn't translate to audio.
Preparing your manuscript for audiobook production is a distinct step that requires specific decisions about how your print content translates to audio. This guide covers every element — for both authors working with professional narrators and authors self-narrating their own books.
Why Audio Preparation Is Different From Print Formatting
Print books and audiobooks are different experiences in fundamental ways that affect your manuscript:
Print readers skip front matter they don't want. Audio listeners hear it sequentially — every element you include in the audio recording is heard
Print books can use footnotes, endnotes, sidebars, tables, and charts. None of these translate naturally to audio
Print readers can see italics and bold. Narrators must interpret emphasis from context unless you provide explicit guidance
Character names, place names, and invented words appear consistently on the page. In audio, a mispronounced name on page 3 is a jarring inconsistency through every subsequent mention
Print can have multiple narrator voices implied by formatting. Audio requires the narrator to actually perform those voices — without guidance, they'll make their own choices
Step 1: Clean Your Audio Script
Your audio script is your print manuscript cleaned for narration. Create a separate document — do not modify your final print manuscript. Make the following changes:
Remove Print-Specific Elements
Delete all footnotes and endnotes — incorporate any essential information into the body text, or omit it. Narrators cannot naturally read footnotes as footnotes
Remove all headers and running heads that are print formatting artifacts
Remove page numbers and any text that references page numbers ('see page 47')
Remove decorative chapter separators, fleurons, and section break symbols — replace with a clear scene break marker the narrator will understand
Simplify or remove tables, charts, and structured data — describe the essential information in prose, or include a note directing the narrator to read it as written with clear pausing guidance
Remove copyright page content — handled separately in the opening credits file
Handle Scene Breaks
Print scene breaks are typically marked with a centered symbol (***) or blank line. In audio, a scene break is handled with a pause — the narrator pauses for 2–3 seconds before beginning the next scene. Mark your scene breaks in the audio script clearly:
Replace *** or similar symbols with the text: [SCENE BREAK — PAUSE 2 SECONDS]
This notation is clear, unambiguous, and gives the narrator explicit direction
For chapter breaks, the narrator will naturally pause while ACX or your editor places the chapter title audio
Handle Epigraphs and Quotes
Epigraphs (quotes at the start of a chapter) are read aloud in audio but need clear context. Mark them:
Add [BEGIN EPIGRAPH] before the quoted text
Add [END EPIGRAPH — attribution: Author Name] after, so the narrator reads the attribution clearly
If you have multiple epigraphs, mark each one
Italics and Emphasis
Italics are invisible in audio. There are two approaches:
Leave italicized words for the narrator to interpret — experienced narrators will generally handle standard emphasis naturally
For critical emphasis where specific stress is essential to meaning, add a bracketed note: [STRESS: 'impossible'] — use this sparingly
Numbers, Symbols, and Abbreviations
Print shorthand doesn't always read naturally aloud:
Write out numbers that would be read awkwardly: '$3.5M' becomes 'three point five million dollars'
Spell out abbreviations if how they should be read isn't clear: 'Dr.' is fine; industry-specific acronyms may need full form
Dates: '01/15/2024' should be '01/15/2024' — or write 'January 15th, 2024' in the audio script
Currency: '$42' — most narrators handle this fine; 'USD 42.00' may need clarification
Step 2: Build Your Pronunciation Guide
The pronunciation guide is the single most important document you provide to your narrator. It is also the document most authors skip, leading to the most common and frustrating audiobook production problem: consistent mispronunciation of character names throughout an entire recording that must then be re-recorded.
Create a simple document — a table or list — that covers every non-standard pronunciation in your book.
What to Include in Your Pronunciation Guide
How to Write Phonetic Pronunciation
Write phonetics in a way a non-linguist narrator can use — not in formal IPA notation. Use familiar English word sounds and syllable breaks:
Vaelindra — VAY-lin-dra (rhymes with 'calendar' with a 'VAY' start)
Aethon — AY-thon (like 'aeon' with a 'th')
Xiomara — Zee-oh-MAR-ah (stress on MAR)
Beaumont — BOH-mont (silent 'e' — not byoo-MONT)
Cholmondeley — CHUM-lee (silent letters — this one surprises everyone)
For each entry, give the phonetic breakdown AND a rhyming word or familiar reference if possible. The rhyming reference is often more useful than the phonetics alone.
Send your pronunciation guide to your narrator before production begins — not when you receive your first chapter and discover the mispronunciation. Address it before recording starts.
Step 3: Front Matter — What to Include in Audio
Audio front matter is handled differently from print front matter. The opening credits file is a specific required audio file, not just the first chapter of your book.
Opening Credits File
ACX and Findaway Voices both require an opening credits audio file — a brief recording at the start of the audiobook stating the essential identification information. Standard format:
[Title] — written by [Author Name] — narrated by [Narrator Name]
The opening credits file is typically 5–15 seconds long. It is a separate audio file from Chapter 1. Many authors and narrators expand this slightly to include series information: '[Title], Book [Number] in the [Series Name] Series — written by [Author Name] — narrated by [Narrator Name].'
What to Include From Print Front Matter
Back Matter — What to Include in Audio
Closing Credits File
The closing credits file is required by ACX and Findaway Voices. Standard format:
[Title] — written by [Author Name] — narrated by [Narrator Name]
Copyright [Year] [Author/Publisher Name]. All rights reserved.
Optionally, some authors include a brief narrator bio and a thank-you to listeners. Keep closing credits under 30 seconds.
Step 4: Chapter File Structure
Your audiobook is not a single audio file — it is a set of individual chapter files, each separately uploaded. Understanding how chapters are structured in audio production helps you prepare your manuscript for that structure.
Each chapter file should contain:
The chapter title/number as typically spoken — 'Chapter One' or 'Chapter One — The Arrival' depending on whether your chapters have titles
The full chapter content
A brief pause at the end (the narrator leaves 0.5–1 second of room tone before stopping recording)
Mark chapter openings clearly in your audio script so the narrator knows where each chapter file begins and ends:
[BEGIN CHAPTER 1 FILE]
[END CHAPTER 1 FILE — PAUSE 1 SECOND ROOM TONE]
For short books, chapters may be combined into fewer files. For very long books, a single chapter may be split across multiple files if it would exceed ACX's 120-minute per-file limit (rare for standard novels). Standard novels produce 15–30 chapter files plus the opening and closing credits files.
Step 5: Preparing Guidance for Your Narrator
Beyond the audio script and pronunciation guide, experienced narrators benefit from additional author guidance — particularly for fiction with complex character casts or unusual genre conventions.
Character Voice Guide
For fiction with multiple significant characters, a brief character voice guide helps the narrator make consistent choices. This does not need to be detailed — a sentence or two per major character:
Vaelindra — the protagonist. Confident, dry wit. Age 28. I imagine her voice as slightly lower than average female pitch, measured pacing. Think Cate Blanchett in Lord of the Rings rather than high and bright
Brennan — the mentor. Older (60s), world-weary but warm. Occasional Irish lilt is appropriate
The Dark King — the antagonist. Do not over-perform — menacing through restraint, not growling
The comparable performer references are the most useful direction you can give. Narrators understand 'think of X performer in Y role' immediately.
Tone and Pacing Notes
A brief note on overall tone — 'this is dark romance, not erotica; the heat is emotional as much as physical, so lean into tension rather than explicit performance' — saves significant re-recording. Narrators who misjudge the genre's expected delivery produce audio that doesn't match listener expectations.
Step 6: Reviewing Finished Audio
When your narrator delivers chapters, your review is the quality gate before the audiobook is submitted to ACX or Findaway. Listen to every chapter, in full, at normal playback speed. Do not skim.
What to Listen For
Giving Effective Correction Notes
Return corrections to your narrator with specific, actionable feedback:
Good: 'At 4:23 in Chapter 3, 'Vaelindra' is pronounced 'Vay-LINN-dra' but should be 'VAY-lin-dra' as in the pronunciation guide. Please re-record this section from 4:15 to 4:35.'
Not useful: 'Some of the names sound off'
Good: 'At 7:14 in Chapter 5, the text reads 'she had never been so afraid' but it was read as 'she had never been afraid' — the word 'so' was omitted. Please re-record from 7:10 to 7:20.'
Not useful: 'I think something is missing in the middle of chapter 5'
Include a timestamp (hours:minutes:seconds), the specific issue, and a clear correction. Narrators work faster with precise direction than with vague notes.
Step 7: Final File Verification
Before submitting to ACX or Findaway Voices, verify the complete file set:
Opening credits file present — correct title, author, narrator
All chapter files present and in sequence — no gaps
Closing credits file present — copyright information correct
All files named consistently — sequential naming convention
All files are MP3 format at 192 kbps minimum constant bitrate
RMS levels between -23 and -18 dB across all files
No file exceeds -3 dB peak
Noise floor at -60 dB or lower throughout
Each file opens and closes with 0.5–1 second of room tone
See HT30 (Audiobook Technical Specifications) for full detail on audio file technical requirements, recording environments, and mastering workflow for self-produced audiobooks.
Audiobook royalties from ACX, Findaway Voices, and Author's Republic sync into ScribeCount once your accounts are connected. For authors building a complete multi-format catalog — ebooks, print books, and audiobooks across wide platforms — ScribeCount's unified dashboard shows total per-title performance across every format and every platform in one view.
Common Audio Preparation Mistakes
Sending the print manuscript without a pronunciation guide — leading to consistent mispronunciation throughout the recording
Not marking scene breaks — resulting in narrator uncertainty about whether to pause or continue reading
Including the full print table of contents in the audio script — creating a long, tedious reading of chapter numbers that listeners cannot use
Not listening to chapter files in full during review — skimming misses mispronunciations and skipped text that appear mid-chapter
Providing vague correction notes without timestamps — making re-recording inefficient and costly
Forgetting to include the opening and closing credits files in the final submission set
The work you put into preparing your manuscript for audio production directly determines the quality of the finished audiobook and the efficiency of the production process. A clean audio script, a thorough pronunciation guide, and clear character notes save your narrator time, save you money on corrections, and produce the audio your readers deserve. Prepare carefully before the recording begins — not after you hear the problems in the finished chapters.
-Randall Wood