Skip to main content
MediaScribe
Deadline Extended: ADA compliance deadline moved to April 26, 2027.Learn what changed →
Technical Guide

Achieving 99% Transcription Accuracy

Your setup matters. This guide covers the audio quality, environment, and speaker practices that separate good transcription from great transcription.

Understanding Accuracy

What Does 99% Actually Mean?

Transcription accuracy is measured by Word Error Rate (WER)—the percentage of words transcribed incorrectly. A 99% accuracy rate means only 1 in 100 words contains an error.

99%
Our Target Accuracy

Achieved with quality audio, proper setup, and custom vocabulary training.

85-95%
Typical Range

Common results without optimization. Adequate but not ideal for official records.

<85%
Needs Attention

Usually indicates audio quality issues, environmental noise, or speaker problems.

Real-World vs. Lab Conditions

Benchmark accuracy rates are often measured with clean studio recordings. Real meetings have background noise, multiple speakers, accents, and technical terminology. This guide helps you bridge that gap by optimizing your actual environment.

Most Important Factor

Audio Quality Fundamentals

Audio quality is the single biggest factor in transcription accuracy. A clear audio signal can achieve 99%+ accuracy, while poor audio may struggle to reach 80% regardless of how good the AI is.

Microphone Types & Best Uses

Lavalier/Lapel Mics

Clip-on microphones worn by individual speakers. Best for consistent audio from mobile presenters.

Best For

Podium speakers, presenters, council members

Tips
  • Position 6-8 inches below chin
  • Avoid clothing rustle
  • Use windscreens outdoors

Gooseneck Mics

Flexible mounted microphones at fixed positions. Standard for council chambers and dais setups.

Best For

Dais seating, witness tables, fixed positions

Tips
  • Position 12-18 inches from speaker
  • Angle toward mouth
  • Avoid paper shuffling near mic

Boundary/PZM Mics

Flat microphones placed on tables that capture from all directions. Good for group discussions.

Best For

Conference tables, round-table discussions

Tips
  • Place in center of table
  • One mic per 4-6 people
  • Keep table surface clear

Handheld/Wireless

Portable microphones passed between speakers. Common for public comment periods.

Best For

Audience participation, Q&A sessions

Tips
  • Hold 2-4 inches from mouth
  • Train speakers on technique
  • Check batteries before meetings

Optimal Audio Levels

Monitor your audio meters during recording. Aim for peaks between -12dB and -6dB.

Too QuietBelow -20dB

Increased background noise ratio, missed words

Optimal-12dB to -6dB

Clear speech with headroom for peaks

Too LoudAbove -3dB

Clipping and distortion, reduced accuracy

Signal Chain & SDI Integration

SDI Embedded Audio

MediaScribe receives audio embedded in your SDI video signal. Ensure your audio is properly embedded on channels 1-2 (or your configured channels).

  • Verify audio presence before meetings
  • Use quality SDI cables (no kinks or damage)
  • Check embedder/de-embedder settings

Audio Processing

Avoid over-processing audio before it reaches MediaScribe. Heavy compression or noise gates can actually reduce accuracy.

  • Avoid aggressive noise gates
  • Avoid heavy compression (>4:1 ratio)
  • Light EQ for clarity is acceptable
Environment Setup

Optimize Your Meeting Space

Your room environment significantly impacts audio quality. Background noise and room echo are the two biggest environmental factors affecting transcription accuracy.

Pre-Meeting Environment Checklist

Noise

  • HVAC set to quiet mode or temporarily reduced during meetings
  • Doors and windows closed to reduce outside noise
  • Electronic devices on silent (phones, pagers, radios)

Acoustics

  • Acoustic panels or soft furnishings to reduce echo
  • Carpet or rugs on hard floors if possible
  • Avoid large empty rooms with hard parallel surfaces

Setup

  • Audio levels checked before meeting starts
  • Backup microphone available
  • Test recording reviewed for quality

Background Noise Sources

  • • HVAC systems cycling on/off
  • • Traffic from open windows
  • • Audience rustling, coughing
  • • Electronic device notifications
  • • Paper shuffling near microphones

Room Echo & Reverberation

  • • Large rooms with hard surfaces reflect sound
  • • Glass walls and tile floors are problematic
  • • High ceilings increase reverberation time
  • • Acoustic panels can dramatically help
  • • Even curtains and carpet make a difference

Quick Win

Position microphones closer to speakers and farther from noise sources. Halving the mic-to-speaker distance can be more effective than any acoustic treatment.

Human Factors

Speaker Best Practices

How people speak matters as much as the equipment. These guidelines help meeting participants contribute to better transcription quality.

Speaking Pace

Do:

120-150 words per minute

Avoid:

Rapid speech over 180 WPM

Natural conversational pace allows AI to accurately capture every word

Enunciation

Do:

Clear pronunciation, complete words

Avoid:

Mumbling, trailing off, swallowed syllables

Speak as if addressing someone with mild hearing difficulty

Microphone Distance

Do:

Consistent 6-12 inches

Avoid:

Moving toward/away from mic

Variations in distance cause volume fluctuations that affect accuracy

Overlapping Speech

Do:

One speaker at a time

Avoid:

Multiple people talking simultaneously

Crosstalk is the #1 accuracy killer—facilitate clear speaker transitions

Meeting Facilitation Tips for Better Captions

For the Chair/Moderator

  • Introduce speakers by name before they speak
  • Repeat audience questions into the microphone
  • Enforce one-speaker-at-a-time rule
  • Pause briefly between agenda items

For All Speakers

  • State your name before making a motion
  • Spell unusual names or technical terms on first use
  • Wait for your microphone to be active
  • Avoid side conversations during proceedings
Accuracy Boost

Custom Vocabulary Training

Generic speech recognition struggles with local names, streets, and terminology. Custom vocabulary training teaches MediaScribe your specific language.

What to Include

People

Council Member RodriguezCity Manager ThompsonChief Williams

Places

Riverside BoulevardHeritage ParkMLK Community Center

Departments

DPW (Public Works)CDBG (Community Development)P&Z (Planning & Zoning)

Local Terms

Ordinance 2024-15Resolution R-847TIF District

Best Practices

  • Update Regularly

    Add new council members, agenda terms, and project names quarterly

  • Include Variations

    Add both "P and Z" and "P&Z" and "Planning and Zoning"

  • Add Phonetic Hints

    For unusual names, include pronunciation guides

  • Review Transcripts

    Note recurring errors and add those terms to vocabulary

Pro Tip

Export your meeting agenda as a starting point for vocabulary. Most agenda terms (resolution numbers, project names, addresses) are excellent candidates for custom vocabulary.

Audio Descriptions

Video Quality for Audio Descriptions

AI audio descriptions analyze your video to narrate visual content. Higher quality video enables better description of on-screen elements.

Resolution

Higher resolution allows better recognition of text, faces, and visual details.

Minimum:720p (1280×720)
Recommended:1080p (1920×1080)

Frame Rate

Smoother video helps AI track movement and transitions between scenes.

Minimum:24 fps
Recommended:30 fps

Lighting

Well-lit scenes are easier to analyze. Avoid harsh shadows or backlighting.

  • • Even, diffused lighting
  • • Speakers' faces visible
  • • Readable presentation slides

Presentation Best Practices

Do

  • Use high-contrast slides (dark text on light backgrounds)
  • Use large, readable fonts (24pt minimum)
  • Leave slides visible long enough to read
  • Verbally describe key visual elements

Avoid

  • Slides packed with dense text
  • Rapid slide transitions
  • Critical information only in visuals
  • Low-contrast color schemes
Multilingual

Optimizing Translation Quality

Translation quality depends on source transcription quality. Clear English input produces better translations in all 72+ supported languages.

Source Language Tips

  • Use Complete Sentences

    Fragments and incomplete thoughts translate poorly

  • Avoid Idioms & Slang

    "Table this motion" vs "postpone this motion"

  • Be Consistent

    Use the same term throughout (not "proposal" then "initiative")

  • Spell Out Acronyms

    First use: "Planning and Zoning, or P&Z"

Custom Translation Dictionaries

For specialized terminology, you can provide translation preferences for specific terms.

Term: "Resolution R-847"
Spanish: "Resolución R-847"
Vietnamese: "Nghị quyết R-847"

Proper nouns, legal terms, and official document names benefit most from custom translation entries.

Troubleshooting

Common Issues & Solutions

*99% Accuracy Conditions: Accuracy rates achieved with quality audio input (properly positioned microphones, -12dB to -6dB levels), custom vocabulary training for local terminology, and optimized environment (minimal background noise, reduced echo). Results vary based on audio quality, speaker clarity, accents, and technical terminology complexity. Real-world accuracy typically ranges from 95-99% depending on conditions.

Need Help Optimizing Your Setup?

Our team can assess your current environment and recommend specific improvements to maximize transcription accuracy.

No obligation
Personalized for your agency
See implementation timeline