Technical Guide

Achieving 99% Transcription Accuracy

Your setup matters. This guide covers the audio quality, environment, and speaker practices that separate good transcription from great transcription.

Book a Call View Platform

Understanding Accuracy

What Does 99% Actually Mean?

Transcription accuracy is measured by Word Error Rate (WER)—the percentage of words transcribed incorrectly. A 99% accuracy rate means only 1 in 100 words contains an error.

99%

Our Target Accuracy

Achieved with quality audio, proper setup, and custom vocabulary training.

85-95%

Typical Range

Common results without optimization. Adequate but not ideal for official records.

<85%

Needs Attention

Usually indicates audio quality issues, environmental noise, or speaker problems.

Real-World vs. Lab Conditions

Benchmark accuracy rates are often measured with clean studio recordings. Real meetings have background noise, multiple speakers, accents, and technical terminology. This guide helps you bridge that gap by optimizing your actual environment.

Most Important Factor

Audio Quality Fundamentals

Audio quality is the single biggest factor in transcription accuracy. A clear audio signal can achieve 99%+ accuracy, while poor audio may struggle to reach 80% regardless of how good the AI is.

Microphone Types & Best Uses

Lavalier/Lapel Mics

Clip-on microphones worn by individual speakers. Best for consistent audio from mobile presenters.

Best For

Podium speakers, presenters, council members

Tips

Position 6-8 inches below chin
Avoid clothing rustle
Use windscreens outdoors

Gooseneck Mics

Flexible mounted microphones at fixed positions. Standard for council chambers and dais setups.

Best For

Dais seating, witness tables, fixed positions

Tips

Position 12-18 inches from speaker
Angle toward mouth
Avoid paper shuffling near mic

Boundary/PZM Mics

Flat microphones placed on tables that capture from all directions. Good for group discussions.

Best For

Conference tables, round-table discussions

Tips

Place in center of table
One mic per 4-6 people
Keep table surface clear

Handheld/Wireless

Portable microphones passed between speakers. Common for public comment periods.

Best For

Audience participation, Q&A sessions

Tips

Hold 2-4 inches from mouth
Train speakers on technique
Check batteries before meetings

Optimal Audio Levels

Monitor your audio meters during recording. Aim for peaks between -12dB and -6dB.

Too QuietBelow -20dB

Increased background noise ratio, missed words

Optimal-12dB to -6dB

Clear speech with headroom for peaks

Too LoudAbove -3dB

Clipping and distortion, reduced accuracy

Signal Chain & SDI Integration

SDI Embedded Audio

MediaScribe receives audio embedded in your SDI video signal. Ensure your audio is properly embedded on channels 1-2 (or your configured channels).

Verify audio presence before meetings
Use quality SDI cables (no kinks or damage)
Check embedder/de-embedder settings

Audio Processing

Avoid over-processing audio before it reaches MediaScribe. Heavy compression or noise gates can actually reduce accuracy.

Avoid aggressive noise gates
Avoid heavy compression (>4:1 ratio)
Light EQ for clarity is acceptable

Environment Setup

Optimize Your Meeting Space

Your room environment significantly impacts audio quality. Background noise and room echo are the two biggest environmental factors affecting transcription accuracy.

Pre-Meeting Environment Checklist

Noise

HVAC set to quiet mode or temporarily reduced during meetings
Doors and windows closed to reduce outside noise
Electronic devices on silent (phones, pagers, radios)

Acoustics

Acoustic panels or soft furnishings to reduce echo
Carpet or rugs on hard floors if possible
Avoid large empty rooms with hard parallel surfaces

Setup

Audio levels checked before meeting starts
Backup microphone available
Test recording reviewed for quality

Background Noise Sources

• HVAC systems cycling on/off
• Traffic from open windows
• Audience rustling, coughing
• Electronic device notifications
• Paper shuffling near microphones

Room Echo & Reverberation

• Large rooms with hard surfaces reflect sound
• Glass walls and tile floors are problematic
• High ceilings increase reverberation time
• Acoustic panels can dramatically help
• Even curtains and carpet make a difference

Quick Win

Position microphones closer to speakers and farther from noise sources. Halving the mic-to-speaker distance can be more effective than any acoustic treatment.

Human Factors

Speaker Best Practices

How people speak matters as much as the equipment. These guidelines help meeting participants contribute to better transcription quality.

Speaking Pace

Do:

120-150 words per minute

Avoid:

Rapid speech over 180 WPM

Natural conversational pace allows AI to accurately capture every word

Enunciation

Do:

Clear pronunciation, complete words

Avoid:

Mumbling, trailing off, swallowed syllables

Speak as if addressing someone with mild hearing difficulty

Microphone Distance

Do:

Consistent 6-12 inches

Avoid:

Moving toward/away from mic

Variations in distance cause volume fluctuations that affect accuracy

Overlapping Speech

Do:

One speaker at a time

Avoid:

Multiple people talking simultaneously

Crosstalk is the #1 accuracy killer—facilitate clear speaker transitions

Meeting Facilitation Tips for Better Captions

For the Chair/Moderator

Introduce speakers by name before they speak
Repeat audience questions into the microphone
Enforce one-speaker-at-a-time rule
Pause briefly between agenda items

For All Speakers

State your name before making a motion
Spell unusual names or technical terms on first use
Wait for your microphone to be active
Avoid side conversations during proceedings

Accuracy Boost

Custom Vocabulary Training

Generic speech recognition struggles with local names, streets, and terminology. Custom vocabulary training teaches MediaScribe your specific language.

What to Include

People

Council Member RodriguezCity Manager ThompsonChief Williams

Places

Riverside BoulevardHeritage ParkMLK Community Center

Departments

DPW (Public Works)CDBG (Community Development)P&Z (Planning & Zoning)

Local Terms

Ordinance 2024-15Resolution R-847TIF District

Best Practices

Update Regularly
Add new council members, agenda terms, and project names quarterly
Include Variations
Add both "P and Z" and "P&Z" and "Planning and Zoning"
Add Phonetic Hints
For unusual names, include pronunciation guides
Review Transcripts
Note recurring errors and add those terms to vocabulary

Pro Tip

Export your meeting agenda as a starting point for vocabulary. Most agenda terms (resolution numbers, project names, addresses) are excellent candidates for custom vocabulary.

Audio Descriptions

Video Quality for Audio Descriptions

AI audio descriptions analyze your video to narrate visual content. Higher quality video enables better description of on-screen elements.

Resolution

Higher resolution allows better recognition of text, faces, and visual details.

Minimum:720p (1280×720)

Recommended:1080p (1920×1080)

Frame Rate

Smoother video helps AI track movement and transitions between scenes.

Minimum:24 fps

Recommended:30 fps

Lighting

Well-lit scenes are easier to analyze. Avoid harsh shadows or backlighting.

• Even, diffused lighting
• Speakers' faces visible
• Readable presentation slides

Presentation Best Practices

Do

Use high-contrast slides (dark text on light backgrounds)
Use large, readable fonts (24pt minimum)
Leave slides visible long enough to read
Verbally describe key visual elements

Avoid

Slides packed with dense text
Rapid slide transitions
Critical information only in visuals
Low-contrast color schemes

Multilingual

Optimizing Translation Quality

Translation quality depends on source transcription quality. Clear English input produces better translations in all 72+ supported languages.

Source Language Tips

Use Complete Sentences
Fragments and incomplete thoughts translate poorly
Avoid Idioms & Slang
"Table this motion" vs "postpone this motion"
Be Consistent
Use the same term throughout (not "proposal" then "initiative")
Spell Out Acronyms
First use: "Planning and Zoning, or P&Z"

Custom Translation Dictionaries

For specialized terminology, you can provide translation preferences for specific terms.

Term: "Resolution R-847"
Spanish: "Resolución R-847"
Vietnamese: "Nghị quyết R-847"

Proper nouns, legal terms, and official document names benefit most from custom translation entries.

Troubleshooting

Common Issues & Solutions

*99% Accuracy Conditions: Accuracy rates achieved with quality audio input (properly positioned microphones, -12dB to -6dB levels), custom vocabulary training for local terminology, and optimized environment (minimal background noise, reduced echo). Results vary based on audio quality, speaker clarity, accents, and technical terminology complexity. Real-world accuracy typically ranges from 95-99% depending on conditions.

Need Help Optimizing Your Setup?

Our team can assess your current environment and recommend specific improvements to maximize transcription accuracy.

Book a Call Platform Overview

No obligation

Personalized for your agency

See implementation timeline

Achieving 99% Transcription Accuracy

What Does 99% Actually Mean?

Real-World vs. Lab Conditions

Audio Quality Fundamentals

Microphone Types & Best Uses

Lavalier/Lapel Mics

Gooseneck Mics

Boundary/PZM Mics

Handheld/Wireless

Optimal Audio Levels

Signal Chain & SDI Integration

SDI Embedded Audio

Audio Processing

Optimize Your Meeting Space

Pre-Meeting Environment Checklist

Noise

Acoustics

Setup

Background Noise Sources

Room Echo & Reverberation

Quick Win

Speaker Best Practices

Speaking Pace

Enunciation

Microphone Distance

Overlapping Speech

Meeting Facilitation Tips for Better Captions

For the Chair/Moderator

For All Speakers

Custom Vocabulary Training

What to Include

People

Places

Departments

Local Terms

Best Practices

Pro Tip

Video Quality for Audio Descriptions

Resolution

Frame Rate

Lighting

Presentation Best Practices

Do

Avoid

Optimizing Translation Quality

Source Language Tips

Custom Translation Dictionaries

Common Issues & Solutions

Accuracy is lower than expected

Some speakers are captured better than others

Background noise is affecting quality

Captions are delayed

Technical terms are being transcribed incorrectly

Audio descriptions miss important visual content

Related Resources

Live Captioning Platform

Audio Descriptions

Languages & Translation

Need Help Optimizing Your Setup?