Context
Descript was evaluating automatic transcription providers to understand which ASR engine offered the strongest combination of speed, accuracy and affordability for its customers.
Case study / Descript
Descript needed a practical way to manage speech-recognition benchmark data, review Word Error Rate results and support a public comparison of automatic transcription services. MT Software delivered an Airtable-backed workflow and a custom WER reader application to help the team organize, inspect and publish the evaluation data.
Benchmark audio, reference transcripts and ASR-generated transcripts organized for repeatable comparison.
A custom workflow for reading Word Error Rate outputs and comparing transcription accuracy across providers.
Structured results prepared for review, filtering and publication in Descript’s public ASR accuracy article.
Overview
The project supported Descript’s comparison of automatic speech-recognition providers by organizing test data, reference transcripts, ASR outputs and WER results into a workflow the team could inspect and publish with confidence.
Descript was evaluating automatic transcription providers to understand which ASR engine offered the strongest combination of speed, accuracy and affordability for its customers.
The benchmark required many audio samples, professional reference transcripts, ASR-generated transcripts and WER outputs to be organized in a way that made the results clear, comparable and publishable.
MT Software built the Airtable backend structure and a custom WER reader application so Descript could review transcription accuracy data and support the final benchmark publication.
Outcome snapshot
The public article presented a transparent comparison of transcription services using Word Error Rate, grounded in representative audio samples and professionally verified reference transcripts.
The benchmark used about 50 clips, each around 3–5 minutes, covering broadcast, calls, meetings and other customer-like audio.
The published results compared providers including Google Speech, Temi, Amazon, Speechmatics, Trint, Microsoft and IBM Watson.
Google Speech Video achieved the strongest reported average Word Error Rate in Descript’s 2018 benchmark.
Solution design
The solution focused on making the evaluation data easier to store, inspect and compare, so Descript could move from raw ASR outputs to a credible public accuracy report.
Structured backend records for audio samples, reference transcripts, ASR providers, test outputs and benchmark metadata.
A custom application layer for reading Word Error Rate outputs and helping the team inspect accuracy results.
Data structures that supported side-by-side comparison across multiple automatic transcription services.
Organized benchmark data that could support public reporting, filtering and explanation of the ASR accuracy results.
Workflow model
The visual model keeps the case technical and credible by representing the evaluation workflow without relying on proprietary screenshots or unsupported interface claims.
Result analysis
WER outputs are framed around audio samples, reference transcripts and provider records so results can be reviewed consistently.
Evidence structure
The structure supports a credible case narrative while avoiding fake screenshots or unsupported production interface claims.
Delivery path
The work is framed around the real operational path: structure the benchmark data, support WER review, organize provider comparisons and prepare the evidence for publication.
Define the relationships between audio samples, reference transcripts, ASR provider outputs and WER results.
Structure the backend tables and fields needed to organize benchmark records and keep evaluation data reviewable.
Build the custom application workflow used to read WER outputs and support transcription accuracy analysis.
Prepare the organized benchmark data so the Descript team could review, explain and publish the ASR comparison.
Technical direction
The technical direction focused on Airtable-backed data organization, WER analysis and benchmark reporting support, using only confirmed project details instead of unsupported production-stack claims.
Project takeaway
“The value of the build was in turning benchmark data, reference transcripts and WER outputs into a workflow Descript could use to compare ASR providers with clarity.”
Let’s talk about driving your project to success!