Automatic Speech Recognition in Court Reporting—It's Toast!
It is safe to say that automated speech recognition systems will become the standard method for transcript production in many industries, including court reporting. Is it ready today? Not yet.
October 09, 2019 at 07:00 AM
This article is Part 3 of a three-part series. Part 1 and Part 2 were published on LTN in August and September.
CTC 2019 was held last month in New Orleans. This biennial court technology conference is the largest conference of its kind and always a great opportunity to see where technology vendors are focused with their court offerings. This year was all about artificial intelligence. Vendors of every sort were touting their latest AI-enabled applications, some of them brilliant and some of them boring. All of the digital recording vendors were demonstrating some form of speech recognition. None of them claimed to be able to produce an acceptable transcript, much less a certified transcript, but closed captioning and assisted listening looked like potentially viable applications of speech recognition. Full disclosure: My company, TheRecordXchange, also offers a speech recognition solution called VoiceCopy. We do not claim that the technology can produce an adequate transcript yet either.
How Good Is the Technology?
I first began working with speech recognition technology in the late 1990s as CEO of FTR (For The Record). Even 20 years ago there were serious companies with plenty of cash trying to crack this nut. The technology has improved dramatically, and it continues to advance at a rapid pace.
There are two significant factors that have changed the landscape for speech recognition. First, as expected, the technologies related to artificial intelligence, machine learning and neural networks have matured. Equally important, big tech, most notably Google, Amazon and Apple, has created services that collect unfathomable amounts of voice data. Alexa, Google Home, Siri and other applications amass valuable data by the second. For machine learning, data is gold and big tech has cornered the market.
Big tech is great at solving big problems. But it rarely tries to meet the needs of niche markets. Addressing the specific requirements of court reporting and transcription is exactly what some of the companies at CTC and a handful of innovative startups are trying to do. Google and Amazon rely on these ventures to service niche markets based on the technology they have developed. Smaller companies with domain expertise understand that transcripts must be punctuated accurately, present accurate speaker identifications and be formatted to meet the specifications for different jurisdictions.
Most companies acknowledge that an acceptable legal transcript cannot be produced from current speech technology alone. So what is their answer? Some are promoting their solutions not for transcription but for closed captioning or assisted hearing. Some have given up on the court reporting market and focus resources on markets with less stringent accuracy and formatting requirements. But some are offering a transcription solution that combines AI with human input to produce an acceptable transcript.
AI with a Human Touch
The AI/human strategy uses automatic speech recognition to complete the first pass of transcription. Transcription is the most labor-intensive part of the process, so if that can be automated, it's a big win. Then, a qualified proofreader, using appropriately designed tools, reviews and corrects the transcript. The review process will take longer than if the proofreader were reviewing a transcript produced by a qualified transcriber, but any additional time and money spent on the proofing process is more than made up for by the savings achieved from the automated transcription.
Today, transcription providers may be benefiting from these cost savings, but the savings may not be passed on to transcript purchasers. Still, if transcript users are getting an accurate transcript, they probably don't care.
The big beneficiary of this model is the technology provider. Remember my comment above about data being gold to AI developers? This is equally true for these startups chasing opportunities in the court reporting market. These companies will never be able to collect as much data as Amazon can, but they don't need to.
Machine learning, a subset of AI, can be divided into two broad types: supervised learning and unsupervised learning. When you ask Alexa a question or give it a command, if you accept the response, then Alexa "infers" that its recognition was accurate. If, however, you repeat the request after a response, then the system may infer that its recognition was incorrect. This is an example of unsupervised learning; there is no established truth to be fed back into the system, only inference. Unsupervised learning can take a long time and requires a lot of data.
Supervised learning is based on the idea that there is a known truth. With a transcript, there is something close to a known truth. Accurate final transcripts can be fed back into the system for learning purposes. The system can compare the automated results with the "truth" of the final transcript and make adjustments for future processing. Supervised learning can achieve results much faster and requires far less data to get meaningful improvement. So an AI/human process that results in the technology provider having access to final transcripts can also result in a significant competitive advantage. Eventually, improvements will certainly benefit transcript users, but in the meantime…
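One common way to compare automated results with the "truth" of a final transcript is word error rate (WER): the number of word substitutions, deletions and insertions needed to turn the automated text into the reference, divided by the number of words in the reference. The following is a minimal sketch in Python, not a description of any vendor's system. The function name and the sample sentences are made up for illustration, and the comparison ignores punctuation, speaker identification and formatting, all of which matter in a real legal transcript.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word error rate of `hypothesis` measured against `reference`.

    `reference` stands in for the proofread final transcript and
    `hypothesis` for the raw speech recognition output (both hypothetical).
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must not be empty")

    # Word-level edit distance, computed one row at a time.
    # prev[j] holds the distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


final_transcript = "the witness was sworn in"
automated_draft = "the witness was worn in"
print(word_error_rate(final_transcript, automated_draft))
```

In this made-up example, one wrong word out of five reference words gives a WER of 0.2. A supervised training process would use differences like this one as the signal for adjusting the recognition model.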
So with AI/Human Processes, Can I Get Good Transcripts?
Probably not. And, here's why.
When you receive an accurate, certified transcript today, that transcript was likely produced by a qualified transcriber and reviewed by a qualified proofreader. Think of the proofreader as the quality assurance step in the process. Good transcription firms have well-developed processes using qualified and efficient teams of transcribers and proofreaders producing quality results. Quality does not happen just because the individuals are good; it happens when qualified individuals follow a good process.
Harold F. Dodge, one of the original architects of the science of statistical quality control, stated that "You cannot inspect quality into a product." And, to paraphrase W. Edwards Deming, the father of modern quality control science, proofreading does not improve the quality of the transcript. The quality, good or bad, is already in the transcript.
As a practical matter, what this means is that a qualified proofreader can consistently review and complete accurate transcripts when receiving quality work from transcribers. The lower the quality of the original content is, the lower the quality of the finished product will be. Automated transcripts are of far lower quality than those produced by qualified transcribers. Proofreaders cannot consistently turn them into high-quality transcripts. As of today, you will be disappointed in the results.
To quote W. Edwards Deming, this AI/human combo is a "system of make-and-inspect, which if applied to making toast would be expressed as: 'You burn, I'll scrape.'"
If Not Today, When?
Predicting that something is going to happen is easy. Predicting when is not easy—timing is everything. It is safe to say that automated speech recognition systems will become the standard method for transcript production in many industries, including court reporting. Is it ready today? No.
Will it be ready in a year? No.
Will it be ready in five years? Maybe.
Ten years? Probably.
If you are a classic early adopter and want to live on the bleeding edge, go for it. If you want to go into court with an accurate transcript from a witness deposition, hire a qualified court reporting firm and make sure your transcript is produced by a qualified transcriber and proofreader.
Steve Townsend is CEO of TheRecordXchange, a web‐based platform for court reporting professionals. He has extensive experience in courtroom and hearing room reporting and transcription. He was CEO of FTR from 1997 to 2007 and CEO of AVTranz from 2008 to 2015. Townsend is a co‐founder of the American Association of Electronic Reporters and Transcribers.