Methods and Systems for Correcting Transcribed Audio Files

US2021118428A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2021118428-A1
Application number  US-202017089179-A
Country             US
Kind code           A1
Filing date         Nov 4, 2020
Priority date       Apr 17, 2006
Publication date    Apr 22, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems for correcting transcribed text. One method includes receiving audio data from one or more audio data sources and transcribing the audio data based on a voice model to generate text data. The method also includes making the text data available to a plurality of users over at least one computer network and receiving corrected text data over the at least one computer network from the plurality of users. In addition, the method can include modifying the voice model based on the corrected text data.
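The loop the abstract describes (transcribe, publish, collect corrections, modify the voice model) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: `ToyVoiceModel` stands in for a real voice model by storing a table of word substitutions, `recognized_words` stands in for raw recognizer output, and corrections are assumed to keep the same word count as the transcription so words can be aligned by position.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ToyVoiceModel:
    """Hypothetical stand-in for a voice model: a table of learned word fixes."""

    substitutions: dict[str, str] = field(default_factory=dict)

    def transcribe(self, recognized_words: list[str]) -> str:
        # Apply every substitution learned so far to the raw recognizer output.
        return " ".join(self.substitutions.get(w, w) for w in recognized_words)

    def updated_with(self, recognized_words: list[str], corrected: str) -> ToyVoiceModel:
        # Assumption: the correction has the same word count as the input,
        # so raw and corrected words can be paired by position.
        new_subs = dict(self.substitutions)
        for raw, fixed in zip(recognized_words, corrected.split()):
            if raw != fixed:
                new_subs[raw] = fixed
        return ToyVoiceModel(new_subs)


def correction_round(
    model: ToyVoiceModel,
    recognized_words: list[str],
    user_corrections: list[str],
) -> tuple[str, ToyVoiceModel]:
    """Transcribe once, then fold each user's corrected text into the model."""
    text = model.transcribe(recognized_words)
    for corrected in user_corrections:
        model = model.updated_with(recognized_words, corrected)
    return text, model


first_portion = ["please", "call", "doctor", "smyth"]
text, model = correction_round(
    ToyVoiceModel(), first_portion, ["please call doctor smith"]
)
second_text = model.transcribe(["smyth", "is", "out"])
```

In this sketch the first portion is transcribed with the uncorrected model, and the second portion benefits from the correction a user supplied for the first. A real system would adapt acoustic or language model parameters rather than a lookup table.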

First claim

Opening claim text (preview).

1-10. (canceled)
11. A memory device having instructions stored thereon that, in response to execution by a processor, cause the processor to perform operations comprising: generating a transcription of a first portion of audio data using a voice model, wherein the audio data comprises a single audio file with a recording of a plurality of speakers; receiving a correction to the transcription of the first portion of the audio data; generating an updated voice model based on the correction to the transcription of the first portion of the audio data; and generating a transcription of a second portion of audio data using the updated voice model.
12. The memory device of claim 11, wherein the voice model comprises a voice-independent model.
13. The memory device of claim 11, wherein at least one of the first portion of the audio data and the second portion of the audio data originates from a VoiP voicemail server.
14. The memory device of claim 11, wherein at least one of the first portion of the audio data and the second portion of the audio data originates from a client computer coupled to at least one computer network.
15. The memory device of claim 11, wherein the operations further comprise extracting at least one of the first portion of the audio data and the second portion of the audio data from an e-mail message.
16. The memory device of claim 11, wherein the operations further comprise requesting at least one of the first portion of the audio data and the second portion of the audio data from a remote source.
17. The memory device of claim 11, wherein the operations further comprise prioritizing the first portion of the audio data.
18. The memory device of claim 11, wherein the first portion of the audio data comprises a first remotely user-selected portion of the audio data, and wherein the second portion of the audio data includes a second remotely user-selected portion of the audio data.
19. The memory device of claim 11, wherein the operations further comprise determining when each of the plurality of speakers is speaking.
20. The memory device of claim 11, wherein the transcription of the first portion of audio data comprises a first text data set associated with one of the plurality of speakers and a second text data set associated with another one of the plurality of speakers.
21. A method comprising: generating a transcription of a first portion of audio data using a voice model, wherein the audio data comprises a single audio file with a recording of a plurality of speakers; receiving a correction to the transcription of the first portion of the audio data; generating an updated voice model based on the correction to the transcription of the first portion of the audio data; and generating a transcription of a second portion of audio data using the updated voice model.
22. The method of claim 21, wherein the voice model comprises a voice-independent model.
23. The method of claim 21, wherein at least one of the first portion of the audio data and the second portion of the audio data originates from a VoiP voicemail server.
24. The method of claim 21, wherein at least one of the first portion of the audio data and the second portion of the audio data originates from a client computer coupled to at least one computer network.
25. The method of claim 21, further comprising extracting at least one of the first portion of the audio data and the second portion of the audio data from an e-mail message.
26. The method of claim 21, further comprising requesting at least one of the first portion of the audio data and the second portion of the audio data from a remote source.
27. The method of claim 21, wherein the first portion of the audio data comprises a first remotely user-selected portion of the audio data, and wherein the second portion of the audio data includes a second remotely user-selected portion of the audio data.
28. The method of claim 21, further comprising determining when each of the plurality of speakers is speaking.
29. The method of claim 21, wherein the transcription of the first portion of audio data comprises a first text data set associated with one of the plurality of speakers and a second text data set associated with another one of the plurality of speakers.
30. A system comprising: means for generating a transcription of a first portion of audio data using a voice model, wherein the audio data comprises a single audio file with a recording of a plurality of speakers; means for receiving a correction to the transcription of the first portion of the audio data; means for generating an updated voice model based on the correction to the transcription of the first portion of the audio data; and means for generating a transcription of a second portion of audio data using the updated voice model.
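The speaker-related dependent claims (determining when each speaker is speaking, and associating a separate text data set with each speaker) suggest a simple data structure, sketched below. The claims do not say how speakers are identified, so this sketch assumes speaker labels and timestamps are already attached to each transcribed segment. `Segment`, `text_sets_by_speaker`, and `speaking_intervals` are hypothetical names introduced only for illustration.

```python
from collections import defaultdict
from typing import NamedTuple


class Segment(NamedTuple):
    """One transcribed stretch of a single audio file (hypothetical structure)."""

    start_s: float  # segment start, in seconds from the beginning of the file
    end_s: float    # segment end, in seconds
    speaker: str    # assumed to be supplied by an upstream speaker-labeling step
    text: str       # transcription of this segment


def text_sets_by_speaker(segments: list[Segment]) -> dict[str, str]:
    """Group transcribed text into one text data set per speaker, in time order."""
    parts: dict[str, list[str]] = defaultdict(list)
    for seg in sorted(segments, key=lambda s: s.start_s):
        parts[seg.speaker].append(seg.text)
    return {speaker: " ".join(texts) for speaker, texts in parts.items()}


def speaking_intervals(segments: list[Segment]) -> dict[str, list[tuple[float, float]]]:
    """Report when each speaker is speaking as (start, end) pairs in seconds."""
    intervals: dict[str, list[tuple[float, float]]] = defaultdict(list)
    for seg in sorted(segments, key=lambda s: s.start_s):
        intervals[seg.speaker].append((seg.start_s, seg.end_s))
    return dict(intervals)


segments = [
    Segment(0.0, 2.5, "A", "hello there"),
    Segment(2.5, 4.0, "B", "hi"),
    Segment(4.0, 6.0, "A", "how are you"),
]
sets = text_sets_by_speaker(segments)
intervals = speaking_intervals(segments)
```

Keeping one text data set per speaker means a correction can be attributed to the speaker whose words were mistranscribed, which is one plausible reason to separate them. The claims themselves do not state that motivation.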

Assignees

Inventors

Classifications

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • G10L15/00 · Primary

    Speech recognition (G10L17/00 takes precedence) · CPC title

  • G10L15/18 · Primary

    using natural language modelling · CPC title

  • Indexing; Data structures therefor; Storage structures · CPC title

  • Adaptation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021118428A1 cover?
Methods and systems for correcting transcribed text. One method includes receiving audio data from one or more audio data sources and transcribing the audio data based on a voice model to generate text data. The method also includes making the text data available to a plurality of users over at least one computer network and receiving corrected text data over the at least one computer network f…
Who is the assignee on this patent?
III Holdings 1, LLC
What technology area does this patent fall under?
Primary CPC classification G10L15/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Apr 22, 2021 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).