Transcription generation technique selection

US11741964B2 · US · B2

Patent metadata
Publication number: US-11741964-B2
Application number: US-202016885039-A
Country: US
Kind code: B2
Filing date: May 27, 2020
Priority date: May 27, 2020
Publication date: Aug 29, 2023
Grant date: Aug 29, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method to transcribe communications may include selecting a first transcription generation technique from among multiple transcription generation techniques for generating transcriptions of audio of one or more communication sessions that involve a user device and obtaining performances of the multiple transcription generation techniques with respect to generating the transcriptions of the audio. The method may also include monitoring comparisons between the performances of the multiple transcription generation techniques and obtaining input from the user with respect to the comparisons. The method may further include selecting a second transcription generation technique from among the multiple transcription generation techniques based on the input from the user.
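The flow in the abstract (compare technique performances, report to the user, switch only on user input) can be illustrated with a minimal sketch. Everything named here is hypothetical: the `Performance` fields, the technique names `asr_only` and `revoiced_asr`, and the ranking rule (accuracy first, then latency) are assumptions chosen for illustration, not details taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Performance:
    """Measured performance of one technique (illustrative fields only)."""
    technique: str
    accuracy: float   # fraction of words transcribed correctly, 0.0 to 1.0
    latency_s: float  # average delay before text appears, in seconds


def build_report(current: str, performances: list) -> dict:
    """Compare techniques and recommend one.

    Assumed ranking rule: highest accuracy first, lowest latency as tie-breaker.
    """
    ranked = sorted(performances, key=lambda p: (-p.accuracy, p.latency_s))
    return {
        "current": current,
        "ranked": ranked,
        "recommended": ranked[0].technique,
    }


def select_next_technique(current: str, report: dict, user_accepts: bool) -> str:
    """Pick the technique for a later session based on the user's input."""
    return report["recommended"] if user_accepts else current


performances = [
    Performance("asr_only", accuracy=0.91, latency_s=1.2),
    Performance("revoiced_asr", accuracy=0.96, latency_s=3.5),
]
report = build_report("asr_only", performances)
next_technique = select_next_technique("asr_only", report, user_accepts=True)
```

In this sketch the switch takes effect only for a later session and only after the user responds to the report, which mirrors the ordering described in the abstract.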

First claim

Opening claim text (preview).

I claim:

1. A method to transcribe communications, the method comprising: obtaining a performance of a first transcription generation technique with respect to generating transcriptions of audio of a first communication session associated with a user; obtaining a performance of a second transcription generation technique with respect to generating transcriptions of the audio of the first communication session; determining a report based on the performance of the first transcription generation technique and the performance of the second transcription generation technique; directing the report to a first device associated with the user; in response to the report, obtaining an indication from the first device; and directing a transcription of a second communication session to a second device for presentation to the user, the transcription being generated by the second transcription generation technique in response to the indication from the first device, wherein the second communication session is independent and distinct from the first communication session and an entirety of the second communication session occurs after the first communication session.

2. The method of claim 1, wherein the performance of the second transcription generation technique is based on one or more of the following: transcription accuracy, transcription latency, and number of transcription corrections.

3. The method of claim 2, wherein the performance of the second transcription generation technique is based on a plurality of communications sessions that include the first communication session.

4. The method of claim 1, wherein the report includes a recommendation for the second transcription generation technique and the indication includes a selection of the second transcription generation technique.

5. The method of claim 1, wherein the first device and the second device are the same device.

6. The method of claim 1, further comprising before determining the report, directing a second transcription of the first communication session that involves the second device to the second device, the second transcription generated by the first transcription generation technique.

7. The method of claim 6, wherein the steps of directing the report and of obtaining the indication occur during the first communication session.

8. The method of claim 1, wherein one of the first transcription generation technique and the second transcription generation technique includes a revoicing of audio before transcription generation.

9. At least one non-transitory computer-readable media configured to store one or more instructions that when executed by at least one processor cause or direct a system to perform the method of claim 1.

10. A method to transcribe communications, the method comprising: selecting a first transcription generation technique from among a plurality of transcription generation techniques for generating transcriptions of audio of one or more communication sessions that involve a user device of a user; obtaining performances of the plurality of transcription generation techniques with respect to generating the transcriptions of the audio; monitoring comparisons between the performances of the plurality of transcription generation techniques; obtaining input from the user with respect to the comparisons; and selecting a second transcription generation technique from among the plurality of transcription generation techniques based on the input from the user, the selected second transcription generation technique being used for generating transcriptions of audio of a second communication session that involves the user device, the second communication session occurring after the one or more communication sessions.

11. The method of claim 10, wherein the performances of the plurality of transcription generation techniques are based on one or more of the following: transcription accuracy and transcription latency.

12. The method of claim 10, further comprising directing a report to the user based on the comparison, wherein the input is obtained in response to report.

13. The method of claim 10, wherein the second transcription generation technique does not generate a transcription of the audio such that the performance of the second transcription generation technique is an estimated performance.

14. The method of claim 10, wherein the selection of the first transcription generation technique is based on the performance of the first transcription generation technique.

15. The method of claim 10, wherein the monitoring comparisons between the performances of the plurality of transcription generation techniques occur with respect to a first communication session that involves the user device.

16. The method of claim 15, the second transcription generation technique is selected to generate transcriptions of audio of the first communication session during the first communication session.

17. At least one non-transitory computer-readable media configured to store one or more instructions that when executed by at least one processor cause or direct a system to perform the method of claim 10.

18. A system comprising: one or more processors; and one or more non-transitory computer-readable mediums configured to store instructions that when executed by the processors cause or direct the system to perform operations, the operations comprising: select a first transcription generation technique from among a plurality of transcription generation techniques for generating transcriptions of audio of one or more communication sessions that involve a user device; obtain performances of the plurality of transcription generation techniques with respect to generating the transcriptions of the audio; monitor comparisons between the performances of the plurality of transcription generation techniques; obtain input from the user device with respect to the comparisons; and select a second transcription generation technique from among the plurality of transcription generation techniques based on the input from the user device, the selected second transcription generation technique being used for generating transcriptions of audio of a second communication session that involves the user device, the second communication session occurring after the one or more communication sessions.
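Claims 2, 3, and 11 name transcription accuracy, transcription latency, and number of transcription corrections as possible bases for a technique's performance, optionally across several sessions. The claims do not say how accuracy is measured. The sketch below assumes word error rate (word-level edit distance divided by reference length), a common speech-recognition metric, and averages it over sessions. The function names and the sample tuple layout are hypothetical.

```python
from statistics import mean


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must not be empty")
    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, substitution))
        prev = cur
    return prev[-1] / len(ref)


def summarize(samples):
    """Aggregate one technique's performance over several sessions.

    Each sample is (reference, hypothesis, latency_s, corrections).
    Accuracy is assumed to be 1 - WER, floored at 0.
    """
    samples = list(samples)
    return {
        "accuracy": mean(
            1.0 - min(word_error_rate(r, h), 1.0) for r, h, _, _ in samples
        ),
        "latency_s": mean(lat for _, _, lat, _ in samples),
        "corrections": sum(c for _, _, _, c in samples),
    }


summary = summarize([
    ("please call me back tomorrow", "please call me back tomorrow", 1.0, 0),
    ("please call me back tomorrow", "please call be back", 3.0, 2),
])
```

Computing accuracy this way needs a reference transcript for each session. Claim 13 also allows an estimated performance for a technique that did not actually transcribe the audio, which this sketch does not model.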

Assignees

Sorenson IP Holdings LLC

Inventors

Classifications

  • G10L15/26 (Primary): Speech to text systems (G10L15/08 takes precedence)

  • based on feedback of a supervisor

  • Distributed recognition, e.g. in client-server systems, for mobile phones or network applications

  • Transforming into visible information

  • Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11741964B2 cover?
It covers a method to transcribe communications. A first transcription generation technique is selected from among multiple techniques for transcribing audio of communication sessions that involve a user device, the performances of the techniques are compared, the user gives input on the comparisons, and a second technique is selected based on that input.
Who is the assignee on this patent?
Sorenson IP Holdings LLC
What technology area does this patent fall under?
Primary CPC classification G10L15/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Aug 29, 2023 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).