Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment
US-9495964-B2 · Nov 15, 2016 · US
US11482240B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11482240-B2 |
| Application number | US-201816650374-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 29, 2018 |
| Priority date | Sep 25, 2017 |
| Publication date | Oct 25, 2022 |
| Grant date | Oct 25, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method to present communications is provided. The method may include obtaining, at a device, a request from a user to play back a stored message that includes audio. In response to obtaining the request, the method may include directing the audio of the message to a transcription system from the device. In these and other embodiments, the transcription system may be configured to generate text that is a transcription of the audio in real-time. The method may further include obtaining, at the device, the text from the transcription system and presenting, by the device, the text generated by the transcription system in real-time. In response to obtaining the text from the transcription system, the method may also include presenting, by the device, the audio such that the text as presented is substantially aligned with the audio.
Opening claim text (preview).
What is claimed is: 1. A device comprising: a display; a speaker; a processor communicatively coupled to the display and to the speaker; and at least one non-transitory computer-readable media communicatively coupled to the processor and configured to store one or more instructions that when executed by the processor cause the device to perform operations comprising: obtain, at the device, a request to play a communication that includes video and audio; in response to obtaining the request, direct the audio to a transcription system from the device, the transcription system configured to generate text that is a transcription of the audio in real-time; obtain, at the device, the text generated by the transcription system; determine a buffer length based on a time difference between directing the audio to the transcription system and obtaining the text from the transcription system; buffer the video and the audio based on the time difference; in response to obtaining the text from the transcription system: present, by the display, the text from the transcription system and the buffered video in real-time; and provide the buffered audio to the speaker for presentation by the speaker such that the text is substantially aligned with the buffered audio presented by the speaker. 2. The device of claim 1 , wherein the communication is stored in the at least one non-transitory computer-readable media of the device. 3. The device of claim 1 , wherein the communication is stored outside of the device and the operations further comprise obtain, at the device, the communication over a network, wherein the audio of the communication is directed to the transcription system after being obtained by the device. 4. The device of claim 1 , wherein the operations further comprise during an interval between obtaining the request and presenting the buffered video and the buffered audio, present, on the display, a message notifying of a delay in presenting the buffered video and the buffered audio and the text. 5. The device of claim 1 , wherein the operations further comprise, after determining the buffered length, adjust the buffered length based on a network connection between the transcription system and the device. 6. The device of claim 1 , wherein the communication is a first communication, the operations further comprise: obtain, at the device, a second request to play a second communication that includes second audio; in response to obtaining the second request, direct the second audio to the transcription system from the device, the transcription system configured to generate second text that is a transcription of the second audio in real-time; buffer the second audio based on the time difference; and provide the buffered second audio to the speaker for presentation by the speaker without regard to the second text obtained from the transcription system. 7. A method to present communications, the method comprising: obtaining a communication that includes audio; directing the audio of the communication to a transcription system, the transcription system configured to generate text that is a transcription of the audio in real-time; obtaining the text from the transcription system; determining a buffer length based on a time difference between directing the audio to the transcription system and obtaining the text from the transcription system; buffering the audio based on the time difference; and providing the text and the buffered audio for presentation such that during the presentation the text as presented is substantially aligned with the buffered audio during real-time presentation of the text and the buffered audio. 8. The method of claim 7 , wherein the communication is stored at a device that presents the text and the buffered video and the buffered audio. 9. The method of claim 7 , wherein the communication is stored outside of a device that presents the text and the buffered video and the buffered audio and the method further comprises directing, to the device, the communication over a network. 10. The method of claim 7 , wherein the communication further includes video and the method further comprises: buffering the video based on the time difference; and providing the buffered video for presentation. 11. The method of claim 7 , further comprising after determining the buffered length, adjusting the buffered length based on a network connection with the transcription system. 12. One or more non-transitory computer-readable media configured to store one or more instructions that when executed by one or more processors cause a system to perform operations, the operations comprising: obtain a communication that includes audio; direct the audio of the communication to a transcription system, the transcription system configured to generate text that is a transcription of the audio in real-time; obtain the text from the transcription system; determine a buffer length based on a time difference between directing the audio to the transcription system and obtaining the text from the transcription system; buffer the audio based on the time difference; and provide the text and the buffered audio for presentation such that during the presentation the text as presented is substantially aligned with the buffered audio during real-time presentation of the text and the buffered audio. 13. The non-transitory computer-readable media of claim 12 , wherein the communication is stored at a device that presents the text and the buffered video and the buffered audio. 14. The non-transitory computer-readable media of claim 12 , wherein the communication is stored outside of a device that presents the text and the buffered video and the buffered audio and the operations further comprise direct, to the device, the communication over a network. 15. The non-transitory computer-readable media of claim 12 , wherein the communication further includes video and the operations further comprise: buffer the video based on the time difference; and provide the buffered video for presentation. 16. The non-transitory computer-readable media of claim 12 , further comprising after determining the buffered length, adjust the buffered length based on a network connection with the transcription system.
where the subscribers are hearing-impaired persons, e.g. telephone devices for the deaf · CPC title
Delay circuits; Timers · CPC title
Message receiving aspects · CPC title
Message preview · CPC title
Digital output to display device {; Cooperation and interconnection of the display device with other functional units} · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.