Intelligent text-to-speech conversion
US-8996376-B2 · Mar 31, 2015 · US
US9620124B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9620124-B2 |
| Application number | US-201414193590-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 28, 2014 |
| Priority date | Feb 28, 2014 |
| Publication date | Apr 11, 2017 |
| Grant date | Apr 11, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In some embodiments, a system may process a user interface to identify textual or graphical items in the interface, and may prepare a plurality of audio files containing spoken representations of the items. As the user navigates through the interface, different ones of the audio files may be selected and played, to announce text associated with items selected by the user. A computing device may periodically determine whether a cache offering the interface to users stores audio files for all of the interface's textual items, and if the cache is missing any audio files for any of the textual items, the computing device may take steps to have a corresponding audio file created.
Opening claim text (preview).
We claim: 1. A method comprising: identifying a textual item in a user interface; associating a plurality of different audio files with the textual item, wherein the plurality of different audio files comprise a corresponding plurality of different audio announcements of the textual item; receiving, by a computing device, a first selection of the textual item in the user interface; determining a first one of the plurality of different audio files to use for audibly announcing the textual item, wherein the determining is based on a rule governing audio announcement in response to repeated selection of the textual item; causing playback of the first one of the plurality of different audio files based on the determining and responsive to the first selection of the textual item; receiving, by the computing device, a second selection of the textual item in the user interface; determining whether the second selection is a repeated selection of the textual item; and in response to determining that the second selection is a repeated selection of the textual item, causing playback of a second one of the plurality of different audio files responsive to the repeated selection of the textual item, wherein the second one of the plurality of different audio files comprises a different amount of explanatory speech than the first one of the plurality of different audio files. 2. The method of claim 1 , further comprising playing the first one of the plurality of different audio files. 3. The method of claim 1 , further comprising: identifying a plurality of screens of the user interface; associating a first screen audio file with a first screen of the plurality of screens; and in response to a selection of the first screen for display, retrieving the first screen audio file for playback. 4. The method of claim 1 , further comprising: periodically retrieving a current version of the user interface and determining whether the current version of the user interface comprises any textual items for which there is not already a corresponding audio file. 5. The method of claim 4 , wherein in response to determining that the current version of the user interface comprises one or more textual items for which there is not already a corresponding audio file, the method comprising transmitting an audio file request for the one or more textual items for which there is not already a corresponding audio file. 6. The method of claim 4 , wherein the determining whether the current version of the user interface comprises any textual items for which there is not already a corresponding audio file comprises: transmitting a request for an audio file header corresponding to a textual item under consideration; receiving the audio file header; and determining whether a size indicated in the audio file header is greater than a predetermined minimum size value. 7. The method of claim 4 , wherein the determining whether the current version of the user interface comprises any textual items for which there is not already a corresponding audio file comprises: retrieving a voice announcement identifier for a textual item under consideration; and consulting a database comprising a listing of voice announcement identifiers and corresponding entries indicating whether listed voice announcement identifiers are associated with audio files. 8. The method of claim 1 , wherein the textual item comprises a description of a video program, and wherein the plurality of different audio files comprise: the first one of the plurality of different audio files comprises a full audio reading of the description of the video program; and the second one of the plurality of different audio files comprises an abbreviated reading of the description of the video program, wherein in the abbreviated reading, certain words are shortened. 9. The method of claim 1 , further comprising: presenting the user interface on a first device at a premises, together with audio and video from an audiovisual program; and receiving a request to play the one of the plurality of different audio files on a second device at the premises, and not on the first device. 10. The method of claim 1 , further comprising: presenting the user interface on a first device at a premises, together with audio and video from an audiovisual program; and delivering the first and second ones of the plurality of audio files to corresponding first and second different user devices for playback in response to respective selections of the textual item by the first and second different user devices. 11. The method of claim 1 , wherein the plurality of different audio files correspond to different user expertise levels in navigating a menu corresponding to the user interface. 12. The method of claim 11 , wherein the first one of the plurality of different audio files comprises a greater amount of explanatory speech than the second one of the plurality of different audio files. 13. The method of claim 1 , further comprising storing the plurality of different audio files in a cache in advance of receiving the first selection of the textual item. 14. The method of claim 1 , further comprising: while a user is accessing the user interface, generating a new audio file to correspond with a new textual item in the user interface. 15. The method of claim 14 , further comprising: as the user accesses different screens of the user interface, determining whether the plurality of different audio files are cached in advance of user selection of the textual item in the different screens. 16. The method of claim 1 , wherein the textual item comprises a video program title. 17. A computer-readable medium storing instructions that, when executed, cause the following to occur: identifying a textual item in a user interface; associating a plurality of different audio files with the textual item, wherein the plurality of different audio files comprise a corresponding plurality of different audio announcements of the textual item; receiving, by a computing device, a first selection of the textual item in the user interface; determining a first one of the plurality of different audio files to use for audibly announcing the textual item, wherein the determining is based on a rule governing audio announcement in response to repeated selection of the textual item; causing playback of the first one of the plurality of different audio files based on the determining and responsive to the first selection of the textual item; receiving, by the computing device, a second selection of the textual item in the user interface; determining whether the second selection is a repeated selection of the textual item; and in response to determining that the second selection is a repeated selection of the textual item, causing playback of a second one of the plurality of different audio files responsive to the repeated selection of the textual item, wherein the second one of the plurality of different audio files comprises a different amount of explanatory speech than the first one of the plurality of different audio files. 18. The computer-readable medium of claim 17 , further storing instructions that, when executed, cause playing the first one of the plurality of different audio files. 19. The computer-readable medium of claim 17 , further storing instructions that, when executed, cause the following to occur: identifying a plurality of screens of the user interface; associating a first screen audio file with a first screen of the plurality of
Arrangements for controlling cursor position based on codes indicative of cursor displacements from one discrete location to another, e.g. using cursor control keys associated to different directions or using the tab key (arrangements for controlling cursor position based on coordinate signals G06F3/038) · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content {(end-user interfaces for retrieving video data from a database G06F16/739; network services for supporting unicast streaming H04L65/612)} · CPC title
based on user interaction within the home (receiver circuitry for displaying additional information being controlled by a remote control apparatus H04N21/42204) · CPC title
Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.