US10614808B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10614808-B2 |
| Application number | US-201414416156-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 30, 2014 |
| Priority date | Jun 30, 2014 |
| Publication date | Apr 7, 2020 |
| Grant date | Apr 7, 2020 |
Official abstract text for this publication.
Aspects of the disclosure provide systems and methods for facilitating dictation. Speech input may be provided to an audio input device of a computing device. A speech recognition engine at the computing device may obtain text corresponding to the speech input. The computing device may transmit the text to a remotely-located storage device. A login webpage that includes a session identifier may be accessed from a target computing device also located remotely relative to the storage device. The session identifier may be transmitted to the storage device and, in response, a text display webpage may be received at the target computing device. The text display webpage may include the speech-derived text and may be configured to automatically copy the text to a copy buffer of the target computing device. The speech-derived text may also be provided to native applications at target computing devices or NLU engines for natural language processing.
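The pairing flow in the abstract can be illustrated with a minimal sketch. It assumes an in-memory dictionary stands in for the remotely located storage device; the class name `DictationStore`, its method names, and the sample identifiers are hypothetical and do not come from the patent. The sketch shows only the association logic (text to user identifier, session identifier to user identifier), not speech recognition, the webpages themselves, or the copy-buffer behavior.

```python
import secrets
from typing import Dict, List, Optional, Set


class DictationStore:
    """Hypothetical in-memory stand-in for the remote storage device."""

    def __init__(self) -> None:
        self._issued_sessions: Set[str] = set()
        self._text_by_user: Dict[str, List[str]] = {}
        self._user_by_session: Dict[str, str] = {}

    def new_session(self) -> str:
        """Issue the session identifier that the login webpage would carry."""
        session_id = secrets.token_urlsafe(16)
        self._issued_sessions.add(session_id)
        return session_id

    def store_text(self, user_id: str, text: str) -> None:
        """Store speech-derived text sent by the dictating device."""
        self._text_by_user.setdefault(user_id, []).append(text)

    def link_session(self, session_id: str, user_id: str) -> None:
        """Associate a previously issued session identifier with a user."""
        if session_id not in self._issued_sessions:
            raise KeyError("unknown session identifier")
        self._user_by_session[session_id] = user_id

    def text_for_session(self, session_id: str) -> Optional[str]:
        """Return the user's text only once the session has been linked."""
        user_id = self._user_by_session.get(session_id)
        if user_id is None:
            return None
        return "\n".join(self._text_by_user.get(user_id, []))


store = DictationStore()
session_id = store.new_session()
store.store_text("user-1", "Patient reports mild headache.")
print(store.text_for_session(session_id))  # nothing yet: session not linked
store.link_session(session_id, "user-1")
print(store.text_for_session(session_id))  # text is now available
```

Keeping the two associations separate mirrors the abstract: text can be dictated before or after the login webpage is opened, and it becomes visible at the target device only after the session identifier has been tied to the same user.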
Opening claim text (preview).
What is claimed is:

1. A system for facilitating dictation comprising: a mobile speech recognition computing device comprising a microphone and a memory, the microphone generating an electrical signal corresponding to speech received at the microphone, the memory storing instructions that, when executed by a processor of the mobile speech recognition computing device, cause the mobile speech recognition computing device to generate, locally at the mobile speech recognition computing device and from the electrical signal, text corresponding to the speech; a storage device located remotely relative to the mobile speech recognition computing device and in signal communication with the mobile speech recognition computing device, the storage device receiving and storing the text from the mobile speech recognition computing device; and a webserver in signal communication with the storage device and located remotely relative to the mobile speech recognition computing device, the webserver comprising instructions that, when executed by a processor of the webserver, cause the webserver to: provide a first webpage to a second computing device comprising a copy buffer and located remotely relative to the storage device, the first webpage comprising a session identifier, receive the session identifier from one of the mobile speech recognition computing device or the second computing device, associate the text with a user identifier and associate the session identifier with the user identifier, and in response to receipt of the session identifier at the webserver and responsive to determining that the user identifier has been associated with the session identifier, provide a second webpage to the second computing device, the second webpage comprising the text and the second webpage automatically copying the text to the copy buffer of the second computing device in response to display of the second webpage at the second computing device, wherein the speech is first speech, and wherein the webserver updates the second webpage to include additional text obtained from second speech received at the microphone of the mobile speech recognition computing device subsequent to the first speech.

2. The system of claim 1 further comprising: a text-processing engine in signal communication with the storage device, the text-processing engine processing the text and providing output based on the text.

3. The system of claim 2 wherein: the text-processing engine is a natural language processing engine.

4. The system of claim 3 wherein: the output provided by the natural language processing engine is a command that controls a device located remotely relative to the natural language processing engine; and the natural language processing engine provides the command to the device.

5. The system of claim 1 wherein: the webserver encodes the session identifier in a barcode and includes the barcode in the first webpage; and the instructions of the mobile speech recognition computing device, when executed by the processor of the mobile speech recognition computing device, further cause the mobile speech recognition computing device to decode the barcode to obtain the session identifier and transmit the session identifier and the user identifier to the storage device.

6. The system of claim 1 wherein: the text included in the second webpage is obtained before the webserver provides the first webpage to the second computing device.

7. The system of claim 1 wherein: the text included in the second webpage is obtained after the webserver provides the first webpage to the second computing device.

8. The system of claim 1 wherein: the webserver updates the second webpage in real-time such that the additional text corresponding to the second speech is viewable in the second webpage at the second computing device while the second speech is received at the microphone of the mobile speech recognition computing device.

9. The system of claim 8 wherein: the webserver updates the second webpage by pushing the additional text to the second computing device.

10. The system of claim 1 wherein: the storage device stores the text in a dictation record of a set of dictation records; and the second webpage comprises information corresponding to one or more dictation records of the set of dictation records stored by the storage device.

11. A computer-implemented method of performing dictation comprising: receiving speech at a microphone of a mobile speech recognition computing device; generating, by the microphone, an electrical signal corresponding to the speech; obtaining, by the mobile speech recognition computing device locally at the mobile speech recognition computing device, text corresponding to the speech; transmitting the text from the mobile speech recognition computing device to a storage device located remotely relative to the mobile speech recognition computing device; accessing a first webpage from a second computing device that comprises a copy buffer and that is located remotely relative to the storage device, the first webpage comprising a session identifier; transmitting the session identifier to the storage device from one of the mobile speech recognition computing device or the second computing device; associating the text with a user identifier and associating the session identifier with the user identifier; in response to receipt of the session identifier at the storage device and responsive to determining that the user identifier has been associated with the session identifier, receiving a second webpage comprising the text at the second computing device, the second webpage being provided by a webserver located remotely relative to the mobile speech recognition computing device and the second webpage automatically copying the text to the copy buffer of the second computing device in response to display of the second webpage at the second computing device, wherein the speech is first speech; and updating the second webpage based upon, at least in part, additional text obtained from second speech received at the microphone of the mobile speech recognition computing device subsequent to the first speech.

12. The method of claim 11 wherein: the speech is received at the microphone before the first webpage is accessed.

13. The method of claim 11 wherein: the speech is received at the microphone after the first webpage is accessed.

14. The method of claim 11 further comprising: scanning a barcode included in the first webpage with the mobile speech recognition computing device wherein, the barcode encoding the session identifier; decoding the barcode with the mobile speech recognition computing device to obtain the session identifier; and transmitting the session identifier to the storage device from the mobile speech recognition computing device.

15. The method of claim 11 further comprising: transmitting a timestamp from the mobile speech recognition computing device to the storage device with the text and the user identifier; and wherein the timestamp is included in the second webpage with the text.

16. The method of claim 11 wherein: the second computing device receives the update in real-time.

17. The method of claim 11 further comprising: editing the text at the mobile speech recognition computing device before transmitting the text to the storage device.
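The update behavior recited in claims 1, 9, 10, and 15 (additional text from later speech, pushed to the second device, held as timestamped dictation records) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: `DictationRecord`, `TextDisplayPage`, and the callback-based `subscribe` mechanism are hypothetical names, and the callback merely stands in for whatever push transport a real webserver would use, which the claims do not specify.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Callable, List, Optional


@dataclass
class DictationRecord:
    """One stored dictation: who said it, the text, and when (claims 10, 15)."""
    user_id: str
    text: str
    timestamp: datetime


@dataclass
class TextDisplayPage:
    """Hypothetical model of the second webpage's content for one user."""
    user_id: str
    records: List[DictationRecord] = field(default_factory=list)
    listeners: List[Callable[[DictationRecord], None]] = field(default_factory=list)

    def subscribe(self, listener: Callable[[DictationRecord], None]) -> None:
        """Register a callback standing in for a pushed update (claim 9)."""
        self.listeners.append(listener)

    def add_record(self, text: str,
                   timestamp: Optional[datetime] = None) -> DictationRecord:
        """Append text from new speech and notify every subscriber."""
        when = timestamp or datetime.now(timezone.utc)
        record = DictationRecord(self.user_id, text, when)
        self.records.append(record)
        for listener in self.listeners:
            listener(record)
        return record

    def render(self) -> str:
        """Plain-text view of the page: one timestamped line per record."""
        return "\n".join(
            f"[{r.timestamp.isoformat()}] {r.text}" for r in self.records
        )


page = TextDisplayPage(user_id="user-1")
page.subscribe(lambda record: print("pushed:", record.text))
page.add_record("First dictated sentence.")
page.add_record("Second sentence, spoken later.")
print(page.render())
```

Copying the rendered text to the copy buffer of the second computing device would happen in the browser and is deliberately left out of this sketch.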
Speech to text systems (G10L15/08 takes precedence) · CPC title
Execution procedure of a spoken command · CPC title
Semantic analysis · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
using automatically derived transcript of audio data, e.g. lyrics (speech recognition G10L15/00) · CPC title