Mobile device for speech input and text delivery

US10614808B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10614808-B2
Application number	US-201414416156-A
Country	US
Kind code	B2
Filing date	Jun 30, 2014
Priority date	Jun 30, 2014
Publication date	Apr 7, 2020
Grant date	Apr 7, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title: What the patent document calls the invention.
Abstract: A short plain-language summary of the technical disclosure.
Assignees and inventors: Who owns or filed the patent and who is credited as inventor.
Key dates: Filing, priority, publication, and grant dates set the timeline.
First independent claim: The legal scope of protection. Read this for what is actually claimed.
CPC / IPC classifications: Technology tags used to group this patent with similar filings.
Citations and related patents: Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Aspects of the disclosure provide systems and methods for facilitating dictation. Speech input may be provided to an audio input device of a computing device. A speech recognition engine at the computing device may obtain text corresponding to the speech input. The computing device may transmit the text to a remotely-located storage device. A login webpage that includes a session identifier may be accessed from a target computing device also located remotely relative to the storage device. The session identifier may be transmitted to the storage device and, in response, a text display webpage may be received at the target computing device. The text display webpage may include the speech-derived text and may be configured to automatically copy the text to a copy buffer of the target computing device. The speech-derived text may also be provided to native applications at target computing devices or NLU engines for natural language processing.
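The flow in the abstract (upload text, open a login page carrying a session identifier, redeem that identifier, receive the text) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the patented implementation: `DictationStore` and all of its method names are hypothetical, the storage device and webserver are collapsed into one in-memory object, and the browser-side copy-to-clipboard step is not modeled.

```python
import secrets
from dataclasses import dataclass, field


@dataclass
class DictationStore:
    """Hypothetical in-memory stand-in for the remote storage device and webserver."""

    texts_by_user: dict = field(default_factory=dict)    # user id -> list of dictated texts
    user_by_session: dict = field(default_factory=dict)  # session id -> user id (None until paired)

    def store_text(self, user_id, text):
        # The mobile device uploads text it recognized locally.
        self.texts_by_user.setdefault(user_id, []).append(text)

    def login_page(self):
        # The login page carries a fresh, hard-to-guess session identifier.
        session_id = secrets.token_urlsafe(16)
        self.user_by_session[session_id] = None
        return session_id

    def pair(self, session_id, user_id):
        # The session identifier is sent back together with a user identifier.
        if session_id not in self.user_by_session:
            raise KeyError("unknown session identifier")
        self.user_by_session[session_id] = user_id

    def text_page(self, session_id):
        # Serve the text only once the session is associated with a user.
        user_id = self.user_by_session.get(session_id)
        if user_id is None:
            return None
        return list(self.texts_by_user.get(user_id, []))


store = DictationStore()
store.store_text("alice", "Dictated on the phone.")
sid = store.login_page()
print(store.text_page(sid))   # nothing is served before pairing
store.pair(sid, "alice")
print(store.text_page(sid))   # the stored text is served after pairing
```

Keeping the session-to-user mapping separate from the user-to-text mapping mirrors the two associations the abstract and claims describe, so text dictated before or after the login page is opened is served the same way.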

First claim

Opening claim text (preview).

What is claimed is:

1. A system for facilitating dictation comprising: a mobile speech recognition computing device comprising a microphone and a memory, the microphone generating an electrical signal corresponding to speech received at the microphone, the memory storing instructions that, when executed by a processor of the mobile speech recognition computing device, cause the mobile speech recognition computing device to generate, locally at the mobile speech recognition computing device and from the electrical signal, text corresponding to the speech; a storage device located remotely relative to the mobile speech recognition computing device and in signal communication with the mobile speech recognition computing device, the storage device receiving and storing the text from the mobile speech recognition computing device; and a webserver in signal communication with the storage device and located remotely relative to the mobile speech recognition computing device, the webserver comprising instructions that, when executed by a processor of the webserver, cause the webserver to: provide a first webpage to a second computing device comprising a copy buffer and located remotely relative to the storage device, the first webpage comprising a session identifier, receive the session identifier from one of the mobile speech recognition computing device or the second computing device, associate the text with a user identifier and associate the session identifier with the user identifier, and in response to receipt of the session identifier at the webserver and responsive to determining that the user identifier has been associated with the session identifier, provide a second webpage to the second computing device, the second webpage comprising the text and the second webpage automatically copying the text to the copy buffer of the second computing device in response to display of the second webpage at the second computing device, wherein the speech is first speech, and wherein the webserver updates the second webpage to include additional text obtained from second speech received at the microphone of the mobile speech recognition computing device subsequent to the first speech.

2. The system of claim 1 further comprising: a text-processing engine in signal communication with the storage device, the text-processing engine processing the text and providing output based on the text.

3. The system of claim 2 wherein: the text-processing engine is a natural language processing engine.

4. The system of claim 3 wherein: the output provided by the natural language processing engine is a command that controls a device located remotely relative to the natural language processing engine; and the natural language processing engine provides the command to the device.

5. The system of claim 1 wherein: the webserver encodes the session identifier in a barcode and includes the barcode in the first webpage; and the instructions of the mobile speech recognition computing device, when executed by the processor of the mobile speech recognition computing device, further cause the mobile speech recognition computing device to decode the barcode to obtain the session identifier and transmit the session identifier and the user identifier to the storage device.

6. The system of claim 1 wherein: the text included in the second webpage is obtained before the webserver provides the first webpage to the second computing device.

7. The system of claim 1 wherein: the text included in the second webpage is obtained after the webserver provides the first webpage to the second computing device.

8. The system of claim 1 wherein: the webserver updates the second webpage in real-time such that the additional text corresponding to the second speech is viewable in the second webpage at the second computing device while the second speech is received at the microphone of the mobile speech recognition computing device.

9. The system of claim 8 wherein: the webserver updates the second webpage by pushing the additional text to the second computing device.

10. The system of claim 1 wherein: the storage device stores the text in a dictation record of a set of dictation records; and the second webpage comprises information corresponding to one or more dictation records of the set of dictation records stored by the storage device.

11. A computer-implemented method of performing dictation comprising: receiving speech at a microphone of a mobile speech recognition computing device; generating, by the microphone, an electrical signal corresponding to the speech; obtaining, by the mobile speech recognition computing device locally at the mobile speech recognition computing device, text corresponding to the speech; transmitting the text from the mobile speech recognition computing device to a storage device located remotely relative to the mobile speech recognition computing device; accessing a first webpage from a second computing device that comprises a copy buffer and that is located remotely relative to the storage device, the first webpage comprising a session identifier; transmitting the session identifier to the storage device from one of the mobile speech recognition computing device or the second computing device; associating the text with a user identifier and associating the session identifier with the user identifier; in response to receipt of the session identifier at the storage device and responsive to determining that the user identifier has been associated with the session identifier, receiving a second webpage comprising the text at the second computing device, the second webpage being provided by a webserver located remotely relative to the mobile speech recognition computing device and the second webpage automatically copying the text to the copy buffer of the second computing device in response to display of the second webpage at the second computing device, wherein the speech is first speech; and updating the second webpage based upon, at least in part, additional text obtained from second speech received at the microphone of the mobile speech recognition computing device subsequent to the first speech.

12. The method of claim 11 wherein: the speech is received at the microphone before the first webpage is accessed.

13. The method of claim 11 wherein: the speech is received at the microphone after the first webpage is accessed.

14. The method of claim 11 further comprising: scanning a barcode included in the first webpage with the mobile speech recognition computing device wherein, the barcode encoding the session identifier; decoding the barcode with the mobile speech recognition computing device to obtain the session identifier; and transmitting the session identifier to the storage device from the mobile speech recognition computing device.

15. The method of claim 11 further comprising: transmitting a timestamp from the mobile speech recognition computing device to the storage device with the text and the user identifier; and wherein the timestamp is included in the second webpage with the text.

16. The method of claim 11 wherein: the second computing device receives the update in real-time.

17. The method of claim 11 further comprising: editing the text at the mobile speech recognition computing device before transmitting the text to the storage device.
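Claims 8, 9, and 16 describe the second webpage being updated in real time by pushing additional text to the second computing device. One way to picture that behavior is a small publish/subscribe sketch. Everything here is an assumption made for illustration: `DictationSession` and its methods are hypothetical names, and plain Python callbacks stand in for whatever open browser connection a real webserver would use.

```python
class DictationSession:
    """Hypothetical server-side view of one paired dictation session."""

    def __init__(self):
        self.segments = []     # text segments in arrival order
        self.subscribers = []  # callbacks standing in for open pages

    def subscribe(self, callback):
        # A newly displayed page first receives everything dictated so far.
        for segment in self.segments:
            callback(segment)
        self.subscribers.append(callback)

    def append(self, segment):
        # Text from later speech is stored, then pushed to every open page.
        self.segments.append(segment)
        for callback in self.subscribers:
            callback(segment)


session = DictationSession()
session.append("First sentence.")      # dictated before the page is open
page = []                              # stands in for the displayed webpage
session.subscribe(page.append)
session.append("Second sentence.")     # dictated while the page is open
print(page)
```

Replaying stored segments on subscribe covers both orderings named in claims 6 and 7 (text obtained before or after the first webpage is provided), while the push in `append` corresponds to the update step.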

Assignees

Nuance Communications Inc

Inventors

Classifications

G10L15/26 · Primary
Speech to text systems (G10L15/08 takes precedence) · CPC title
G10L2015/223
Execution procedure of a spoken command · CPC title
G06F40/30
Semantic analysis · CPC title
G10L15/22
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
G06F16/685
using automatically derived transcript of audio data, e.g. lyrics (speech recognition G10L15/00) · CPC title

Patent family

Related publications grouped by family.

View patent family 51228489

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10614808B2 cover?: Aspects of the disclosure provide systems and methods for facilitating dictation. Speech input may be provided to an audio input device of a computing device. A speech recognition engine at the computing device may obtain text corresponding to the speech input. The computing device may transmit the text to a remotely-located storage device. A login webpage that includes a session identifier may…
Who is the assignee on this patent?: Nuance Communications Inc
What technology area does this patent fall under?: Primary CPC classification G10L15/26. Mapped technology areas include Physics.
When was this patent published?: Publication date Apr 7, 2020 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).