Display apparatus and method for question and answer
US-2016133146-A1 · May 12, 2016 · US
US9990923B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9990923-B2 |
| Application number | US-201615294476-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 14, 2016 |
| Priority date | Sep 27, 2016 |
| Publication date | Jun 5, 2018 |
| Grant date | Jun 5, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods and apparatuses are described for automated execution of computer software using intelligent speech recognition techniques. A server captures a digitized voice segment from a remote device, the first digitized voice segment corresponding to speech submitted by a user of the remote device during a voice call. The server analyzes the digitized voice segment to convert the digitized voice segment into text and extract a set of keywords from the converted text. A client computing device displays the extracted keywords in a user interface of a display device. The server determines one or more computer software applications accessible to the client computing device and selects least one of the computer software applications that include functionality responsive to the keywords. The client computing device executes the functionality of the selected computer software applications that is responsive to the keywords.
Opening claim text (preview).
What is claimed is: 1. A method for automated execution of computer software using intelligent speech recognition techniques, the method comprising: capturing, by a server computing device, a bitstream containing a digitized voice segment from a remote device as a speech file, the first digitized voice segment corresponding to speech submitted by a user of the remote device during a voice call; parsing, by the server computing device, the bitstream to locate the digitized voice segment; adjusting, by the server computing device, compression of the bitstream containing the digitized voice segment to enhance audio quality of the bitstream; analyzing, by the server computing device, the speech file to convert the speech file into text and extract a set of keywords from the converted text; displaying, by a client computing device coupled to the server computing device, the extracted keywords in a user interface of a display device; determining, by the server computing device, one or more computer software applications accessible to the client computing device; selecting, by the server computing device, at least one of the computer software applications that include functionality responsive to the keywords, comprising: generating an input vector comprising a sequence of numeric values, each value associated with a keyword and weighted according to a relative position of the keyword in the set of keywords, matching the input vector against a predefined set of vectors to determine one or more vectors that are similar to the input vector, identifying a label corresponding to each matched vector, wherein the label is associated with computer software functionality, and selecting one or more computer software applications that are associated with a most common label of the identified labels; and executing, by the client computing device, the functionality of the selected computer software applications that are responsive to the keywords. 2. The method of claim 1 , wherein matching the input vector comprises determining, by the server computing device, a distance between the input vector and each vector in the predefined set of vectors; and choosing, by the server computing device, one or more of vectors in the predefined set of vectors where the distance is within a predetermined threshold. 3. The method of claim 1 , wherein the label is an identifier that corresponds to a computer software application. 4. The method of claim 1 , further comprising establishing, by the server computing device, a voice connection between the remote device and the client computing device before capturing the digitized voice segment. 5. The method of claim 1 , further comprising establishing, by the server computing device, a voice connection between the remote device and an interactive voice response system before capturing the digitized voice segment. 6. The method of claim 1 , further comprising displaying, by the client computing device, one or more user interface elements in the user interface that correspond to the executed functionality of the selected software applications. 7. The method of claim 1 , wherein extracting a set of keywords from the converted text comprises filtering, by the server computing device, the converted text to remove stopwords. 8. The method of claim 1 , wherein converting the digitized voice segment into text comprises executing, by the server computing device, a speech recognition engine on a digital file containing the digitized voice segment to generate the text. 9. The method of claim 8 , further comprising analyzing, by the server computing device, the text using a grammar recognition engine to validate the generated text. 10. A system for automated execution of computer software using intelligent speech recognition techniques, the system comprising: a server computing device configured to capture a bitstream containing a digitized voice segment from a remote device as a speech file, the digitized voice segment corresponding to speech submitted by a user of the remote device during a voice call; parse the bitstream to locate the digitized voice segment; adjust compression of the bitstream containing the digitized voice segment to enhance audio quality of the bitstream; analyze the speech file to convert the speech file into text and extract a set of keywords from the converted text; determine one or more computer software applications accessible to the client computing device; and select at least one of the computer software applications that include functionality responsive to the keywords, comprising: generating, using a sequenced bag-of-words processing model, an input vector comprising a sequence of numeric values, each value associated with a keyword and weighted according to a relative position of the keyword in the set of keywords, matching, using a K-Nearest Neighbor processing model, the input vector against a predefined set of vectors to determine one or more vectors that are similar to the input vector, identifying a label corresponding to each matched vector, wherein the label is associated with computer software functionality, and selecting one or more computer software applications that are associated with a most common label of the identified labels; and a client computing device coupled to the server computing device, the client computing device configured to display the extracted keywords in a user interface of a display device; and execute the functionality of the selected computer software applications that is responsive to the keywords. 11. The system of claim 10 , wherein when matching the input vector, the server computing device is configured to determine a distance between the input vector and each vector in the predefined set of vectors; and choose one or more of vectors in the predefined set of vectors where the distance is within a predetermined threshold. 12. The system of claim 10 , wherein the label is an identifier that corresponds to a computer software application. 13. The system of claim 10 , wherein the server computing device is configured to establish a voice connection between the remote device and the client computing device before capturing the digitized voice segment. 14. The system of claim 10 , wherein the server computing device is configured to establish a voice connection between the remote device and an interactive voice response system before capturing the digitized voice segment. 15. The system of claim 10 , wherein the server computing device is configured to display one or more user interface elements in the user interface that correspond to the executed functionality of the selected software applications. 16. The system of claim 10 , wherein extracting a set of keywords from the converted text comprises filtering the converted text to remove stopwords. 17. The system of claim 10 , wherein converting the digitized voice segment into text comprises executing a speech recognition engine on a digital file containing the digitized voice segment to generate the text. 18. The system of claim 17 , wherein the server computing device is configured to analyze the text using a grammar recognition engine to validate the generated text. 19. A computer program product, tangibly embodied in a non-transitory computer readable storage device, for automated execution of computer software using intelligent speech recognition techniques, the computer program product including instructions operable to cause a server computing device to capture a bitstream con
Language aspects · CPC title
Word spotting · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Execution procedure of a spoken command · CPC title
Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.