Automated software execution using intelligent speech recognition

US9990923B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9990923-B2
Application numberUS-201615294476-A
CountryUS
Kind codeB2
Filing dateOct 14, 2016
Priority dateSep 27, 2016
Publication dateJun 5, 2018
Grant dateJun 5, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and apparatuses are described for automated execution of computer software using intelligent speech recognition techniques. A server captures a digitized voice segment from a remote device, the first digitized voice segment corresponding to speech submitted by a user of the remote device during a voice call. The server analyzes the digitized voice segment to convert the digitized voice segment into text and extract a set of keywords from the converted text. A client computing device displays the extracted keywords in a user interface of a display device. The server determines one or more computer software applications accessible to the client computing device and selects least one of the computer software applications that include functionality responsive to the keywords. The client computing device executes the functionality of the selected computer software applications that is responsive to the keywords.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for automated execution of computer software using intelligent speech recognition techniques, the method comprising: capturing, by a server computing device, a bitstream containing a digitized voice segment from a remote device as a speech file, the first digitized voice segment corresponding to speech submitted by a user of the remote device during a voice call; parsing, by the server computing device, the bitstream to locate the digitized voice segment; adjusting, by the server computing device, compression of the bitstream containing the digitized voice segment to enhance audio quality of the bitstream; analyzing, by the server computing device, the speech file to convert the speech file into text and extract a set of keywords from the converted text; displaying, by a client computing device coupled to the server computing device, the extracted keywords in a user interface of a display device; determining, by the server computing device, one or more computer software applications accessible to the client computing device; selecting, by the server computing device, at least one of the computer software applications that include functionality responsive to the keywords, comprising: generating an input vector comprising a sequence of numeric values, each value associated with a keyword and weighted according to a relative position of the keyword in the set of keywords, matching the input vector against a predefined set of vectors to determine one or more vectors that are similar to the input vector, identifying a label corresponding to each matched vector, wherein the label is associated with computer software functionality, and selecting one or more computer software applications that are associated with a most common label of the identified labels; and executing, by the client computing device, the functionality of the selected computer software applications that are responsive to the keywords. 2. The method of claim 1 , wherein matching the input vector comprises determining, by the server computing device, a distance between the input vector and each vector in the predefined set of vectors; and choosing, by the server computing device, one or more of vectors in the predefined set of vectors where the distance is within a predetermined threshold. 3. The method of claim 1 , wherein the label is an identifier that corresponds to a computer software application. 4. The method of claim 1 , further comprising establishing, by the server computing device, a voice connection between the remote device and the client computing device before capturing the digitized voice segment. 5. The method of claim 1 , further comprising establishing, by the server computing device, a voice connection between the remote device and an interactive voice response system before capturing the digitized voice segment. 6. The method of claim 1 , further comprising displaying, by the client computing device, one or more user interface elements in the user interface that correspond to the executed functionality of the selected software applications. 7. The method of claim 1 , wherein extracting a set of keywords from the converted text comprises filtering, by the server computing device, the converted text to remove stopwords. 8. The method of claim 1 , wherein converting the digitized voice segment into text comprises executing, by the server computing device, a speech recognition engine on a digital file containing the digitized voice segment to generate the text. 9. The method of claim 8 , further comprising analyzing, by the server computing device, the text using a grammar recognition engine to validate the generated text. 10. A system for automated execution of computer software using intelligent speech recognition techniques, the system comprising: a server computing device configured to capture a bitstream containing a digitized voice segment from a remote device as a speech file, the digitized voice segment corresponding to speech submitted by a user of the remote device during a voice call; parse the bitstream to locate the digitized voice segment; adjust compression of the bitstream containing the digitized voice segment to enhance audio quality of the bitstream; analyze the speech file to convert the speech file into text and extract a set of keywords from the converted text; determine one or more computer software applications accessible to the client computing device; and select at least one of the computer software applications that include functionality responsive to the keywords, comprising: generating, using a sequenced bag-of-words processing model, an input vector comprising a sequence of numeric values, each value associated with a keyword and weighted according to a relative position of the keyword in the set of keywords, matching, using a K-Nearest Neighbor processing model, the input vector against a predefined set of vectors to determine one or more vectors that are similar to the input vector, identifying a label corresponding to each matched vector, wherein the label is associated with computer software functionality, and selecting one or more computer software applications that are associated with a most common label of the identified labels; and a client computing device coupled to the server computing device, the client computing device configured to display the extracted keywords in a user interface of a display device; and execute the functionality of the selected computer software applications that is responsive to the keywords. 11. The system of claim 10 , wherein when matching the input vector, the server computing device is configured to determine a distance between the input vector and each vector in the predefined set of vectors; and choose one or more of vectors in the predefined set of vectors where the distance is within a predetermined threshold. 12. The system of claim 10 , wherein the label is an identifier that corresponds to a computer software application. 13. The system of claim 10 , wherein the server computing device is configured to establish a voice connection between the remote device and the client computing device before capturing the digitized voice segment. 14. The system of claim 10 , wherein the server computing device is configured to establish a voice connection between the remote device and an interactive voice response system before capturing the digitized voice segment. 15. The system of claim 10 , wherein the server computing device is configured to display one or more user interface elements in the user interface that correspond to the executed functionality of the selected software applications. 16. The system of claim 10 , wherein extracting a set of keywords from the converted text comprises filtering the converted text to remove stopwords. 17. The system of claim 10 , wherein converting the digitized voice segment into text comprises executing a speech recognition engine on a digital file containing the digitized voice segment to generate the text. 18. The system of claim 17 , wherein the server computing device is configured to analyze the text using a grammar recognition engine to validate the generated text. 19. A computer program product, tangibly embodied in a non-transitory computer readable storage device, for automated execution of computer software using intelligent speech recognition techniques, the computer program product including instructions operable to cause a server computing device to capture a bitstream con

Assignees

Inventors

Classifications

  • Language aspects · CPC title

  • Word spotting · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Execution procedure of a spoken command · CPC title

  • Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9990923B2 cover?
Methods and apparatuses are described for automated execution of computer software using intelligent speech recognition techniques. A server captures a digitized voice segment from a remote device, the first digitized voice segment corresponding to speech submitted by a user of the remote device during a voice call. The server analyzes the digitized voice segment to convert the digitized voice …
Who is the assignee on this patent?
Fmr Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 05 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).