Automatic speech recognition techniques for multiple languages

US9953631B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9953631-B1
Application numberUS-201514706243-A
CountryUS
Kind codeB1
Filing dateMay 7, 2015
Priority dateMay 7, 2015
Publication dateApr 24, 2018
Grant dateApr 24, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Computer-implemented techniques can include obtaining languages for automatic speech recognition, the languages being associated with a plurality of users and receiving an input indicative of a request to provide speech input. In response to receiving the input, a first indication that automatic speech recognition (ASR) is being performed in each of the plurality of languages can be displayed. A speech input from one of the plurality of users can be received and ASR can identify one of the languages associated with the speech input to obtain a detected language. In response to obtaining the detected language, a second indication that ASR is being performed in the detected language can be displayed. The speech input can be transcribed and translated to another one of the plurality of languages to obtain a translated text that can be displayed.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: obtaining, at a computing device having one or more processors, a plurality of languages for automatic speech recognition, the plurality of languages being associated with a plurality of users; receiving, at the computing device, an input indicative of a request to provide speech input; receiving, by the computing device, a speech input from one of the plurality of users; in response to receiving the input: displaying, by the computing device, a first indication that the computing device is performing automatic speech recognition in two or more of the plurality of languages, the first indication comprising highlighted icons displaying the two or more of the plurality of languages, and performing, by the computing device, automatic speech recognition on the speech input in the two or more of the plurality of languages to identify one of the plurality of languages associated with the speech input to obtain a detected language; obtaining, by the computing device, a transcription of the speech input to obtain a text in the detected language; in response to obtaining the text, displaying, by the computing device, the text in a first area on a display of the computing device; obtaining, by the computing device, a translation of the text from the detected language to another one of the plurality of languages to obtain a translated text; and displaying, by the computing device, the translated text in a separate second area on the display, wherein the highlighted icons displaying the two or more of the plurality of languages are displayed in a separate third area of the display that is distinct from the first and second areas of the display. 2. The computer-implemented method of claim 1 , further comprising in response to obtaining the detected language, displaying, by the computing device, a second indication that the computing device is performing automatic speech recognition in the detected language. 3. The computer-implemented method of claim 2 , wherein the second indication is a highlighted icon displaying the detected language and non-highlighted icons displaying a remainder of the two or more of the plurality of languages, respectively. 4. The computer-implemented method of claim 3 , further comprising displaying, by the computing device, a third indication of whether automatic speech recognition is currently being performed, wherein the third indication is a highlighted icon from when the request is received until a completion of the speech input, and wherein the third indication is a non-highlighted icon upon the completion of the speech input. 5. The computer-implemented method of claim 1 , wherein the text is displayed in at least one of a first color and a first style, and wherein the translated text is displayed in at least one of a different second color and a different second style. 6. A computing device having one or more processors configured to perform operations comprising: obtaining a plurality of languages for automatic speech recognition, the plurality of languages being associated with a plurality of users; receiving an input indicative of a request to provide speech input; receiving a speech input from one of the plurality of users; in response to receiving the input: displaying a first indication that the computing device is performing automatic speech recognition in two or more of the plurality of languages, the first indication comprising highlighted icons displaying the two or more of the plurality of languages, performing automatic speech recognition on the speech input in the two or more of the plurality of languages to identify one of the plurality of languages associated with the speech input to obtain a detected language, and in response to obtaining the detected language, displaying a second indication that the computing device is performing automatic speech recognition in the detected language, wherein the second indication is a highlighted icon displaying the detected language and non-highlighted icons displaying a remainder of the two or more of the plurality of languages, respectively; obtaining a transcription of the speech input to obtain a text in the detected language; obtaining a translation of the text from the detected language to another one of the plurality of languages to obtain a translated text; and displaying the translated text. 7. The computing device of claim 6 , wherein the operations further comprise displaying a third indication of whether automatic speech recognition is currently being performed, wherein the third indication is a highlighted icon from when the request is received until a completion of the speech input, and wherein the third indication is a non-highlighted icon upon the completion of the speech input. 8. The computing device of claim 6 , wherein the operations further comprise displaying the text in the detected language in response to obtaining the text, wherein the text is displayed in a first area on a display of the computing device, and wherein the translated text is displayed in a separate second area on the display. 9. The computing device of claim 8 , wherein the text is displayed in at least one of a first color and a first style, and wherein the translated text is displayed in at least one of a different second color and a different second style. 10. A non-transitory computer-readable medium having instructions stored thereon that, when executed by one or more processors of a computing device, cause the computing device to perform operations comprising: obtaining a plurality of languages for automatic speech recognition, the plurality of languages being associated with a plurality of users; receiving an input indicative of a request to provide speech input; receiving a speech input from one of the plurality of users; in response to receiving the input: displaying a first indication that the computing device is performing automatic speech recognition in two or more of the plurality of languages, the first indication comprising highlighted icons displaying the two or more of the plurality of languages, performing automatic speech recognition on the speech input in the two or more of the plurality of languages to identify one of the plurality of languages associated with the speech input to obtain a detected language, and in response to obtaining the detected language, displaying a second indication that the computing device is performing automatic speech recognition in the detected language, wherein the second indication is a highlighted icon displaying the detected language and non-highlighted icons displaying a remainder of the two or more of the plurality of languages, respectively; obtaining a transcription of the speech input to obtain a text in the detected language; obtaining a translation of the text from the detected language to another one of the plurality of languages to obtain a translated text; and displaying the translated text. 11. The computer-readable medium of claim 10 , wherein the operations further comprise displaying a third indication of whether automatic speech recognition is currently being performed, wherein the third indication is a highlighted icon from when the request is received until a completion of the speech input, and wherein the third indication is a non-highlighted icon upon the completion of the speech input. 12. The computer-readable medium of claim 10 , wherein the operations further comprise displaying the text in the detected language in response to obtaining the text, wherein the text is displayed in a first area on a display of the computing device, wherein the t

Assignees

Inventors

Classifications

  • Language identification · CPC title

  • Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

  • G10L15/005Primary

    Language recognition · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9953631B1 cover?
Computer-implemented techniques can include obtaining languages for automatic speech recognition, the languages being associated with a plurality of users and receiving an input indicative of a request to provide speech input. In response to receiving the input, a first indication that automatic speech recognition (ASR) is being performed in each of the plurality of languages can be displayed. …
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 24 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).