Automatic synchronization for an offline virtual assistant
US-2024347055-A1 · Oct 17, 2024 · US
US2017301352A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2017301352-A1 |
| Application number | US-201715637526-A |
| Country | US |
| Kind code | A1 |
| Filing date | Jun 29, 2017 |
| Priority date | Feb 5, 2016 |
| Publication date | Oct 19, 2017 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions.
Opening claim text (preview).
What is claimed is: 1 . (canceled) 2 . A computer-implemented method comprising: providing an utterance to a speech recognizer that uses a language model that includes a specified vocabulary; and based on processing the utterance using the speech recognizer and a post-processor, generating a transcription of the utterance that includes a term that is not in the specified vocabulary of the speech recognizer. 3 . The method of claim 2 , wherein the language model indicates likelihoods that words or sequences of words in the specified vocabulary appear. 4 . The method of claim 2 , wherein the post-processor uses a second language model that includes the term that is not in the specified vocabulary of the speech recognizer. 5 . The method of claim 4 , wherein the second language model indicates likelihoods that words or sequences of words in another specified vocabulary that includes the term appear. 6 . The method of claim 2 , wherein based on processing the utterance using the speech recognizer and a post-processor, generating a transcription of the utterance that includes a term that is not in the specified vocabulary of the speech recognizer comprises: obtaining, from the speech recognizer, an initial transcription of the utterance that does not include the term; and generating the transcription that includes the term from the initial transcription. 7 . The method of claim 6 , wherein generating the transcription that includes the term from the initial transcription comprises: receiving, from the speech recognizer, an acoustic match score that reflects a phonetic similarity between the initial transcription and the utterance; and generating the transcription that includes the term from the initial transcription with the acoustic match score. 8 . The method of claim 2 , wherein providing an utterance to a speech recognizer that uses a language model that includes a specified vocabulary comprises: providing acoustic data that reflects the utterance to the speech recognizer. 9 . A system comprising: one or more computers; and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: providing an utterance to a speech recognizer that uses a language model that includes a specified vocabulary; and based on processing the utterance using the speech recognizer and a post-processor, generating a transcription of the utterance that includes a term that is not in the specified vocabulary of the speech recognizer. 10 . The system of claim 9 , wherein the language model indicates likelihoods that words or sequences of words in the specified vocabulary appear. 11 . The system of claim 9 , wherein the post-processor uses a second language model that includes the term that is not in the specified vocabulary of the speech recognizer. 12 . The system of claim 11 , wherein the second language model indicates likelihoods that words or sequences of words in another specified vocabulary that includes the term appear. 13 . The system of claim 9 , wherein based on processing the utterance using the speech recognizer and a post-processor, generating a transcription of the utterance that includes a term that is not in the specified vocabulary of the speech recognizer comprises: obtaining, from the speech recognizer, an initial transcription of the utterance that does not include the term; and generating the transcription that includes the term from the initial transcription. 14 . The system of claim 13 , wherein generating the transcription that includes the term from the initial transcription comprises: receiving, from the speech recognizer, an acoustic match score that reflects a phonetic similarity between the initial transcription and the utterance; and generating the transcription that includes the term from the initial transcription with the acoustic match score. 15 . The system of claim 9 , wherein providing an utterance to a speech recognizer that uses a language model that includes a specified vocabulary comprises: providing acoustic data that reflects the utterance to the speech recognizer. 16 . A non-transitory computer-readable medium storing instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising: providing an utterance to a speech recognizer that uses a language model that includes a specified vocabulary; and based on processing the utterance using the speech recognizer and a post-processor, generating a transcription of the utterance that includes a term that is not in the specified vocabulary of the speech recognizer. 17 . The medium of claim 16 , wherein the language model indicates likelihoods that words or sequences of words in the specified vocabulary appear. 18 . The medium of claim 16 , wherein the post-processor uses a second language model that includes the term that is not in the specified vocabulary of the speech recognizer. 19 . The medium of claim 18 , wherein the second language model indicates likelihoods that words or sequences of words in another specified vocabulary that includes the term appear. 20 . The medium of claim 16 , wherein based on processing the utterance using the speech recognizer and a post-processor, generating a transcription of the utterance that includes a term that is not in the specified vocabulary of the speech recognizer comprises: obtaining, from the speech recognizer, an initial transcription of the utterance that does not include the term; and generating the transcription that includes the term from the initial transcription. 21 . The medium of claim 20 , wherein generating the transcription that includes the term from the initial transcription comprises: receiving, from the speech recognizer, an acoustic match score that reflects a phonetic similarity between the initial transcription and the utterance; and generating the transcription that includes the term from the initial transcription with the acoustic match score.
Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules · CPC title
for comparison or discrimination · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems · CPC title
Feature extraction for speech recognition; Selection of recognition unit · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.