Speech recognition in a vehicle

US11893978B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11893978-B2
Application numberUS-202117400624-A
CountryUS
Kind codeB2
Filing dateAug 12, 2021
Priority dateAug 12, 2021
Publication dateFeb 6, 2024
Grant dateFeb 6, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An audio sample including speech and ambient sounds is transmitted to a vehicle computer. Recorded audio is received from the vehicle computer, the recorded audio including the audio sample broadcast by the vehicle computer and recorded by the vehicle computer and recognized speech from the recorded audio. The recognized speech and text of the speech are input to a machine learning program that outputs whether the recognized speech matches the text. When the output from the machine learning program indicates that the recognized speech does not match the text, the recognized speech and the text are included in a training dataset for the machine learning program.

First claim

Opening claim text (preview).

The invention claimed is: 1. A system, comprising a computer including a processor and a memory, the memory storing instructions executable by the processor to: transmit, via a network, an audio sample including speech and ambient sounds to a vehicle computer; broadcast the audio sample in a vehicle of the vehicle computer; record, from the vehicle computer, audio from the audio sample; recognize, by the vehicle computer, speech in the recorded audio from the audio sample; receive, via the network from the vehicle computer, the recorded audio including the audio sample; receive, via the network from the vehicle computer, the recognized speech from the recorded audio; identify text of the recognized speech from the recorded audio; input the recognized speech and the identified text of the recognized speech to a machine learning program that outputs whether the recognized speech matches the identified text; and when the output from the machine learning program indicates that the recognized speech does not match the text include the recognized speech and the text in a training dataset for the machine learning program; wherein the machine learning program is trained to output, from the recorded audio, a detection of a vibration from a vehicle subsystem indicating potential maintenance for the vehicle subsystem; and identify a variation in the vehicle subsystem based on the output detected vibration from the recorded audio and a vibration predicted by a prognostics model of the vehicle subsystem. 2. The system of claim 1 , wherein the instructions further include instructions to generate the audio sample with a second machine learning program trained to output audio samples including speech and ambient sounds. 3. The system of claim 2 , wherein the audio sample includes speech of a voice command to actuate one or more vehicle subsystems. 4. The system of claim 1 , wherein the instructions further include instructions to receive, from the vehicle computer, a message indicating that the vehicle computer did not recognize speech from the recorded audio broadcast and to include the audio sample and the message in the training dataset. 5. The system of claim 1 , wherein the instructions further include instructions to receive a message from the vehicle computer indicating that a vehicle in which the vehicle computer is located is stationary, and then transmit the audio sample to the vehicle computer based on receiving the message indicating that the vehicle in which the vehicle computer is located is stationary. 6. The system of claim 1 , wherein the instructions further include instructions to transmit the detected variation to the vehicle computer. 7. The system of claim 1 , wherein the instructions further include instructions to input the detected vibration in the training dataset for the machine learning program. 8. The system of claim 1 , wherein the vehicle computer is further programmed to, upon receiving the audio sample, to actuate a speaker to broadcast the audio sample and to actuate a microphone to record the broadcast audio sample. 9. The system of claim 8 , wherein the vehicle computer is further programmed to input the recorded audio to a speech recognition program trained to output speech recognized from the recorded audio. 10. The system of claim 8 , wherein the vehicle computer is further programmed to actuate each of a plurality of speakers to broadcast the audio sample and to actuate the microphone to record the audio sample broadcast from each of the plurality of speakers. 11. The system of claim 1 , wherein the instructions further include instructions to transmit the audio sample to each of a plurality of vehicle computers, each vehicle computer located in a respective vehicle, and to receive recorded audio and recognized speech from each of the plurality of vehicle computers. 12. The system of claim 11 , wherein the instructions further include instructions to input respective recognized speech and text of the speech from each of the plurality of vehicle computers to the training dataset of the machine learning program. 13. The system of claim 1 , wherein the instructions further include instructions to retrain the machine learning program with the training dataset and to transmit the retrained machine learning program to the vehicle computer. 14. A method, comprising: transmitting, via a network, an audio sample including speech and ambient sounds to a vehicle computer; broadcasting the audio sample in a vehicle of the vehicle computer; recording, from the vehicle computer, audio from the audio sample; recognizing, by the vehicle computer, speech in the recorded audio from the audio sample; receiving, via the network from the vehicle computer, the recorded audio including the audio sample: receiving, via the network from the vehicle computer, the recognized speech from the recorded audio; identifying text of the recognized speech from the recorded audio; inputting the recognized speech and the identified text of the recognized speech to a machine learning program that outputs whether the recognized speech matches the identified text; and when the output from the machine learning program indicates that the recognized speech does not match the text, including the recognized speech and the text in a training dataset for the machine learning program; wherein the machine learning program is trained to output, from the recorded audio, a detection of a vibration from a vehicle subsystem indicating potential maintenance for the vehicle subsystem; and identifying a variation with the vehicle subsystem based on the output detected vibration from the recorded audio and a vibration predicted by a prognostics model of the vehicle subsystem. 15. The method of claim 14 , further comprising generating the audio sample with a second machine learning program trained to output audio samples including speech and ambient sounds. 16. The method of claim 14 , further comprising receiving, from the vehicle computer, a message indicating that the vehicle computer did not recognize speech from the recorded audio broadcast and including the audio sample and the message in the training dataset. 17. The method of claim 14 , further comprising receiving a message from the vehicle computer indicating that a vehicle in which the vehicle computer is located is stationary, and then transmitting the audio sample to the vehicle computer. 18. The method of claim 14 , wherein the vehicle computer is further programmed to, upon receiving the audio sample, to actuate a speaker to broadcast the audio sample and to actuate a microphone to record the broadcast audio sample.

Assignees

Inventors

Classifications

  • G10L15/063Primary

    Training · CPC title

  • Machine learning · CPC title

  • Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech (G10L21/02 takes precedence) · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11893978B2 cover?
An audio sample including speech and ambient sounds is transmitted to a vehicle computer. Recorded audio is received from the vehicle computer, the recorded audio including the audio sample broadcast by the vehicle computer and recorded by the vehicle computer and recognized speech from the recorded audio. The recognized speech and text of the speech are input to a machine learning program that…
Who is the assignee on this patent?
Ford Global Tech Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/063. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 06 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).