Authentication of audio-based input signals

US11880442B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11880442-B2
Application numberUS-202117543371-A
CountryUS
Kind codeB2
Filing dateDec 6, 2021
Priority dateMar 15, 2013
Publication dateJan 23, 2024
Grant dateJan 23, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method implemented by one or more processors, the method comprising: receiving an audio-based input, detected by a microphone of a computing device, that includes a voice input of a user; processing, using facial recognition, video-based input that is detected at a camera of the computing, device; determining, based on processing of the audio-based input and the processing of the video-based input, that a probability, that the audio-based input was generated by a particular registered user of the computing device, satisfies a threshold; determining that a distance, between the computing device and the particular registered user, satisfies a distance threshold; and in response to determining that the probability satisfies the threshold and that the distance satisfies the distance threshold: selecting a content item based on an action identified by the voice input and based on a profile of the particular registered user; and causing the content item to be rendered in response to the audio-based input and to be rendered at the computing device or an additional computing device that is separate from, but associated with, the computing device. 2. The method of claim 1 , further comprising: determining the distance, between the computing device and the particular registered user, based on a separation distance between the computing device and the additional computing device. 3. The method of claim 2 , wherein the additional computing device is a smartphone of the user. 4. The method of claim 1 , wherein causing the content item to be rendered comprises causing the content item to be rendered at the additional computing device. 5. The method of claim 1 , further comprising: processing the voice input using a natural language processor (NLP) component to determine the action identified by the voice input. 6. The method of claim 5 , wherein processing the voice input using the NLP component to determine the action identified by the voice input comprises: converting the voice input into text; and processing the text to determine the action identified by the voice input. 7. The method of claim 1 , wherein the processing of the audio-based input, in determining that the probability satisfies the threshold, occurs locally at the computing device. 8. A computing device, comprising: a microphone; memory storing instructions; one or more processors, executing the instructions, to: receive an audio-based input, detected by the microphone, that includes a voice input of a user; process, using facial recognition, video-based input that is detected at a camera of the computing device; determine, based on processing of the audio-based input and the processing of the video-based input, that a probability, that the audio-based input was generated by a particular registered user of the computing device, satisfies a threshold; determine that a distance, between the computing device and the particular registered user, satisfies a distance threshold; and in response to determining that the probability satisfies the threshold and that the distance satisfies the distance threshold: select a content item based on an action identified by the voice input and based on a profile of the particular registered user; and cause the content item to be rendered in response to the audio-based input and to be rendered at the computing device or an additional computing device that is separate from, but associated with, the computing device. 9. The computing device of claim 8 , wherein in executing the instructions one or more of the processors are further to: determine the distance, between the computing device and the particular registered user, based on a separation distance between the computing device and the additional computing device. 10. The computing device of claim 8 , wherein the additional computing device is a smartphone of the user. 11. The computing device of claim 8 , wherein in causing the content item to be rendered one or more of the processors are to cause the content item to be rendered at the additional computing device. 12. The computing device of claim 8 , wherein in executing the instructions one or more of the processors are further to: process the voice input using a natural language processor (NLP) component to determine the action identified by the voice input. 13. The computing device of claim 12 , wherein in processing the voice input using the NLP component to determine the action identified by the voice input, one or more of the processors are to: convert the voice input into text; and process the text to determine the action identified by the voice input.

Assignees

Inventors

Classifications

  • G06F21/32Primary

    using biometric data, e.g. fingerprints, iris scans or voiceprints · CPC title

  • by observing the pattern of computer usage, e.g. typical user behaviour · CPC title

  • involving the use of external additional devices, e.g. dongles or smart cards · CPC title

  • communicating wirelessly · CPC title

  • Classification, e.g. identification · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11880442B2 cover?
The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across th…
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06F21/32. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).