Authentication of packetized audio signals
US-10541997-B2 · Jan 21, 2020 · US
US11880442B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11880442-B2 |
| Application number | US-202117543371-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 6, 2021 |
| Priority date | Mar 15, 2013 |
| Publication date | Jan 23, 2024 |
| Grant date | Jan 23, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.
Opening claim text (preview).
The invention claimed is: 1. A method implemented by one or more processors, the method comprising: receiving an audio-based input, detected by a microphone of a computing device, that includes a voice input of a user; processing, using facial recognition, video-based input that is detected at a camera of the computing, device; determining, based on processing of the audio-based input and the processing of the video-based input, that a probability, that the audio-based input was generated by a particular registered user of the computing device, satisfies a threshold; determining that a distance, between the computing device and the particular registered user, satisfies a distance threshold; and in response to determining that the probability satisfies the threshold and that the distance satisfies the distance threshold: selecting a content item based on an action identified by the voice input and based on a profile of the particular registered user; and causing the content item to be rendered in response to the audio-based input and to be rendered at the computing device or an additional computing device that is separate from, but associated with, the computing device. 2. The method of claim 1 , further comprising: determining the distance, between the computing device and the particular registered user, based on a separation distance between the computing device and the additional computing device. 3. The method of claim 2 , wherein the additional computing device is a smartphone of the user. 4. The method of claim 1 , wherein causing the content item to be rendered comprises causing the content item to be rendered at the additional computing device. 5. The method of claim 1 , further comprising: processing the voice input using a natural language processor (NLP) component to determine the action identified by the voice input. 6. The method of claim 5 , wherein processing the voice input using the NLP component to determine the action identified by the voice input comprises: converting the voice input into text; and processing the text to determine the action identified by the voice input. 7. The method of claim 1 , wherein the processing of the audio-based input, in determining that the probability satisfies the threshold, occurs locally at the computing device. 8. A computing device, comprising: a microphone; memory storing instructions; one or more processors, executing the instructions, to: receive an audio-based input, detected by the microphone, that includes a voice input of a user; process, using facial recognition, video-based input that is detected at a camera of the computing device; determine, based on processing of the audio-based input and the processing of the video-based input, that a probability, that the audio-based input was generated by a particular registered user of the computing device, satisfies a threshold; determine that a distance, between the computing device and the particular registered user, satisfies a distance threshold; and in response to determining that the probability satisfies the threshold and that the distance satisfies the distance threshold: select a content item based on an action identified by the voice input and based on a profile of the particular registered user; and cause the content item to be rendered in response to the audio-based input and to be rendered at the computing device or an additional computing device that is separate from, but associated with, the computing device. 9. The computing device of claim 8 , wherein in executing the instructions one or more of the processors are further to: determine the distance, between the computing device and the particular registered user, based on a separation distance between the computing device and the additional computing device. 10. The computing device of claim 8 , wherein the additional computing device is a smartphone of the user. 11. The computing device of claim 8 , wherein in causing the content item to be rendered one or more of the processors are to cause the content item to be rendered at the additional computing device. 12. The computing device of claim 8 , wherein in executing the instructions one or more of the processors are further to: process the voice input using a natural language processor (NLP) component to determine the action identified by the voice input. 13. The computing device of claim 12 , wherein in processing the voice input using the NLP component to determine the action identified by the voice input, one or more of the processors are to: convert the voice input into text; and process the text to determine the action identified by the voice input.
using biometric data, e.g. fingerprints, iris scans or voiceprints · CPC title
by observing the pattern of computer usage, e.g. typical user behaviour · CPC title
involving the use of external additional devices, e.g. dongles or smart cards · CPC title
communicating wirelessly · CPC title
Classification, e.g. identification · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.