Authentication of packetized audio signals

US10541997B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10541997-B2
Application number  US-201615395729-A
Country             US
Kind code           B2
Filing date         Dec 30, 2016
Priority date       Dec 30, 2016
Publication date    Jan 21, 2020
Grant date          Jan 21, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure is generally directed to a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.

First claim

Opening claim text (preview).

The invention claimed is:

1. A system to authenticate packetized audio signals in a voice activated computer network environment, comprising: a data processing system comprising at least one processor and memory; a natural language processor component executed by the data processing system to receive, via an interface of the data processing system, data packets comprising an input audio signal detected by a sensor of a client device; the natural language processor component to parse the input audio signal to identify a request and a trigger keyword corresponding to the request; a direct action application programming interface of the data processing system to generate, based on the trigger keyword, a first action data structure responsive to the request, the first action data structure comprising one or more parameters packaged together to cause a third party provider device to perform an action; a conversational application programming interface of the data processing system to establish, based on the first action data structure, a communication session between the third party provider device and the client device; a network security appliance of the data processing system to: compare a parameter of the first action data structure with a first characteristic of the input audio signal; and generate a first alarm condition responsive to the first characteristic of the input audio signal not matching the parameter of the first action data structure; a content selector component of the data processing system to select, responsive to generation of the first alarm condition and based on the trigger keyword, a content item for provision to the client device for authentication of the input audio signal; the network security appliance to: receive a second set of data packets from the client device, the second set of data packets comprising a response audio signal comprising a response to the content item provisioned to the client device for authentication of the input audio signal; generate a second alarm condition responsive to a second characteristic of the response audio signal not matching a second parameter of the first action data structure; and transmit, responsive to generation of the second alarm condition, an instruction to the third party provider device to disable the communication session established with the client device.

2. The system of claim 1, the network security appliance to: determine the first characteristic of the input audio signal; and determine the second characteristic of the response audio signal, wherein the first characteristic and the second characteristic comprise at least one of a voiceprint, a keyword, a number of voices detected, an identification of the client device, or a location of a source of the input audio signal.

3. The system of claim 1, wherein the first characteristic is different from the second characteristic.

4. The system of claim 1, the network security appliance to: receive a location of a second client device; determine a distance between a location of the client device and the location of the second client device; and detect the first alarm condition based on the distance between a location of the client device and the location of the second client device.

5. The system of claim 4, the network security appliance to: detect the first alarm condition based on the distance between a location of the client device and the location of the second client device being above a predetermined threshold.

6. The system of claim 1, wherein the content item comprises instructions for generating an auditory signal at the client device.

7. The system of claim 6, wherein the auditory signal comprises a security question.

8. The system of claim 1, the network security appliance to: disable the first action data structure responsive to detection of the first alarm condition.

9. The system of claim 1, the content selector component to: generate instructions to capture the second characteristic of the response audio signal in the content item.

10. The system of claim 1, the network security appliance to: terminate the communication session established with the client device in response to an interaction with the content item.

11. The system of claim 1, the network security appliance to determine an amount of computational resources required to complete the request.

12. The system of claim 11, the network security appliance to set the first alarm condition responsive to the amount of computational resources being above a predetermined threshold.

13. The system of claim 1, the natural language processor component to parse the response audio signal to identify a passphrase.

14. The system of claim 13, the network security appliance to set the second alarm condition based on the passphrase not matching a stored passphrase.

15. The system of claim 13, wherein the passphrase is the second characteristic.

16. A method for authenticating packetized audio signals in a voice activated computer network environment, comprising: receiving, by a natural language processor component executed by a data processing system, data packets comprising an input audio signal detected by a sensor of a client device; parsing, by the natural language processor component, the input audio signal to identify a request and a trigger keyword corresponding to the request; generating, by a direct action application programming interface of the data processing system, based on the trigger keyword, a first action data structure responsive to the request, the first action data structure comprising one or more parameters packaged together to cause a third party provider device to perform an action; generating, by a conversational application programming interface of the data processing system and based on the first action data structure, a communication session between the third party provider device and the client device; comparing, by a network security appliance of the data processing system, a parameter of the first action data structure with a first characteristic of the input audio signal; generating, by the network security appliance, a first to detect an alarm condition responsive to the first characteristic of the input audio signal not matching the parameter of the first action data structure; selecting, by a content selector component executed by the data processing system and responsive to generation of the first alarm condition, a content item for provision to the client device for authentication of the input audio signal; receiving, by the network security appliance, a second set of data packets from the client device, the second set of data packets comprising a response audio signal comprising a response to the content item provisioned to the client device for authentication of the input audio signal; generating, by the network security appliance, a second alarm condition responsive to a second characteristic of the response audio signal not matching a second parameter of the first action data structure; and transmitting, by the network security appliance, responsive to the generation of the second alarm condition, an instruction to the third party provider device to disable the communication session established with the client device.

17. The method of claim 16, comprising: determining, by the network security appliance, the first characteristic of the input audio signal; and determining, by the network security appliance, the second characteristic of the response audio signal, wherein the first characteristic and
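
The two-stage check recited in claims 1 and 16, together with the distance test of claims 4 and 5, can be illustrated with a minimal Python sketch. Every name here (ActionDataStructure, Session, authenticate, distance_alarm) is hypothetical and chosen only for illustration. Plain string equality stands in for voiceprint and passphrase matching, and the haversine formula is an assumed way to compute the distance between two devices; the claims do not specify either.

```python
import math
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass
class ActionDataStructure:
    """Hypothetical stand-in for the 'first action data structure'."""
    request: str
    trigger_keyword: str
    expected_voiceprint: str   # parameter compared with the first characteristic
    expected_passphrase: str   # second parameter compared with the response


@dataclass
class Session:
    """Hypothetical stand-in for the session with the third-party device."""
    active: bool = True


def select_content_item(action: ActionDataStructure) -> str:
    # Assumed form of the content item: a spoken security prompt.
    return f"To continue with '{action.trigger_keyword}', please say your passphrase."


def authenticate(action: ActionDataStructure,
                 voiceprint: str,
                 get_response: Callable[[str], str],
                 session: Session) -> str:
    # First alarm: the input characteristic does not match the parameter.
    if voiceprint == action.expected_voiceprint:
        return "allowed"
    # The alarm triggers a content item that asks the user to authenticate.
    response = get_response(select_content_item(action))
    # Second alarm: the response characteristic does not match either.
    if response != action.expected_passphrase:
        session.active = False  # models the instruction to disable the session
        return "session disabled"
    return "allowed after challenge"


def distance_km(a: Tuple[float, float], b: Tuple[float, float]) -> float:
    # Haversine great-circle distance between two (lat, lon) points in degrees.
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))


def distance_alarm(client: Tuple[float, float],
                   second_client: Tuple[float, float],
                   threshold_km: float) -> bool:
    # Alarm when the two devices are farther apart than the threshold.
    return distance_km(client, second_client) > threshold_km


if __name__ == "__main__":
    action = ActionDataStructure("book a ride", "ride", "vp-alice", "blue heron")
    session = Session()
    outcome = authenticate(action, "vp-unknown", lambda prompt: "wrong", session)
    print(outcome, session.active)
```

In this sketch a mismatch on the first characteristic does not end the session by itself; it only triggers the challenge, and the session is disabled only when the response also fails to match, which mirrors the order of the two alarm conditions in claim 1.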

Assignees

Google Inc, Google LLC

Inventors

Classifications

  • Managing session states for stateless protocols; Signalling session states; State transitions; Keeping-state mechanisms

  • using biometric data, e.g. fingerprints, iris scans or voiceprints

  • Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction

  • for comparison or discrimination

  • the user being prompted to utter a password or a predefined phrase

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10541997B2 cover?
The present disclosure is generally directed to a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across th…
Who is the assignee on this patent?
Google Inc, Google LLC
What technology area does this patent fall under?
Primary CPC classification H04L63/0861. Mapped technology areas include Electricity.
When was this patent published?
Publication date Jan 21, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).