Detection of potential exfiltration of audio data from digital assistant applications

US10565985B1 · US · B1

Patent metadata
Field               Value
Publication number  US-10565985-B1
Application number  US-201816131439-A
Country             US
Kind code           B1
Filing date         Sep 14, 2018
Priority date       Sep 14, 2018
Publication date    Feb 18, 2020
Grant date          Feb 18, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure is generally related to a data processing system to detect potential exfiltration of audio data by agent applications. The data processing system can identify, from an I/O record, an input received from the digital assistant application via a microphone of a client device, an output received from the agent application after the input, and a microphone status for the microphone. The data processing system can determine that the output is terminal based on the input and the output. The data processing system can identify the microphone status as in the enabled state subsequent to the input. The data processing system can determine that the agent application is unauthorized to access audio data acquired via the microphone of the client device based on determining that the output is terminal and identifying the microphone status as enabled.
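
The core check described in the abstract can be illustrated with a minimal Python sketch. Everything named here is hypothetical: the `Exchange` record layout, the `ANTICIPATORY_MARKERS` list, and the function names are assumptions made for illustration, and the substring test stands in for whatever analysis the patent actually uses to decide that an output is terminal. For brevity the sketch inspects only the output, whereas the abstract bases the decision on both the input and the output.

```python
from dataclasses import dataclass

# Hypothetical markers suggesting the agent expects a follow-up from the user.
# The source does not enumerate keywords; these are illustrative only.
ANTICIPATORY_MARKERS = ("?", "would you like", "please say")


@dataclass
class Exchange:
    """One entry of an assumed I/O record."""
    input_text: str          # strings parsed from the spoken input
    output_text: str         # agent application's response after the input
    mic_enabled_after: bool  # microphone status subsequent to the input


def is_terminal(output_text: str) -> bool:
    """Treat an output as terminal when it does not invite a follow-up."""
    lowered = output_text.lower()
    return not any(marker in lowered for marker in ANTICIPATORY_MARKERS)


def is_potential_exfiltration(exchange: Exchange) -> bool:
    """Flag an exchange whose output is terminal while the microphone stays enabled."""
    return is_terminal(exchange.output_text) and exchange.mic_enabled_after
```

In this sketch, an agent that answers "It is sunny today." and leaves the microphone enabled would be flagged, while an agent that asks "For how many people?" would not, because that output anticipates another spoken input.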

First claim

Opening claim text (preview).

What is claimed:

1. A system to detect potential exfiltration of audio data by agent applications that interface with digital assistant applications, comprising: a data processing system having an exchange logger component, a sequence analyzer component, a microphone use detector component, an extrusion detector component, and an agent manager component; the exchange logger component to access a log database that maintains an input/output (I/O) record between an agent application that executes on a client device and a digital assistant application that executes on the client device; the exchange logger component to identify, from the I/O record of the log database, an input received from the digital assistant application via a microphone of the client device, an output received from the agent application that interfaces with the digital assistant application subsequent to the input, and a microphone status that indicates one of an enabled state and a disabled state of the microphone subsequent to the input; the sequence analyzer component to determine that the output is terminal to the input based on a first set of strings included in the input and a second set of strings included in the output; the microphone use detector component to identify the microphone status of the microphone as in the enabled state subsequent to the input; the extrusion detector component to determine that the agent application is unauthorized to access audio data acquired via the microphone of the client device based on the determination that the output is terminal to the input and the identification of the microphone status as in the enabled state; and the agent manager component to perform an action on at least one of the agent application and the digital assistant application, responsive to the determination that the agent application is unauthorized to access the audio data acquired via the microphone.

2. The system of claim 1, comprising: the sequence analyzer component to: identify, from the second set of strings included in the output, a subset of strings from at least one of a starting index and an end index; compare the subset of strings to a plurality of predefined keywords that correlate to one of terminal responses or anticipatory responses; and determine that the output is terminal based on the comparison of the subset of strings to the plurality of predefined keywords.

3. The system of claim 1, comprising: the sequence analyzer component to: compare the first set of strings included in the input to a plurality of predefined keywords that correlate to expectative queries that indicate a subsequent response is to be anticipatory; and determine that the input is expectative based on the comparison of the first set of strings to the plurality of predefined keywords; and the extrusion detector component to determine that the agent application is unauthorized to access audio data acquired via the microphone of the client device based on the determination that the output is terminal and the input is expectative.

4. The system of claim 1, comprising: the sequence analyzer component to: establish an I/O exchange model to correlate sets of strings communicated between the agent application and the digital assistant application with terminal responses; apply, to the I/O exchange model, the first set of strings included in the input and the second set of strings included in the output to obtain a confidence metric that indicates a likelihood that the output is terminal; and determine, responsive to a determination that the confidence metric is greater than a threshold, that the output is terminal.

5. The system of claim 1, comprising: the exchange logger component to identify, from the I/O record of the log database, a second input received from the digital assistant application via the microphone of the client device subsequent to the output from the agent application, the second input including a null marker indicating a lack of strings parsed from an audio input signal corresponding to the second input; the sequence analyzer to identify the null marker from the second input received from the digital assistant application; the microphone use detector component to identify the microphone status of the microphone as in the enabled state subsequent to the output determined to be the terminal response; and the extrusion detector component to determine that the agent application is unauthorized to access audio data acquired via the microphone of the client device based on the identification of the null marker in the second response and the identification of the microphone status as in the enabled state subsequent to the output determined to be the terminal response.

6. The system of claim 1, comprising: the exchange logger component to identify, from the I/O record of the log database, a plurality of inputs received from the digital assistant application via the microphone of the client device, a plurality of output received from the agent application that interfaces with the digital assistant application, the sequence analyzer component to count, from the I/O record, a number of occurrences of a first input of the plurality of inputs directly followed by a second input of the plurality of inputs without an intermediate output of the plurality of outputs; and the extrusion detector component to determine, based on a determination that the number of occurrences of the first input directly followed by the second input without the intermediate output is greater than a threshold number, that the agent application is unauthorized to access audio data acquired via the microphone of the client device.

7. The system of claim 1, comprising: the exchange logger component to identify, from the I/O record of the log database, the microphone status that indicates one of the enabled state and the disabled state of the microphone subsequent to the output for a specified time duration; the microphone use detector component to identify the specified time duration of the microphone status in the enabled state subsequent to the output; and the extrusion detector component to determine that the agent application is unauthorized to access the audio data acquired via the microphone of the client device based on the determination that the specified time duration of microphone status in the enabled state is greater than the threshold limit.

8. The system of claim 1, comprising: the exchange logger component to identify, from the I/O record of the log database, a device modality of the client device that executes the agent application and the digital assistant application; and the sequence analyzer component to determine that the output is terminal to the input based on the device modality of the client device.

9. The system of claim 1, comprising: the agent manager component to perform, responsive to the determination that the agent application is unauthorized to access the audio data acquired via the microphone, the action to cause the digital assistant application to present an alert that indicates that the agent application is unauthorized to access the audio data.

10. The system of claim 1, comprising: the agent manager component to perform, responsive to the determination that the agent application is unauthorized to access the audio data acquired via the microphone, the action to send a notification to an agent service associated with the agent application.

11. The system of claim 1, comprising: the agent manager component to perform, responsive to the determination that the agent application is unauthorized to access the audio data acquired via the microphone, the action to cause t
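
Two of the dependent claims in the preview describe simple threshold tests: claim 6 counts inputs that are directly followed by another input with no output in between, and claim 7 compares how long the microphone stays enabled after an output against a limit. The Python sketch below illustrates both under stated assumptions. The event-log format (an ordered sequence of `"input"` and `"output"` labels), the function names, and the use of seconds as the unit are all hypothetical and are not taken from the patent.

```python
from typing import Sequence


def count_unanswered_inputs(events: Sequence[str]) -> int:
    """Count inputs directly followed by another input with no output between.

    `events` is an assumed, time-ordered log of "input" / "output" labels.
    """
    return sum(
        1
        for current, following in zip(events, events[1:])
        if current == "input" and following == "input"
    )


def exceeds_consecutive_input_threshold(events: Sequence[str], threshold: int) -> bool:
    """Claim 6 style test: flag when the count is greater than the threshold."""
    return count_unanswered_inputs(events) > threshold


def exceeds_mic_duration(enabled_seconds_after_output: float, limit_seconds: float) -> bool:
    """Claim 7 style test: flag when the enabled duration is greater than the limit."""
    return enabled_seconds_after_output > limit_seconds
```

Both tests use a strict "greater than" comparison, matching the wording of the claims, so a value exactly equal to the threshold is not flagged.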

Assignees

Google LLC

Inventors

Classifications

  • Protecting personal data, e.g. for financial or medical purposes · CPC title

  • Execution procedure of a spoken command · CPC title

  • involving event detection and direct action · CPC title

  • involving covert channels, i.e. data leakage between processes (inhibiting the analysis of circuitry or operation with measures against power attack G06F21/755) · CPC title

  • Protecting data · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10565985B1 cover?
The present disclosure is generally related to a data processing system to detect potential exfiltration of audio data by agent applications. The data processing system can identify, from an I/O record, an input received from the digital assistant application via a microphone of a client device, an output received from the agent application after the input, …
Who is the assignee on this patent?
Google LLC
What technology area does this patent fall under?
Primary CPC classification G10L15/18. Mapped technology areas include Physics.
When was this patent published?
Publication date: Feb 18, 2020 (kind code B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).