What technology area does this patent fall under?

Primary CPC classification H04M3/533. Mapped technology areas include Electricity.

When was this patent published?

Publication date Tue Dec 26 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Systems for identifying the answering party of an automated voice call

US11856144B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11856144-B2
Application number	US-202117498405-A
Country	US
Kind code	B2
Filing date	Oct 11, 2021
Priority date	Jun 9, 2020
Publication date	Dec 26, 2023
Grant date	Dec 26, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are systems and methods for determining whether an automated telephone call has been answered by a particular user, a different person, or a voicemail service. In response to determining that the call has been answered by the user, a different person, or a voicemail service, the systems and methods can output different respective messages, initiate messaging to the user via alternate communication paths, or perform other actions.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory, computer-readable medium having instructions stored thereon that, when executed by one or more processors, causes a system to: receive audio data associated with a call to a user device associated with a user; detect an utterance from the audio data; perform a comparison of the utterance to stored data; determine, based on the comparison, whether the utterance corresponds to a voicemail greeting; responsive to the utterance not corresponding to the voicemail greeting: detect voice biometric data associated with the audio data; perform an identification comparison of the voice biometric data to stored voice biometric data associated with the user to determine an identification level of similarity between the voice biometric data and the stored voice biometric data; and responsive to determining that the identification level of similarity is greater than or equal to a first predetermined threshold, output a first recording that includes information for the user; responsive to the utterance corresponding to the voicemail greeting: identify that the voicemail greeting is a customized voicemail greeting; identify, based on the comparison and a voicemail greeting database, a message duration of the customized voicemail greeting from the stored data; in response to identifying the message duration, disable audio monitoring of the call; and delay output of a second recording until the message duration has expired. 2. The non-transitory, computer-readable medium of claim 1 , wherein: the call is a video call, and the instructions, when executed by the one or more processors, further cause the system to: receive video call data comprising video data and the audio data; and extract the audio data from the video call data. 3. The non-transitory, computer-readable medium of claim 1 , wherein: the stored data comprises data indicative of one or more default voicemail greetings, and determining that the utterance corresponds to the voicemail greeting comprises: determining, based on the comparison, a level of similarity between the utterance and data indicative of a default voicemail greeting of the one or more default voicemail greetings, and determining that the level of similarity is greater than or equal to a predetermined threshold. 4. The non-transitory, computer-readable medium of claim 1 , wherein the instructions, when executed by the one or more processors, further cause the system to: detect an audible prompt to begin recording an initial utterance; determine a message duration associated with the initial utterance, the message duration being indicative an amount of time between the call becoming connected and detection of the audible prompt to begin recording the initial utterance; store the message duration and data indicative of the initial utterance in the stored data. 5. The non-transitory, computer-readable medium of claim 1 , wherein the instructions, when executed by the one or more processors, further cause the system to: initiate an attempt to contact a user of the user device via a communication path that is different from the call. 6. A non-transitory, computer-readable medium having instructions stored thereon that, when executed by one or more processors, causes a system to: receive audio data associated with a call to a user device associated with a user; detect an utterance from the audio data; perform a comparison of the utterance to stored data; determine, based on the comparison, whether the utterance corresponds to a voicemail greeting; responsive to the utterance not corresponding to the voicemail greeting: detect voice biometric data associated with the audio data; perform an identification comparison of the voice biometric data to stored voice biometric data associated with the user to determine an identification level of similarity between the voice biometric data and the stored voice biometric data; and responsive to determining that the identification level of similarity is greater than or equal to a first predetermined threshold, output a first recording that includes information for the user; responsive to the utterance corresponding to the voicemail greeting: identify that the voicemail greeting is a customized voicemail greeting; identify a message duration of the customized voicemail greeting from the stored data; and delay output of a second recording until the message duration has expired. 7. The non-transitory, computer-readable medium of claim 6 , wherein: determining whether the utterance corresponds to the customized voicemail greeting further comprises: determine, based on the comparison, a level of similarity between the utterance and the customized voicemail greeting, and determine that the utterance corresponds to the customized voicemail greeting in response to determining that the level of similarity is greater than or equal to a second predetermined threshold. 8. The non-transitory, computer-readable medium of claim 7 , wherein the first recording is a user-specific recording that includes confidential information associated with the user, and the second recording is a generalized recording that omits confidential information associated with the user. 9. The computer-readable medium of claim 7 , wherein the stored voice biometric data comprises the saved customized voicemail greeting. 10. The computer-readable medium of claim 6 , wherein the instructions, when executed by the one or more processors, further cause the system to: detect baseline voice biometric data associated with the user during an audio interaction with the user or a customized voicemail greeting associated with the user; and store the baseline voice biometric data as the stored voice biometric data. 11. A non-transitory, computer-readable medium having instructions stored thereon that, when executed by one or more processors, causes a system to: receive audio data associated with a call to a user device associated with a user; detect an utterance from the audio data; identify, based on the audio data, a spoken language of the utterance; perform a comparison of the utterance to stored data; determine, based on the comparison, whether the utterance corresponds to a voicemail greeting; responsive to the utterance not corresponding to the voicemail greeting: detect voice biometric data associated with the audio data; perform an identification comparison of the voice biometric data to stored voice biometric data associated with the user to determine an identification level of similarity between the voice biometric data and the stored voice biometric data; and responsive to determining that the identification level of similarity is greater than or equal to a first predetermined threshold, output a first recording that includes information for the user in the spoken languages; responsive to the utterance corresponding to the voicemail greeting: identify that the voicemail greeting is a customized voicemail greeting; identify a message duration of the customized voicemail greeting from the stored data; and delay output of a second recording until the message duration has expired. 12. The non-transitory, computer-readable medium of claim 11 , wherein the instructions, when executed by the one or more processors, further cause the system to: associate the spoken language with the user device in a preferred language database. 13. The non-transitory, computer-readable medium of claim 11 , wherein determining, based on the comparison, whether the utterance corresponds to a voicemail greeting further comprises: perform a

Assignees

Capital One Services Llc

Inventors

Classifications

H04M3/533Primary
Voice mail systems · CPC title
G10L15/22
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
H04M1/82
Line monitoring circuits for call progress or status discrimination {(for detecting caller identity H04M1/573)} · CPC title
H04M3/436
Arrangements for screening incoming calls {, i.e. evaluating the characteristics of a call before deciding whether to answer it (based on the calling party profile H04M3/42059; based on location H04M3/42348; based on presence H04M3/42365; diversion H04M3/54)} · CPC title
H04M2250/74
with voice recognition means · CPC title

Patent family

Related publications grouped by family.

View patent family 76355239

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11856144B2 cover?: Disclosed herein are systems and methods for determining whether an automated telephone call has been answered by a particular user, a different person, or a voicemail service. In response to determining that the call has been answered by the user, a different person, or a voicemail service, the systems and methods can output different respective messages, initiate messaging to the user via alt…
Who is the assignee on this patent?: Capital One Services Llc
What technology area does this patent fall under?: Primary CPC classification H04M3/533. Mapped technology areas include Electricity.
When was this patent published?: Publication date Tue Dec 26 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).