Systems for identifying the answering party of an automated voice call
US-11146686-B1 · Oct 12, 2021 · US
US11856144B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11856144-B2 |
| Application number | US-202117498405-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 11, 2021 |
| Priority date | Jun 9, 2020 |
| Publication date | Dec 26, 2023 |
| Grant date | Dec 26, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Disclosed herein are systems and methods for determining whether an automated telephone call has been answered by a particular user, a different person, or a voicemail service. In response to determining that the call has been answered by the user, a different person, or a voicemail service, the systems and methods can output different respective messages, initiate messaging to the user via alternate communication paths, or perform other actions.
Opening claim text (preview).
What is claimed is: 1. A non-transitory, computer-readable medium having instructions stored thereon that, when executed by one or more processors, causes a system to: receive audio data associated with a call to a user device associated with a user; detect an utterance from the audio data; perform a comparison of the utterance to stored data; determine, based on the comparison, whether the utterance corresponds to a voicemail greeting; responsive to the utterance not corresponding to the voicemail greeting: detect voice biometric data associated with the audio data; perform an identification comparison of the voice biometric data to stored voice biometric data associated with the user to determine an identification level of similarity between the voice biometric data and the stored voice biometric data; and responsive to determining that the identification level of similarity is greater than or equal to a first predetermined threshold, output a first recording that includes information for the user; responsive to the utterance corresponding to the voicemail greeting: identify that the voicemail greeting is a customized voicemail greeting; identify, based on the comparison and a voicemail greeting database, a message duration of the customized voicemail greeting from the stored data; in response to identifying the message duration, disable audio monitoring of the call; and delay output of a second recording until the message duration has expired. 2. The non-transitory, computer-readable medium of claim 1 , wherein: the call is a video call, and the instructions, when executed by the one or more processors, further cause the system to: receive video call data comprising video data and the audio data; and extract the audio data from the video call data. 3. The non-transitory, computer-readable medium of claim 1 , wherein: the stored data comprises data indicative of one or more default voicemail greetings, and determining that the utterance corresponds to the voicemail greeting comprises: determining, based on the comparison, a level of similarity between the utterance and data indicative of a default voicemail greeting of the one or more default voicemail greetings, and determining that the level of similarity is greater than or equal to a predetermined threshold. 4. The non-transitory, computer-readable medium of claim 1 , wherein the instructions, when executed by the one or more processors, further cause the system to: detect an audible prompt to begin recording an initial utterance; determine a message duration associated with the initial utterance, the message duration being indicative an amount of time between the call becoming connected and detection of the audible prompt to begin recording the initial utterance; store the message duration and data indicative of the initial utterance in the stored data. 5. The non-transitory, computer-readable medium of claim 1 , wherein the instructions, when executed by the one or more processors, further cause the system to: initiate an attempt to contact a user of the user device via a communication path that is different from the call. 6. A non-transitory, computer-readable medium having instructions stored thereon that, when executed by one or more processors, causes a system to: receive audio data associated with a call to a user device associated with a user; detect an utterance from the audio data; perform a comparison of the utterance to stored data; determine, based on the comparison, whether the utterance corresponds to a voicemail greeting; responsive to the utterance not corresponding to the voicemail greeting: detect voice biometric data associated with the audio data; perform an identification comparison of the voice biometric data to stored voice biometric data associated with the user to determine an identification level of similarity between the voice biometric data and the stored voice biometric data; and responsive to determining that the identification level of similarity is greater than or equal to a first predetermined threshold, output a first recording that includes information for the user; responsive to the utterance corresponding to the voicemail greeting: identify that the voicemail greeting is a customized voicemail greeting; identify a message duration of the customized voicemail greeting from the stored data; and delay output of a second recording until the message duration has expired. 7. The non-transitory, computer-readable medium of claim 6 , wherein: determining whether the utterance corresponds to the customized voicemail greeting further comprises: determine, based on the comparison, a level of similarity between the utterance and the customized voicemail greeting, and determine that the utterance corresponds to the customized voicemail greeting in response to determining that the level of similarity is greater than or equal to a second predetermined threshold. 8. The non-transitory, computer-readable medium of claim 7 , wherein the first recording is a user-specific recording that includes confidential information associated with the user, and the second recording is a generalized recording that omits confidential information associated with the user. 9. The computer-readable medium of claim 7 , wherein the stored voice biometric data comprises the saved customized voicemail greeting. 10. The computer-readable medium of claim 6 , wherein the instructions, when executed by the one or more processors, further cause the system to: detect baseline voice biometric data associated with the user during an audio interaction with the user or a customized voicemail greeting associated with the user; and store the baseline voice biometric data as the stored voice biometric data. 11. A non-transitory, computer-readable medium having instructions stored thereon that, when executed by one or more processors, causes a system to: receive audio data associated with a call to a user device associated with a user; detect an utterance from the audio data; identify, based on the audio data, a spoken language of the utterance; perform a comparison of the utterance to stored data; determine, based on the comparison, whether the utterance corresponds to a voicemail greeting; responsive to the utterance not corresponding to the voicemail greeting: detect voice biometric data associated with the audio data; perform an identification comparison of the voice biometric data to stored voice biometric data associated with the user to determine an identification level of similarity between the voice biometric data and the stored voice biometric data; and responsive to determining that the identification level of similarity is greater than or equal to a first predetermined threshold, output a first recording that includes information for the user in the spoken languages; responsive to the utterance corresponding to the voicemail greeting: identify that the voicemail greeting is a customized voicemail greeting; identify a message duration of the customized voicemail greeting from the stored data; and delay output of a second recording until the message duration has expired. 12. The non-transitory, computer-readable medium of claim 11 , wherein the instructions, when executed by the one or more processors, further cause the system to: associate the spoken language with the user device in a preferred language database. 13. The non-transitory, computer-readable medium of claim 11 , wherein determining, based on the comparison, whether the utterance corresponds to a voicemail greeting further comprises: perform a
Voice mail systems · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Line monitoring circuits for call progress or status discrimination {(for detecting caller identity H04M1/573)} · CPC title
Arrangements for screening incoming calls {, i.e. evaluating the characteristics of a call before deciding whether to answer it (based on the calling party profile H04M3/42059; based on location H04M3/42348; based on presence H04M3/42365; diversion H04M3/54)} · CPC title
with voice recognition means · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.