Computer-based systems configured to monitor a communication session and generate a training session and methods of use thereof

US12413671B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12413671-B2
Application numberUS-202318450886-A
CountryUS
Kind codeB2
Filing dateAug 16, 2023
Priority dateAug 16, 2023
Publication dateSep 9, 2025
Grant dateSep 9, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In some embodiments, the present disclosure provides an exemplary method that may include steps of monitoring, a conversation script between a call center agent and a customer; utilizing, a speech-to-text deep machine learning model to transcribe the audio call to text; utilizing, a natural language processing deep machine learning model to map intent mappings of the audio call; utilizing, a similarity measurement model to determine a semantic similarity between predefined intent mappings and the intent mappings of the call audio text; determining, an error based on the semantic similarity in the intent mapping call audio text; determining a training session based on the error.

First claim

Opening claim text (preview).

What is claimed: 1. A computer-implemented method comprising: retrieving, by at least one processor, a predefined call script from a predefined call script library, the predefined call script having predefined intent mappings encoding predefined text representing predefined correct dialogue associated with a sample topic; utilizing, by the at least one processor, a computer-based monitoring module to detect an error in a call script conversation based at least in part on the predefined call script; wherein the computer-based monitoring module is configured to: utilize at least one speech-to-text deep machine learning model to transcribe a call script conversation comprising text representative of the call script conversation; utilizing, by at least one processor, at least one natural language processing deep machine learning model to generate a plurality of call intent mappings associated from the script conversation, wherein the at least one natural language processing deep machine learning model comprises a plurality of parameters configured to encode the text of the call script into the plurality of call intent mappings to produce semantic encodings indicative of call script; utilizing, by the at least one processor, at least one similarity measurement model to determine a semantic similarity between the predefined intent mappings and the plurality of call intent mappings based at least in part on at least one similarity measure; determining, based on the semantic similarity, at least one error in at least one call intent mapping of the plurality of call intent mappings; determining, by the at least one processor, a user training need based at least in part on the at least one error; selecting, by the at least one processor, training data for a training call based at least in part on the at least one error and the script conversation; selecting, by the at least one processor, a training call voice for the training call based at least in part on the training data; initiating, by the at least one processor, the training call by calling a user and loading the training data in a user dashboard of a user computing device associated with the user; utilizing, by the at least one processor, a call generation module to automatically generate caller speech for the training call based at least in part on the training data and the training call voice; wherein the call generation module is configured to: receive user speech data representative of speech performed by a user during the training call in response to the generated caller speech; utilizing, by the at least one processor, the at least one speech-to-text deep machine learning model to transcribe a user speech script representative of the user speech data; utilizing, by the at least one processor, the at least one natural language processing deep machine learning model to generate at least one user speech intent mapping associated with the user speech data; detecting, by the at least one processor, a new error in the user speech script based at least in part on the call script and the user speech script; determining, by the at least one processor, a user training need based at least in part on, the at least one new error; and determining, by the at least one processor, a training session initiation based at least in part on the user training need and the at least one new error. 2. The computer-implemented method of claim 1 , wherein the predefined call script library is generated by a deep machine learning algorithm. 3. The computer-implemented method of claim 1 , wherein the predefined call script library is at least in part generated by a person. 4. The computer-implemented method of claim 1 , wherein the computer-based monitoring module schedules a training session when the computer-based monitoring module detects an error. 5. The computer-implemented method of claim 1 , wherein the computer-based monitoring module monitors a user call time and schedules a training session when the computer-based monitoring module detects an error and the user call time on call exceeds a threshold. 6. The computer-implemented method of claim 1 , wherein the semantic similarity is determined from at least a lexical phrase of the call. 7. A computer-implemented method according to claim 1 , wherein a training session schedule of the user is determined by an administrator/manager. 8. A computer-implemented method according to claim 1 , wherein upon detecting an error the computer-based monitoring module notifies an administrator/manager of the user. 9. A system comprising: a non-transient computer memory, storing software instructions; and a least one processor of a first computing devices associated with a user; wherein, then at least one processor executes the software instructions, the first computing device is programmed to: retrieve, by at least one processor, a predefined call script from a predefined call script library, the predefined call script having predefined intent mappings encoding predefined text representing predefined correct dialogue associated with a sample topic; utilize, by the at least one processor, a computer-based monitoring module to detect an error in a call script conversation based at least in part on the call script; wherein the computer-based monitoring module is configured to: utilize at least one speech-to-text deep machine learning model to transcribe a call script comprising text representative of the call script of the conversation; utilize, by the at least on processor, at least one natural language processing deep machine learning model to generate a plurality of call intent mappings associated from the conversation, wherein the at least one natural language processing deep machine learning model comprises a plurality of parameters configured to encode the text of the call script into the plurality of call intent mappings to produce semantic encodings indicative of call script; utilize, by the at least one processor, at least one similarity measurement model to determine a semantic similarity between the predefined intent mappings and the plurality of call intent mappings based at least in part on at least one similarity measure; determining, based on the semantic similarity, at least one error in at least one call intent mapping of the plurality of call intent mappings; determine, by the at least one processor, a user training need based at least in part on the at least one error; select, by the at least one processor, training data for a training call based at least in part on the at least one error and the topic; select, by the at least one processor, a training call voice for the training call based at least in part on the training data; initiate, by the at least one processor, the training call by calling the user and loading the training data in a user dashboard of a user computing device associated with the user; utilize, by the at least one processor, a call generation module to automatically generate caller speech for the training call based at least in part on the training data and the training call voice; wherein the call generation module is configured to: receive user speech data representative of speech performed by the user during the training call in response to the generated caller speech; utilize, by the at least one processor, the at least one speech-to-text deep machine learning model to transcribe a user speech script representative of the user speech data; utilize, by the at least one processor, the at least one natural language processing deep machine learning model to generate at least one user speech intent mapping associated with the user speech data; detect, by the at least one processor, a new error in

Assignees

Inventors

Classifications

  • Discourse or dialogue representation · CPC title

  • using statistical methods · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Agent or workforce training · CPC title

  • using speech recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12413671B2 cover?
In some embodiments, the present disclosure provides an exemplary method that may include steps of monitoring, a conversation script between a call center agent and a customer; utilizing, a speech-to-text deep machine learning model to transcribe the audio call to text; utilizing, a natural language processing deep machine learning model to map intent mappings of the audio call; utilizing, a si…
Who is the assignee on this patent?
Capital One Services Llc
What technology area does this patent fall under?
Primary CPC classification H04M3/5175. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Sep 09 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).