Method and system for generating and correcting classification models

US11537820B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11537820-B2
Application numberUS-202016789811-A
CountryUS
Kind codeB2
Filing dateFeb 13, 2020
Priority dateAug 27, 2014
Publication dateDec 27, 2022
Grant dateDec 27, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Data having some similarities and some dissimilarities may be clustered or grouped according to the similarities and dissimilarities. The data may be clustered using agglomerative clustering techniques. The clusters may be used as suggestions for generating groups where a user may demonstrate certain criteria for grouping. The system may learn from the criteria and extrapolate the groupings to readily sort data into appropriate groups. The system may be easily refined as the user gains an understanding of the data.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer-readable storage media storing computer-readable instructions that, when executed by a processor of a system, cause the system to: generate a first classification model based on a first indication that a first portion of a first data set is associated with a first bucket and a second classification model based on a second indication that the first portion of the first data set is associated with a second bucket using data comprising structured data, unstructured data, and partially structured data; compare, using a comparator, (1) a first score using test results from the first classification model based on a data set of natural language inputs from conversations of calls or chats and (2) a second score using test results from the second classification model based on the data set of natural language inputs from the conversations of calls or chats; when the first score agrees with the second score within a threshold range, validate the first classification model; when the first score is different from the second score with respect to the threshold range, display, on a display of a user device, a clarification question for a user; when an indication is received from the user, in response to the clarification question, that the first classification model is correct, update the second classification model based on the indication; and when an indication is received from the user, in response to the clarification question, that the second classification model is correct, update the first classification model based on the indication. 2. The non-transitory computer-readable storage media of claim 1 , wherein the first classification model comprises a symbolic language model; and the second classification model comprises a statistical language model. 3. The non-transitory computer-readable storage media of claim 1 , wherein the clarification question comprises a question asking the user to select an appropriate answer to an input statement. 4. The non-transitory computer-readable storage media of claim 1 , wherein the data set comprises disparate types of data. 5. The non-transitory computer-readable storage media of claim 1 , wherein the data set comprises imported data. 6. The non-transitory computer-readable storage media of claim 1 , wherein the data set comprises markup language data. 7. A non-transitory computer-readable storage media storing computer-readable instructions that, when executed by a processor of a system, cause the system to: generate a first classification model based on a first indication that a first portion of a first data set is associated with a first bucket and a second classification model based on a second indication that the first portion of the first data set is associated with a second bucket using data comprising structured data, unstructured data, and partially structured data; compare, by a comparator, (1) a first score using test results from the first classification model based on a data set of natural language inputs from conversations of calls or chats and (2) a second score using test results from the second classification model based on the data set of natural language inputs from the conversations of calls or chats; when the first score agrees with the second score within a threshold range, validate the first classification model; when the first score is different from the second score within a first threshold difference, display, on a display of a user device, a first clarification question for a user, and update the first classification model based on an answer to the first clarification question; and when the first score is different from the second score within a second threshold difference and outside of the first threshold difference, display on a display of the user device, a second clarification question for the user, and update the first classification model based on an answer to the second clarification question. 8. The non-transitory computer-readable storage media of claim 7 , wherein the first classification model comprises a symbolic language model; and the second classification model comprises a statistical language model. 9. The non-transitory computer-readable storage media of claim 7 , wherein the first clarification question comprises a question asking the user to select an appropriate answer to an input statement. 10. The non-transitory computer-readable storage media of claim 7 , wherein the second clarification question comprises a question asking the user to select an appropriate restatement of an input statement, the appropriate restatement of the input being a leading restatement of the input statement. 11. The non-transitory computer-readable storage media of claim 7 , wherein the data set comprises disparate types of data. 12. The non-transitory computer-readable storage media of claim 7 , wherein the data set comprises imported data. 13. The non-transitory computer-readable storage media of claim 7 , wherein the data set comprises markup language data. 14. A system comprising: one or more processors; and a memory communicably coupled to the one or more processors and storing instructions that when executed by the one or more processors cause the system to: generate a first classification model based on a first indication that a first portion of a first data set is associated with a first bucket and a second classification model based on a second indication that the first portion of the first data set is associated with a second bucket using data comprising structured data, unstructured data, and partially structured data; compare, using a comparator, (1) a first score using test results from the first classification model based on a data set of natural language inputs from conversations of calls or chats and (2) a second score using test results from the second classification model based on the data set of natural language inputs from the conversations of calls or chats; when the first score agrees with the second score within a threshold range, validate, by the one or more processors, the first classification model; when the first score is different from the second score with respect to the threshold range, display, on a display of a user device, a clarification question for a user; when an indication is received from the user, in response to the clarification question, that the first classification model is correct, update the second classification model based on the indication; and when an indication is received from the user, in response to the clarification question, that the second classification model is correct, update the first classification model based on the indication. 15. The system of claim 14 , wherein the first classification model comprises a symbolic language model; and the second classification model comprises a statistical language model. 16. The system of claim 14 , wherein the clarification question comprises a question asking the user to select an appropriate answer to an input statement. 17. A system comprising: one or more processors; and a memory communicably coupled to the one or more processors and storing instructions that when executed by the one or more processors cause the system to: generate a first classification model based on a first indication that a first portion of a first data set is associated with a first bucket and a second classification model based on a second indication that the first portion of the first data set is associated with a second bucket using data comprising structured data, unstructured data, an

Assignees

Inventors

Classifications

  • based on feedback of a supervisor · CPC title

  • G06F16/358Primary

    Browsing; Visualisation therefor · CPC title

  • Interactive pattern learning with a human teacher · CPC title

  • G06K9/6263Primary

    Physics · mapped topic

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11537820B2 cover?
Data having some similarities and some dissimilarities may be clustered or grouped according to the similarities and dissimilarities. The data may be clustered using agglomerative clustering techniques. The clusters may be used as suggestions for generating groups where a user may demonstrate certain criteria for grouping. The system may learn from the criteria and extrapolate the groupings to …
Who is the assignee on this patent?
Verint Americas Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/358. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 27 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).