Evaluating text classification anomalies predicted by a text classification model

US11537821B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11537821-B2
Application numberUS-201916380981-A
CountryUS
Kind codeB2
Filing dateApr 10, 2019
Priority dateApr 10, 2019
Publication dateDec 27, 2022
Grant dateDec 27, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In response to running at least one testing phrase on a previously trained text classifier and identifying a separate predicted classification label based on a score calculated for each respective at least one testing phrase, a text classifier decomposes extracted features summed in the score into word-level scores for each word in the at least one testing phrase. The text classifier assigns a separate heatmap value to each of the word-level scores, each respective separate heatmap value reflecting a weight of each word-level score. The text classifier outputs the separate predicted classification label and each separate heatmap value reflecting the weight of each word-level score for defining a heatmap identifying the contribution of each word in the at least one testing phrase to the separate predicted classification label for facilitating client evaluation of text classification anomalies.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: running, by a computer system, a testing phrase on a previously trained text classifier, wherein the testing phrase comprises one or more words; identifying, by a computer system, a separate predicted classification label based on an intent score calculated for the testing phrase by the text classifier; decomposing, by a computer system, a plurality of extracted features summed in the score into a plurality of word-level scores for each word in the testing phrase; assigning, by the computer system, a separate heatmap value to each of the plurality of word-level scores, each respective separate heatmap value reflecting a weight of each word-level score of the plurality of word-level scores; and outputting, by the computer system, the separate predicted classification label and each separate heatmap value reflecting the weight of each respective word-level score of the plurality of word-level scores, for defining a heatmap identifying the contribution of each word in the testing phrase to the separate predicted classification label. 2. The method according to claim 1 , further comprising: summing, by the computer system, for the separate predicted classification label, the plurality of word-level scores by word in response to running the testing phrase; identifying, by the computer system, a list of a top selection of words from among a plurality of words in descending order from a highest summed score by word; and outputting, by the computer system, the separate predicted classification label, each separate heatmap value, and the list of the top selection of words. 3. The method according to claim 1 , further comprising: calculating, by the computer system, the score for the separate predicted classification label based on a weighted sum of a plurality of combinations of a separate extracted feature of the plurality of features and a weighted model parameter fixed in the previously trained text classifier. 4. The method according to claim 1 , wherein decomposing, by the computer system, the plurality of extracted features summed in the score into the plurality of word-level scores for each word in the testing phrase further comprises: decomposing, by the computer system, the plurality of extracted features, the plurality of extracted features comprising one or more of a unigram based feature, a term-based feature, an average pooling of word embedding features, a maximum pooling of word embedding features, and character level features. 5. The method according to claim 1 , further comprising; initiating, by the computer system, a text classifier model; training, by the computer system, the text classifier model by applying a training set of a plurality of training phrases; deploying, by the computer system, the text classifier model as the previously trained text classifier for testing by a client; and responsive to receiving the testing phrase from the client, running, by the computer system, the testing phrase on the previously trained text classifier. 6. The method according to claim 1 , wherein outputting, by the computer system, the separate predicted classification label and each separate heatmap value reflecting the weight of each word-level score of the plurality of word-level scores for providing a heatmap identifying the contribution of each word in the testing phrase to the separate predicted classification label further comprises: outputting, by the computer system, the separate predicted classification label and each separate heatmap value reflecting the weight of each word-level score of the plurality of word-level scores to a client, wherein the client outputs each separate heatmap value in a user interface for graphically representing the weight of each word-level score to identify the contribution of each word in the testing phrase to the separate predicted classification label. 7. The method according to claim 1 , wherein outputting, by the computer system, the separate predicted classification label and each separate heatmap value reflecting the weight of each word-level score of the plurality of word-level scores for providing a heatmap identifying the contribution of each word in the testing phrase to the separate predicted classification label further comprises: outputting, by the computer system, the separate predicted classification label and each separate heatmap value reflecting the weight of each word-level score of the plurality of word-level scores to a client, wherein the client determines whether each separate predicted classification label matches an expected classification label for client evaluation of text classification anomalies. 8. A computer system comprising one or more processors, one or more computer-readable memories, one or more computer-readable storage devices, and program instructions, stored on at least one of the one or more storage devices for execution by at least one of the one or more processors via at least one of the one or more memories, the stored program instructions comprising: program instructions to run a testing phrase on a previously trained text classifier, wherein the testing phrase comprises one or more words; program instructions to identify a separate predicted classification label based on an intent score calculated for the testing phrase by the text classifier; program instructions to decompose a plurality of extracted features summed in the score into a plurality of word-level scores for each word in the testing phrase; program instructions to assign a separate heatmap value to each of the plurality of word-level scores, each respective separate heatmap value reflecting a weight of each word-level score of the plurality of word-level scores; and program instructions to output the separate predicted classification label and each separate heatmap value reflecting the weight of each respective word-level score of the plurality of word-level scores, for defining a heatmap identifying the contribution of each word in the testing phrase to the separate predicted classification label. 9. The computer system according to claim 8 , the program instructions further comprising: program instructions to sum, for the separate predicted classification label, the plurality of word-level scores by word in response to running the testing phrase; program instructions to identify, a list of a top selection of words from among a plurality of words in descending order from a highest summed score by word; and program instructions to output the separate predicted classification label, each separate heatmap value, and the list of the top selection of words. 10. The computer system according to claim 8 , the program instructions further comprising: program instructions to calculate the score for the separate predicted classification label based on a weighted sum of a plurality of combinations of a separate extracted feature of the plurality of features and a weighted model parameter fixed in the previously trained text classifier. 11. The computer system according to claim 8 , wherein the program instructions to decompose a plurality of extracted features summed in the score into the plurality of word-level scores for each word in the testing phrase further comprise: program instructions to decompose the plurality of extracted features, the plurality of extracted features comprising one or more of a unigram based feature, a term-based feature, an average pooling of word embedding features, a maximum pooling of word embedding features, and character level features. 12. The computer system according to claim 8 , the program instructions further comprising: program

Assignees

Inventors

Classifications

  • G06F16/355Primary

    Creation or modification of classes or clusters · CPC title

  • based on specific statistical tests · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • using kernel methods, e.g. support vector machines [SVM] · CPC title

  • Lexical analysis, e.g. tokenisation or collocates · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11537821B2 cover?
In response to running at least one testing phrase on a previously trained text classifier and identifying a separate predicted classification label based on a score calculated for each respective at least one testing phrase, a text classifier decomposes extracted features summed in the score into word-level scores for each word in the at least one testing phrase. The text classifier assigns a …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/355. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 27 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).