Weak supervised abnormal entity detection

US11423227B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11423227-B2
Application numberUS-202016789804-A
CountryUS
Kind codeB2
Filing dateFeb 13, 2020
Priority dateFeb 13, 2020
Publication dateAug 23, 2022
Grant dateAug 23, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A mechanism is provided to implement an abnormal entity detection mechanism that facilitates detecting abnormal entities in real-time response systems through weak supervision. For each first intent from an entity labeled workspace that matches a second intent in labeled chat logs, when the entity score associated with each first entity or second entity is above a predefined significance level the first entity or the second entity is recorded. For each first intent from the entity labeled workspace that matches the second intent in the labeled chat logs: responsive to the first entity being recorded and the second entity failing to be recorded, that first entity is removed from the training data as being mistakenly included; or, responsive to the second entity being recorded and the first entity failing to be recorded, that second entity is added as a potential business case to the training data.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, in a data processing system, for comprising at least one processor and at least one memory, wherein the at least one memory comprises instructions that are executed by the at least one processor to configure the at least one processor to implement an abnormal entity detection mechanism that facilitates detecting abnormal entities in real-time response systems through weak supervision, the method comprising: for each first intent of one or more first intents from an entity labeled workspace, identifying one or more first entities associated with the first intent and an entity score associated with each first entity; for each second intent of one or more second intents from labeled chat logs, identifying one or more second entities associated with the second intent and an entity score associated with each second entity; for each first intent from the entity labeled workspace that matches a second intent in the labeled chat logs, recording the first entity or the second entity in a results data structure when the entity score associated with each first entity or second entity is above a predefined significance level; and for each first intent from the entity labeled workspace that matches the second intent in the labeled chat logs: responsive to the first entity being recorded in a results data structure and the second entity failing to be recorded in the results data structure, removing that the first entity from the training data as being mistakenly included in the training data; or responsive to the second entity being recorded in the results data structure and the first entity failing to be recorded in the results data structure, adding the second entity as a potential business case to the training data. 2. The method of claim 1 , wherein the one or more first entities associated with the first intent in the entity labeled workspace are identified from intent data utilizing weak entity labeling through natural language processing. 3. The method of claim 1 , wherein each second intent is identified from human conversation chat logs utilizing weak intent labeling through natural language processing and wherein the one or more second entities associated with each second intent are identified from human conversation chat logs utilizing weak entity labeling through natural language processing. 4. The method of claim 1 , wherein the entity score associated with each first entity associated with each first intent is generated through correlation analysis of an intent of a sentence identified by a customer to entities identified from the sentence associated with the identified intent. 5. The method of claim 1 , wherein the entity score associated with each second entity associated with each second intent is generated through correlation analysis of a predicted intent of a sentence identified to entities identified from the sentence associated with the predicted intent. 6. The method of claim 1 , further comprising: performing a pairwise semantic evaluation by creating one or more (unigram/bigram)/entity pairs of each of one or more unigrams and bigrams to each first entity in the one or more first entities associated with the first intent; generating a first set of phrase embedding vectors for each unigram/bigram and a second set of phrase embedding vectors for each first entity; determining a similarity score based on a cosine distance between each phrase embedding vector for each unigram/bigram and each phrase embedding vector for each first entity; and responsive to none of the one or more (unigram/bigram)/entity pairs having similarity score greater than a predetermined similarity score, redefining the first entity in the training data. 7. The method of claim 6 , wherein the one or more unigrams and bigrams are identified by: for each sentence on intent data, performing natural language processing on the sentence to identify the one or more unigrams or bigrams of the sentence as being associated with the intent identified for the sentence thereby forming a semantic labeled workspace; performing a comparison of intents identified in labeled chat logs to the intents identified in the semantic labeled workspace; and recording those one or more unigrams and bigrams associated with the intents in the labeled chat logs that match intents in the semantic labeled workspace. 8. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a data processing system, causes the data processing system to implement an abnormal entity detection mechanism that facilitates detecting abnormal entities in real-time response systems through weak supervision, and further causes the data processing system to: for each first intent of one or more first intents from an entity labeled workspace, identify one or more first entities associated with the first intent and an entity score associated with each first entity; for each second intent of one or more second intents from labeled chat logs, identify one or more second entities associated with the second intent and an entity score associated with each second entity; for each first intent from the entity labeled workspace that matches a second intent in the labeled chat logs, record the first entity or the second entity in a results data structure when the entity score associated with each first entity or second entity is above a predefined significance level; and for each first intent from the entity labeled workspace that matches the second intent in the labeled chat logs: responsive to the first entity being recorded in a results data structure and the second entity failing to be recorded in the results data structure, remove that the first entity from the training data as being mistakenly included in the training data; or responsive to the second entity being recorded in the results data structure and the first entity failing to be recorded in the results data structure, add the second entity as a potential business case to the training data. 9. The computer program product of claim 8 , wherein the one or more first entities associated with the first intent in the entity labeled workspace are identified from intent data utilizing weak entity labeling through natural language processing. 10. The computer program product of claim 8 , wherein each second intent is identified from human conversation chat logs utilizing weak intent labeling through natural language processing and wherein the one or more second entities associated with each second intent are identified from human conversation chat logs utilizing weak entity labeling through natural language processing. 11. The computer program product of claim 8 , wherein the entity score associated with each first entity associated with each first intent is generated through correlation analysis of an intent of a sentence identified by a customer to entities identified from the sentence associated with the identified intent. 12. The computer program product of claim 8 , wherein the entity score associated with each second entity associated with each second intent is generated through correlation analysis of a predicted intent of a sentence identified to entities identified from the sentence associated with the predicted intent. 13. The computer program product of claim 8 , wherein the computer readable program further causes the data processing system to: perform a pairwise semantic evaluation by creating one or more (unigram/bigram)/entity pairs of each of one or more unigrams and bigrams to each first entity in the one or

Assignees

Inventors

Classifications

  • using automatic reactions or user delegation, e.g. automatic replies or chatbot-generated messages · CPC title

  • Ensemble learning · CPC title

  • Machine learning · CPC title

  • Discourse or dialogue representation · CPC title

  • G06F40/295Primary

    Named entity recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11423227B2 cover?
A mechanism is provided to implement an abnormal entity detection mechanism that facilitates detecting abnormal entities in real-time response systems through weak supervision. For each first intent from an entity labeled workspace that matches a second intent in labeled chat logs, when the entity score associated with each first entity or second entity is above a predefined significance level …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F40/295. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 23 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).