Machine learning for training NLP agent

US12014142B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12014142-B2
Application numberUS-202117354825-A
CountryUS
Kind codeB2
Filing dateJun 22, 2021
Priority dateJun 22, 2021
Publication dateJun 18, 2024
Grant dateJun 18, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented process for training a natural language processing (NLP) agent having a reinforced learning model includes the following operations. A type of document from a document corpus is identified using metadata particularly associated with the document. The NLP agent tokenizes the document to generate a plurality of tokens. Using a schema identified from the type of the document, one of the plurality of tokens is compared to a system of record (SOR) field from the schema. A similarity score between the one of the plurality of tokens with a correct value and a reward based upon the similarity score are generated. A determination is made that an optimum minimum average similarity rate has not been obtained. Based upon the determination, the reinforced learning model is trained using a loss function that includes the reward.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for training a natural language processing (NLP) agent having a reinforced learning model, comprising: identifying, using metadata particularly associated with a document from a document corpus, a type of the document; tokenizing, by the NLP agent, the document to generate a plurality of tokens; comparing one of the plurality of tokens to a system of record (SOR) field from a schema identified from the type of the document; generating a similarity score between the one of the plurality of tokens with a correct value and a reward based upon the similarity score; determining that an optimum minimum average similarity rate has not been obtained; and training, based upon the determining, the reinforced learning model using a loss function that includes the reward. 2. The method of claim 1 , wherein the comparing and the generating are performed for all of a plurality of SOR fields in the schema. 3. The method of claim 2 , wherein the determining is based upon similarity scores for all of the plurality of SOR fields. 4. The method of claim 1 , wherein the reinforced learning model is a Deep Q Network. 5. The method of claim 1 , wherein the similarity score use a Levenshtein distance. 6. The method of claim 1 , wherein after a determination is made that optimum minimum average similarity rate has been obtained, the training is repeated using another document from the document corpus. 7. The method of claim 1 , wherein the tokenizing includes identifying a respective location for each of the plurality of tokens. 8. The method of claim 1 , wherein the NLP agent includes a machine learning engine. 9. A computer hardware system for training a natural language processing (NLP) agent having a reinforced learning model, comprising: a hardware processor configured to perform the following executable operations: identifying, using metadata particularly associated with a document from a document corpus, a type of the document; tokenizing, by the NLP agent, the document to generate a plurality of tokens; comparing one of the plurality of tokens to a system of record (SOR) field from a schema identified from the type of the document; generating a similarity score between the one of the plurality of tokens with a correct value and a reward based upon the similarity score; determining that an optimum minimum average similarity rate has not been obtained; and training, based upon the determining, the reinforced learning model using a loss function that includes the reward. 10. The system of claim 9 , wherein the comparing and the generating are performed for all of a plurality of SOR fields in the schema. 11. The system of claim 10 , wherein the determining is based upon similarity scores for all of the plurality of SOR fields. 12. The system of claim 9 , wherein the reinforced learning model is a Deep Q Network. 13. The system of claim 9 , wherein the similarity score use a Levenshtein distance. 14. The system of claim 9 , wherein after a determination is made that optimum minimum average similarity rate has been obtained, the training is repeated using another document from the document corpus. 15. The system of claim 9 , wherein the tokenizing includes identifying a respective location for each of the plurality of tokens. 16. The system of claim 9 , wherein the NLP agent includes a machine learning engine. 17. A computer program product, comprising: a computer readable storage medium having stored therein program code for training a natural language processing (NLP) agent having a reinforced learning model, the program code, which when executed by a computer hardware system, cause the computer hardware system to perform: identifying, using metadata particularly associated with a document from a document corpus, a type of the document; tokenizing, by the NLP agent, the document to generate a plurality of tokens; comparing one of the plurality of tokens to a system of record (SOR) field from a schema identified from the type of the document; generating a similarity score between the one of the plurality of tokens with a correct value and a reward based upon the similarity score; determining that an optimum minimum average similarity rate has not been obtained; and training, based upon the determining, the reinforced learning model using a loss function that includes the reward. 18. The computer program product of claim 17 , wherein the comparing and the generating are performed for all of a plurality of SOR fields in the schema; and the determining is based upon similarity scores for all of the plurality of SOR fields. 19. The computer program product of claim 17 , wherein the reinforced learning model is a Deep Q Network, and the NLP agent includes a machine learning engine. 20. The computer program product of claim 17 , wherein after a determination is made that optimum minimum average similarity rate has been obtained, the training is repeated using another document from the document corpus.

Assignees

Inventors

Classifications

  • Feedforward networks · CPC title

  • Reinforcement learning · CPC title

  • G06F18/22Primary

    Matching criteria, e.g. proximity measures · CPC title

  • Machine learning · CPC title

  • Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12014142B2 cover?
A computer-implemented process for training a natural language processing (NLP) agent having a reinforced learning model includes the following operations. A type of document from a document corpus is identified using metadata particularly associated with the document. The NLP agent tokenizes the document to generate a plurality of tokens. Using a schema identified from the type of the document…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F18/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 18 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).