Method and apparatus for named entity recognition, and non-transitory computer-readable recording medium

US12450437B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12450437-B2
Application numberUS-202318326292-A
CountryUS
Kind codeB2
Filing dateMay 31, 2023
Priority dateJun 2, 2022
Publication dateOct 21, 2025
Grant dateOct 21, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and an apparatus for named entity recognition, and a non-transitory computer-readable recording medium are provided. In the method, text elements are traversed according to a text span to obtain candidate entity words. Then, a class to which the candidate entity word belongs is recognized. The recognizing of the class includes generating a prompt template corresponding to the candidate entity word, and concatenating the text to be recognized and the prompt template to obtain a concatenated text; generating vector representations of the text elements in the concatenated text; generating the vector representation of the candidate entity word according to the vector representations of the text elements of each candidate entity word in the concatenated text, and the vector representation of the text element of the mask word; and classifying the vector representation of the candidate entity word to obtain the class of the candidate entity word.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for named entity recognition, the method comprising: traversing text elements in a text to be recognized according to an original text span to obtain a plurality of candidate entity words; and for each candidate entity word, recognizing a class to which the candidate entity word belongs, wherein the recognizing of the class includes generating a prompt template corresponding to the candidate entity word, and concatenating the text to be recognized and the prompt template to obtain a concatenated text, the prompt template being used to learn the class to which the candidate entity word belongs by prompt learning, and the prompt template including the candidate entity word, and an entity class replaced with a mask word, according to a span in the prompt template; generating vector representations of the text elements in the concatenated text; generating the vector representation of the candidate entity word according to the vector representations of the text elements of each candidate entity word in the concatenated text, and the vector representation of the text element of the mask word; and classifying the vector representation of the candidate entity word to obtain the class to which the candidate entity word belongs, wherein the generating of the vector representation of the candidate entity word includes performing a first integration process on the vector representations of the text elements of the candidate entity word in the text to be recognized to obtain a first span representation of the candidate entity word; performing the first integration process on the vector representations of the text elements of the candidate entity word in the prompt template to obtain a second span representation of the candidate entity word; and generating the vector representation of the candidate entity word according to the first span representation, the second span representation, and the vector representation of the text element of the mask word, wherein the generating of the vector representation of the candidate entity word further includes performing a second integration process on the first span representation and the second span representation to obtain a third span representation; and concatenating the third span representation and the vector representation of the text element of the mask word to obtain the vector representation of the candidate entity word, and wherein the original text span and the span in the prompt template are combined in the generating of the vector representation of the candidate entity word to obtain a final representation of the candidate entity word, and the final representation of the candidate entity word is provided to a neural network model. 2. The method for named entity recognition as claimed in claim 1 , wherein the generating of the vector representation of the candidate entity word includes obtaining a vector representation corresponding to a width value of a text span of the candidate entity word, and concatenating the third span representation and the vector representation corresponding to the width value of the text span of the candidate entity word to obtain a fourth span representation; and concatenating the fourth span representation and the vector representation of the text element of the mask word to obtain the vector representation of the candidate entity word. 3. The method for named entity recognition as claimed in claim 1 , wherein the concatenated text includes a start identifier, and the generating of the vector representation of the candidate entity word includes generating the vector representation of the candidate entity word according to the first span representation, the second span representation, the vector representation of the start identifier, and the vector representation of the text element of the mask word. 4. The method for named entity recognition as claimed in claim 3 , wherein the generating of the vector representation of the candidate entity word includes concatenating the third span representation, the vector representation of the start identifier, and the vector representation of the text element of the mask word to obtain the vector representation of the candidate entity word. 5. The method for named entity recognition as claimed in claim 3 , wherein the generating of the vector representation of the candidate entity word includes obtaining a vector representation corresponding to a width value of a text span of the candidate entity word, and concatenating the third span representation and the vector representation corresponding to the width value of the text span of the candidate entity word to obtain a fourth span representation; and concatenating the fourth span representation, the vector representation of the start identifier, and the vector representation of the text element of the mask word to obtain the vector representation of the candidate entity word. 6. The method for named entity recognition as claimed in claim 1 , wherein the first integration process includes any one of a max pooling process, an average pooling process, and concatenating of the vector representations of the first text element and the last text element in the candidate entity word, and the second integration process includes any one of a max pooling process and an average pooling process. 7. The method for named entity recognition as claimed in claim 1 , wherein the classifying of the vector representation of the candidate entity word includes inputting the vector representation of the candidate entity word into a softmax function to obtain at least one probability that the candidate entity word is mapped to different candidate classes, which is output by the softmax function; and selecting the candidate class with the highest probability serving as the class to which the candidate entity word belongs. 8. An apparatus for named entity recognition, the apparatus comprising: a memory storing computer-executable instructions; and one or more processors configured to execute the computer-executable instructions such that the one or more processors are configured to perform traversing text elements in a text to be recognized according to an original text span to obtain a plurality of candidate entity words; and for each candidate entity word, recognizing a class to which the candidate entity word belongs, wherein the recognizing of the class includes generating a prompt template corresponding to the candidate entity word, and concatenating the text to be recognized and the prompt template to obtain a concatenated text, the prompt template being used to learn the class to which the candidate entity word belongs by prompt learning, and the prompt template including the candidate entity word, and an entity class replaced with a mask word, according to a span in the prompt template; generating vector representations of the text elements in the concatenated text; generating the vector representation of the candidate entity word according to the vector representations of the text elements of each candidate entity word in the concatenated text, and the vector representation of the text element of the mask word; and classifying the vector representation of the candidate entity word to obtain the class to which the candidate entity word belongs, wherein the generating of the vector representation of the candidate entity word includes performing a first integration process on the vector representations of the text elements of the candidate entity word in the text to be recognized to obtain a first span representation of the candidate entity word; performing the first integration process on the vector representations of the text

Assignees

Inventors

Classifications

  • Processing or translation of natural language (natural language analysis G06F40/20; semantic analysis G06F40/30) · CPC title

  • G06F40/295Primary

    Named entity recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12450437B2 cover?
A method and an apparatus for named entity recognition, and a non-transitory computer-readable recording medium are provided. In the method, text elements are traversed according to a text span to obtain candidate entity words. Then, a class to which the candidate entity word belongs is recognized. The recognizing of the class includes generating a prompt template corresponding to the candidate…
Who is the assignee on this patent?
Zhang Yongwei, Dong Bin, Jiang Shanshan, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06F40/295. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 21 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).