Method and apparatus for sequence labeling on entity text, and non-transitory computer-readable recording medium

US11907661B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11907661-B2
Application number  US-202117455967-A
Country             US
Kind code           B2
Filing date         Nov 22, 2021
Priority date       Nov 25, 2020
Publication date    Feb 20, 2024
Grant date          Feb 20, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and an apparatus for sequence labeling on an entity text, and a non-transitory computer-readable recording medium are provided. In the method, a start position of an entity text within a target text is determined. Then, a first matrix is generated based on the start position of the entity text. Elements in the first matrix indicate focusable weights of each word with respect to other words in the target text. Then, a named entity recognition model is generated using the first matrix. The named entity recognition model is obtained by training using first training data, the first training data includes word embeddings corresponding to respective texts in a training text set, and the texts are texts whose entity label has been labeled. Then, the target text is input to the named entity recognition model, and a probability distribution of the entity label is output.
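
To make the "first matrix" concrete, here is a minimal Python sketch of one way such a matrix could be filled in, following the rule stated in claim 4 (for a word inside an entity, the entries from the entity start up to the word itself are non-zero and the rest of the row is zero). The function name `build_span_mask`, the `(start, end)` span format, the weight value 1.0, and the choice to let words outside any entity keep only a self-weight are all assumptions for illustration; the abstract says only that the matrix is generated from the start position.

```python
def build_span_mask(seq_len, spans, weight=1.0):
    """Return a seq_len x seq_len list-of-lists of focusable weights.

    spans is a list of hypothetical (start, end) index pairs, 0-based and
    inclusive, marking where each entity text sits in the target text.
    """
    mask = [[0.0] * seq_len for _ in range(seq_len)]

    # Assumption (not stated in the source): a word outside every entity
    # keeps a non-zero weight only with respect to itself.
    for a in range(seq_len):
        mask[a][a] = weight

    for start, end in spans:
        if not (0 <= start <= end < seq_len):
            raise ValueError(f"span ({start}, {end}) is outside the text")
        # Row a: non-zero from the entity start up to and including word a,
        # i.e. the word itself plus the entity words on its left.
        for a in range(start, end + 1):
            for j in range(start, a + 1):
                mask[a][j] = weight

    return mask


tokens = ["Alice", "visited", "New", "York", "City"]
# Hypothetical entity spans: "Alice" and "New York City".
mask = build_span_mask(len(tokens), [(0, 0), (2, 4)])
for token, row in zip(tokens, mask):
    print(f"{token:8s} {row}")
```

Each row of the result has as many entries as the text has words, which matches the claim that the matrix is square with side length equal to the sequence length.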

First claim

Opening claim text (preview).

What is claimed is:

1. A method for sequence labeling on an entity text, the method comprising:
    determining a start position of an entity text within a target text, the target text being a text having an entity label to be recognized;
    generating a first matrix based on the start position of the entity text within the target text, the number of rows and the number of columns of the first matrix being equal to a sequence length of the target text, elements in the first matrix indicating focusable weights of each word with respect to other words in the target text, the focusable weight of a word in the target text that is within the entity text with respect to a first word being greater than the focusable weight of the word with respect to a second word, the first word including the word, and one or more words between the start position of the entity text and the word, and the second word being a word other than the first word in the target text;
    generating a named entity recognition model using the first matrix, the named entity recognition model being obtained by training using first training data, the first training data including word embeddings corresponding to one or more respective texts in a training text set, and the texts in the training text set being texts whose entity label has been labeled; and
    inputting the target text to the named entity recognition model, and outputting probability distribution of the entity label corresponding to the target text,
    wherein determining the start position of the entity text within the target text includes
    performing at least one of data format conversion and data noise removal on the target text to obtain a first text;
    segmenting words of the first text to obtain a first word sequence;
    segmenting word segments of the first word sequence to divide the first word sequence into a first word segment sequence; and
    inputting the first word segment sequence to a second training model to obtain the start position of the entity text.

2. The method for sequence labeling on an entity text as claimed in claim 1, wherein segmenting the word segments of the first word sequence includes constructing a first initial model, and training the first initial model using second training data to obtain a first segmentation model, the second training data including a plurality of sets of data, and each set of the data including a word sequence and a word segment sequence obtained by segmenting the word sequence; and segmenting the word segments of the first word sequence using the first segmentation model.

3. The method for sequence labeling on an entity text as claimed in claim 1, the method further comprising: obtaining the second training model, wherein obtaining the second training model includes constructing a second initial model, and training the second initial model using third training data to obtain the second training model, the third training data including a plurality of word segment sequences whose start position of entity text has been labeled.

4. The method for sequence labeling on an entity text as claimed in claim 1, wherein the element in the i-th row and the j-th column in the first matrix indicates the focusable weight of the i-th word with respect to the j-th word in the target text, and wherein the elements in the a-th row in the first matrix corresponds to the a-th word in the target text, and if the a-th word and the k words on the left of the a-th word are within the entity text, values of the a-k-th element to the a-th element in the a-th row are set to a non-zero value, and values of other elements in the a-th row are set to zero, where a, i, j and k are positive integers.

5. The method for sequence labeling on an entity text as claimed in claim 1, wherein the named entity recognition model is a self-attention model, and the named entity recognition model includes an attention layer, a feed forward neural network layer, and a span focus layer, a linear layer, and a softmax layer, wherein input of the attention layer is word embeddings corresponding to the target text, and output of the attention layer is input of the feed forward neural network layer, wherein output of the feed forward neural network layer is input of the span focus layer, and output of the span focus layer is

$$\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\left(\frac{Q K^{T}}{\sqrt{d_{k1}}}\right) * \mathrm{Mask}_{span} * V$$

where Q = W_Q * H1, K = W_K * H1, V = W_V * H1, H1 is the output of the feed forward neural network layer, W_Q, W_K and W_V are trainable parameter matrices, Mask_span is the first matrix, and d_k1 is a dimension of an H1 matrix, wherein the linear layer converts a dimension of the output of the span focus layer, and wherein the softmax layer normalizes an output result.

6. An apparatus for sequence labeling on an entity text, the apparatus comprising:
    a memory storing computer-executable instructions; and
    one or more processors configured to execute the computer-executable instructions such that the one or more processors are configured to
    determine a start position of an entity text within a target text, the target text being a text having an entity label to be recognized;
    generate a first matrix based on the start position of the entity text within the target text, the number of rows and the number of columns of the first matrix being equal to a sequence length of the target text, elements in the first matrix indicating focusable weights of each word with respect to other words in the target text, the focusable weight of a word in the target text that is within the entity text with respect to a first word being greater than the focusable weight of the word with respect to a second word, the first word including the word, and one or more words between the start position of the entity text and the word, and the second word being a word other than the first word in the target text;
    generate a named entity recognition model using the first matrix, the named entity recognition model being obtained by training using first training data, the first training data including word embeddings corresponding to one or more respective texts in a training text set, and the texts in the training text set being texts whose entity label has been labeled; and
    input the target text to the named entity recognition model, and output probability distribution of the entity label corresponding to the target text,
    wherein determining the start position of the entity …
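
The span focus layer formula in claim 5 can be read as ordinary scaled dot-product attention whose softmax weights are then multiplied by the first matrix. The sketch below is one possible reading in plain Python, not the patented implementation. It assumes that H1 holds one row per word (so the projections are computed as H1 times each weight matrix), that the first `*` is an element-wise product with Mask_span and the second is a matrix product with V, and that d_k1 is the width of H1. The helper names `matmul`, `softmax_rows`, and `span_focus` are hypothetical.

```python
import math


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def softmax_rows(m):
    """Apply a numerically stable softmax to each row."""
    out = []
    for row in m:
        peak = max(row)
        exps = [math.exp(x - peak) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def span_focus(h1, w_q, w_k, w_v, mask_span):
    """softmax(Q K^T / sqrt(d_k1)) * Mask_span * V, under the stated assumptions."""
    q, k, v = matmul(h1, w_q), matmul(h1, w_k), matmul(h1, w_v)
    d_k1 = len(h1[0])  # assumption: the width of H1
    k_t = [list(col) for col in zip(*k)]
    scores = [[s / math.sqrt(d_k1) for s in row] for row in matmul(q, k_t)]
    weights = softmax_rows(scores)
    # Element-wise product: zero entries of the mask remove the
    # corresponding attention weights.
    focused = [[w * m for w, m in zip(w_row, m_row)]
               for w_row, m_row in zip(weights, mask_span)]
    return matmul(focused, v)


# Toy input: 3 words, hidden size 2, identity projections (illustrative only).
identity = [[1.0, 0.0], [0.0, 1.0]]
h1 = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Words 0 and 1 form one entity; word 2 stands alone.
mask_span = [[1.0, 0.0, 0.0],
             [1.0, 1.0, 0.0],
             [0.0, 0.0, 1.0]]
out = span_focus(h1, identity, identity, identity, mask_span)
for row in out:
    print(row)
```

Because the mask is applied after the softmax in the formula as written, the surviving weights in a row generally no longer sum to one. Whether the model renormalizes them is not stated in the claim, so this sketch leaves them as they are.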

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11907661B2 cover?
A method and an apparatus for sequence labeling on an entity text, and a non-transitory computer-readable recording medium are provided. In the method, a start position of an entity text within a target text is determined. Then, a first matrix is generated based on the start position of the entity text. Elements in the first matrix indicate focusable weights of each word with respect to other …
Who is the assignee on this patent?
Tong Yixuan, Zhang Yongwei, Dong Bin, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06F40/295. Mapped technology areas include Physics.
When was this patent published?
Publication date Feb 20, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).