Relation extraction from text using machine learning

US11625573B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11625573-B2
Application numberUS-201816173534-A
CountryUS
Kind codeB2
Filing dateOct 29, 2018
Priority dateOct 29, 2018
Publication dateApr 11, 2023
Grant dateApr 11, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A first neural network is operated on a processor and a memory to encode a first natural language string into a first sentence encoding including a set of word encodings. Using a word-based attention mechanism with a context vector, a weight value for a word encoding within the first sentence encoding is adjusted to form an adjusted first sentence encoding. Using a sentence-based attention mechanism, a first relationship encoding corresponding to the adjusted first sentence encoding is determined. An absolute difference between the first relationship encoding and a second relationship encoding is computed. Using a multi-layer perceptron, a degree of analogical similarity between the first relationship encoding and a second relationship encoding is determined.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: operating a first neural network on a processor and a memory to encode a first natural language string into a first sentence encoding comprising a set of word encodings; adjusting, using a word-based attention mechanism with a context vector, a weight value for a word encoding within the first sentence encoding to form an adjusted first sentence encoding, a sentence layer of the first neural network comprising the word-based attention mechanism with the context vector, the word-based attention mechanism trained to form the adjusted first sentence encoding by adjusting a plurality of weights within the sentence layer; generating, using a sentence-based attention mechanism, a first relationship encoding corresponding to the adjusted first sentence encoding, the sentence-based attention mechanism further adjusting a plurality of adjusted sentence encodings output from the word-based attention mechanism, the first relationship encoding comprising a vector representation of a relationship between entities, the relationship expressed in the first natural language string, a relation layer of the first neural network comprising the sentence-based attention mechanism, the sentence-based attention mechanism trained to generate the first relationship encoding by adjusting a plurality of weights within the relation layer; computing an absolute difference between the first relationship encoding and a second relationship encoding; and determining, using a multi-layer perceptron, a degree of analogical similarity between the first relationship encoding and the second relationship encoding. 2. The method of claim 1 , further comprising: operating a second neural network on a processor and a memory to encode a second natural language string into a second sentence encoding comprising a second set of word encodings; adjusting, using a second word-based attention mechanism with a second context vector, a weight value for a word encoding within the second sentence encoding to form an adjusted second sentence encoding; and determining, using a second sentence-based attention mechanism, the second relationship encoding corresponding to the adjusted second sentence encoding. 3. The method of claim 2 , wherein the first neural network and the second neural network are identically structured. 4. The method of claim 2 , wherein the word-based attention mechanism with the context vector and the second word-based attention mechanism with the second context vector are identically structured. 5. The method of claim 2 , wherein the sentence-based attention mechanism and the second sentence-based attention mechanism are identically structured. 6. The method of claim 2 , further comprising: determining, using an output unit including a sigmoid activation function, that the first relationship encoding and the second relationship encoding correspond to an analogous relationship. 7. The method of claim 2 , further comprising: determining, using an output unit including a sigmoid activation function, that the first relationship encoding and the second relationship encoding do not correspond to an analogous relationship. 8. The method of claim 2 , further comprising: training, using a set of pairs of natural language strings, wherein each natural language string in the set of pairs of natural language strings expresses a relationship between entities included in the natural language string, the first neural network and the second neural network. 9. The method of claim 8 , further comprising: generating a set of relation pairs, wherein each relation pair in the set of relation pairs comprises a pair of entities and a relationship relating the pair of entities; generating a set of positive example pairs, wherein each positive example pair comprises two relation pairs, a relationship of each relation pair in the set of positive example pairs being equivalent to each other; generating a set of negative example pairs, wherein each negative example pair comprises two relation pairs, a relationship of each relation pair in the set of negative example pairs not being equivalent to each other; combining, forming a training set of example pairs, the set of positive example pairs and the set of negative example pairs; and converting, by extracting from a text corpus a natural language string expressing a relationship between entities included in the natural language string, the training set of example pairs to a training set of pairs of natural language strings. 10. A computer usable program product comprising one or more computer-readable storage media, and program instructions stored on at least one of the one or more computer-readable storage media, the stored program instructions comprising: program instructions to operate a first neural network on a processor and a memory to encode a first natural language string into a first sentence encoding comprising a set of word encodings; program instructions to adjust, using a word-based attention mechanism with a context vector, a weight value for a word encoding within the first sentence encoding to form an adjusted first sentence encoding, a sentence layer of the first neural network comprising the word-based attention mechanism with the context vector, the word-based attention mechanism trained to form the adjusted first sentence encoding by adjusting a plurality of weights within the sentence layer; program instructions to generate, using a sentence-based attention mechanism, a first relationship encoding corresponding to the adjusted first sentence encoding, the sentence-based attention mechanism further adjusting a plurality of adjusted sentence encodings output from the word-based attention mechanism, the first relationship encoding comprising a vector representation of a relationship between entities, the relationship expressed in the first natural language string, a relation layer of the first neural network comprising the sentence-based attention mechanism, the sentence-based attention mechanism trained to generate the first relationship encoding by adjusting a plurality of weights within the relation layer; program instructions to compute an absolute difference between the first relationship encoding and a second relationship encoding; and program instructions to determine, using a multi-layer perceptron, a degree of analogical similarity between the first relationship encoding and the second relationship encoding. 11. The computer usable program product of claim 10 , further comprising: program instructions to operate a second neural network on a processor and a memory to encode a second natural language string into a second sentence encoding comprising a second set of word encodings; program instructions to adjust, using a second word-based attention mechanism with a second context vector, a weight value for a word encoding within the second sentence encoding to form an adjusted second sentence encoding; and program instructions to determine, using a second sentence-based attention mechanism, the second relationship encoding corresponding to the adjusted second sentence encoding. 12. The computer usable program product of claim 11 , wherein the first neural network and the second neural network are identically structured. 13. The computer usable program product of claim 11 , wherein the word-based attention mechanism with the context vector and the second word-based attention mechanism with the second context vector are identically structured. 14. The computer usable program product of claim 11 , wherein the sentence-based attention mechanism and the second s

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Semantic analysis · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11625573B2 cover?
A first neural network is operated on a processor and a memory to encode a first natural language string into a first sentence encoding including a set of word encodings. Using a word-based attention mechanism with a context vector, a weight value for a word encoding within the first sentence encoding is adjusted to form an adjusted first sentence encoding. Using a sentence-based attention mech…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 11 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).