What technology area does this patent fall under?

Primary CPC classification G06N3/084. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Apr 17 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Semi-supervised learning of word embeddings

US9947314B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9947314-B2
Application number	US-201715437490-A
Country	US
Kind code	B2
Filing date	Feb 21, 2017
Priority date	May 8, 2015
Publication date	Apr 17, 2018
Grant date	Apr 17, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Software that trains an artificial neural network for generating vector representations for natural language text, by performing the following steps: (i) receiving, by one or more processors, a set of natural language text; (ii) generating, by one or more processors, a set of first metadata for the set of natural language text, where the first metadata is generated using supervised learning method(s); (iii) generating, by one or more processors, a set of second metadata for the set of natural language text, where the second metadata is generated using unsupervised learning method(s); and (iv) training, by one or more processors, an artificial neural network adapted to generate vector representations for natural language text, where the training is based, at least in part, on the received natural language text, the generated set of first metadata, and the generated set of second metadata.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving, by one or more processors, a set of natural language text; generating, by one or more processors, a set of first metadata for the set of natural language text, where the first metadata is generated using supervised learning method(s); generating, by one or more processors, a set of second metadata for the set of natural language text, where the second metadata is generated using unsupervised learning method(s); training, by one or more processors, an artificial neural network adapted to generate vector representations for natural language text, where the training is based, at least in part, on the received natural language text, the generated set of first metadata, and the generated set of second metadata generating, by one or more processors, a set of at least two vector representations for the set of natural language text using the trained artificial neural network, where each vector representation of the set of at least two vector representations pertains to a respective subset of natural language text from the set of natural language text; generating, by one or more processors, a set of third metadata for the generated set of at least two vector representations, where the third metadata is generated using supervised learning method(s); generating, by one or more processors, a set of fourth metadata for the set of at least two vector representations, where the fourth metadata is generated using unsupervised learning method(s); training, by one or more processors, the artificial neural network based, at least in part, on the generated set of at least two vector representations, the generated set of third metadata for the set of at least two vector representations, and the generated set of fourth metadata for the set of at least two vector representations; and storing, by one or more processors, one or more vector representations generated using the trained artificial neural network for use by a natural language processing system. 2. The method of claim 1 , further comprising: determining, by one or more processors, an amount of similarity between at least two subsets of natural language text from the set of natural text by comparing their respectively generated vector representations. 3. The method of claim 2 , wherein each of the at least two subsets of natural language text is a word. 4. The method of claim 1 , further comprising: generating, by one or more processors, a vector representation pertaining to the set of natural language text by adding each of the vector representations in the generated set of at least two vector representations. 5. The method of claim 1 , further comprising: generating, by one or more processors, a set of initial vector representations for the set of natural language text; generating, by one or more processors, a set of first metadata vector representations for the generated set of first metadata; and generating, by one or more processors, a set of second metadata vector representations for the generated set of second metadata; wherein the training of the artificial neural network is further based, at least in part, on the generated set of initial vector representations, the generated set of first metadata vector representations, and the generated set of second metadata vector representations. 6. The method of claim 1 , wherein the supervised learning methods utilize a natural language processing annotator. 7. The method of claim 1 , wherein the unsupervised learning methods are based on reconstruction error. 8. A computer program product comprising a computer readable storage medium having stored thereon: program instructions programmed to receive a set of natural language text; program instructions programmed to generate a set of first metadata for the set of natural language text, where the first metadata is generated using supervised learning method(s); program instructions programmed to generate a set of second metadata for the set of natural language text, where the second metadata is generated using unsupervised learning method(s); program instructions programmed to train an artificial neural network adapted to generate vector representations for natural language text, where the training is based, at least in part, on the received natural language text, the generated set of first metadata, and the generated set of second metadata program instructions programmed to generate a set of at least two vector representations for the set of natural language text using the trained artificial neural network, where each vector representation of the set of at least two vector representations pertains to a respective subset of natural language text from the set of natural language text; program instructions programmed to generate a set of third metadata for the generated set of at least two vector representations, where the third metadata is generated using supervised learning method(s); program instructions programmed to generate a set of fourth metadata for the set of at least two vector representations, where the fourth metadata is generated using unsupervised learning method(s); program instructions programmed to train the artificial neural network based, at least in part, on the generated set of at least two vector representations, the generated set of third metadata for the set of at least two vector representations, and the generated set of fourth metadata for the set of at least two vector representations; and program instructions programmed to store one or more vector representations generated using the trained artificial neural network for use by a natural language processing system. 9. The computer program product of claim 8 , further comprising: program instructions programmed to determine an amount of similarity between at least two subsets of natural language text from the set of natural text by comparing their respectively generated vector representations. 10. The computer program product of claim 9 , wherein each of the at least two subsets of natural language text is a word. 11. The computer program product of claim 8 , further comprising: program instructions programmed to generate a vector representation pertaining to the set of natural language text by adding each of the vector representations in the generated set of at least two vector representations. 12. The computer program product of claim 8 , further comprising: program instructions programmed to generate a set of initial vector representations for the set of natural language text; program instructions programmed to generate a set of first metadata vector representations for the generated set of first metadata; and program instructions programmed to generate a set of second metadata vector representations for the generated set of second metadata; wherein the training of the artificial neural network is further based, at least in part, on the generated set of initial vector representations, the generated set of first metadata vector representations, and the generated set of second metadata vector representations. 13. The computer program product of claim 8 , wherein the supervised learning methods utilize a natural language processing annotator. 14. The computer program product of claim 8 , wherein the unsupervised learning methods are based on reconstruction error. 15. A computer system comprising: a processor(s) set; and a computer readable storage medium; wherein: the processor set is structured, located, connected and/or programmed to run program instructions stored on the computer readable storage medium; and th

Assignees

Inventors

Classifications

G06N3/084Primary
Backpropagation, e.g. using gradient descent · CPC title
G06N20/00
Machine learning · CPC title
G10L15/063Primary
Training · CPC title
G06F40/169
Annotation, e.g. comment data or footnotes · CPC title
G06N3/088
Non-supervised learning, e.g. competitive learning · CPC title

Patent family

Related publications grouped by family.

View patent family 57221919

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9947314B2 cover?: Software that trains an artificial neural network for generating vector representations for natural language text, by performing the following steps: (i) receiving, by one or more processors, a set of natural language text; (ii) generating, by one or more processors, a set of first metadata for the set of natural language text, where the first metadata is generated using supervised learning met…
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Apr 17 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).