Predictive neural network with sentiment data

US11238339B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11238339-B2
Application numberUS-201715667287-A
CountryUS
Kind codeB2
Filing dateAug 2, 2017
Priority dateAug 2, 2017
Publication dateFeb 1, 2022
Grant dateFeb 1, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A set of vectors may be obtained. The vectors may be multi-dimensional vectors that are associated with and describe tokens from a first set of tokens from a corpus of sources. The description may be based in part on the relationship of the token to at least a portion of the remainder of the corpus. A set of sentiment scores may be obtained. The sentiment scores in the set of sentiment scores may describe a sentiment associated with a corresponding token that is described by a vector from the set of vectors. The set of vectors and the set of sentiment scores may be input into a pattern-recognizer pathway in a first neural network. A probability value of a potential future event may then be generated by the first neural network. The probability value may be based on the set of vectors and the set of sentiment scores.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: obtaining a first set of entity vectors, wherein each entity vector in the first set of entity vectors is a multi-dimensional vector that is associated with and describes a token from a first set of tokens from a corpus of sources, wherein the description is based in part on a relationship of the token to at least a portion of the remainder of the corpus of sources, and wherein the corpus of sources contains at least one natural-language source; obtaining a first set of sentiment scores, wherein each sentiment score in the first set of sentiment scores describes a sentiment associated with a corresponding token that is described by a vector in the first set of entity vectors; inputting the first set of entity vectors and the first set of sentiment scores into a first pattern-recognizer pathway in a first neural network; and generating, by the first neural network and based on the first set of entity vectors and the first set of sentiment scores, a first probability value of a first potential future event wherein the generating the first probability value comprises: reducing, by the first pattern-recognizer pathway, a first dimension of each entity vector in the first set of entity vectors, resulting in a first set of reduced entity vectors; reducing, by a second pattern-recognizer pathway, a second dimension of each keyword vector in a first set of keyword vectors, resulting in a first set of reduced keyword vectors; reducing, by a third pattern-recognizer pathway, a third dimension of each concept vector in a first set of concept vectors, resulting in a first set of reduced concept vectors; merging each of the reduced vectors in the first set of reduced entity vectors with a corresponding vector in the set of reduced keyword vectors and a corresponding vector in the set of reduced concept vectors, resulting in a set of merged vectors; and inputting the merged vectors in the set of merged vectors into a fourth pattern-recognizer pathway in the first neural network. 2. The method of claim 1 , further comprising: obtaining a second set of entity vectors, wherein each entity vector in the second set of vectors is a multi-dimensional entity vector that describes an entity from the corpus of sources, wherein the description is based in part on a relationship of the entity to at least a portion of the remainder of the corpus; obtaining a second set of sentiment scores, wherein each sentiment score in the second set of sentiment scores describes a sentiment associated with a corresponding entity that is described by an entity vector in the second set of entity vectors; inputting the second set of entity vectors and the second set of second sentiment scores into a fifth pattern-recognizer pathway in the first neural network, wherein the first pattern-recognizer pathway and the fifth pattern-recognizer pathways are separate pathways prior to generating the first probability value and the second probability value; and generating, by the first neural network and based on the second set of entity vectors and the second set of sentiment scores, a second probability value of a second potential future event. 3. The method of claim 2 , further comprising creating, by a multiple-regression analysis, an overall projected score, wherein the creating comprises: inputting the first probability value into a multiple-regression function; inputting the second probability value into the multiple-regression function; inputting at least one portion of structured data into the multiple-regression function; and calculating the overall projected score based on the output of the multiple-regression function. 4. The method of claim 1 , further comprising: obtaining the first set of keyword vectors, wherein each keyword vector in the first set of keyword vectors is a multi-dimensional vector that describes a keyword from the corpus of sources; inputting the first set of keyword vectors into the second pattern-recognizer pathway in the first neural network; obtaining the first set of concept vectors, wherein each concept vector in the first set of concept vectors is a multi-dimensional vector that describes a concept from the corpus of sources; and inputting the first set of concept vectors into the third pattern-recognizer pathway in the first neural network. 5. The method of claim 1 , wherein the obtaining a first set of sentiment scores comprises: obtaining a second set of sentiment scores, wherein the second set of sentiment scores comprises the first set of sentiment scores, wherein each sentiment score in the second set of sentiment scores describes the sentiment associated with a token in a second set of tokens, and wherein the second set of tokens comprises the first set of tokens; cross-referencing the first set of tokens and the second set of tokens; and selecting, based on the cross-referencing and from the second set of sentiment scores, the sentiment scores associated with tokens that are in the first set of tokens and second set of tokens, resulting in the first set of sentiment scores. 6. A computer program product, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to: obtain a first set of entity vectors, wherein each entity vector in the first set of entity vectors is a multi-dimensional entity vector that is associated with and describes an entity from a first set of entities from a corpus of sources, wherein the description is based in part on a relationship of the entity to at least a portion of the remainder of the corpus of sources, and wherein the corpus of sources contains at least one natural-language source; obtain a first set of sentiment scores, wherein each sentiment score in the first set of sentiment scores describes a sentiment associated with a corresponding entity that is described by an entity vector in the first set of vectors; obtain a second set of vectors, wherein each vector in the second set of vectors is a multi-dimensional vector that describes a token from a first set of tokens from the corpus of sources, wherein the description is based in part on a relationship of the token to at least a portion of the remainder of the corpus; obtain a second set of sentiment scores, wherein each sentiment score in the second set of sentiment scores describes a sentiment associated with a corresponding token that is described by a vector in the second set of vectors; input the first set of entity vectors and the first set of sentiment scores into a first pattern-recognizer pathway in a first neural network; inputting the second set of vectors and the second set of second sentiment scores into a second pattern-recognizer pathway in a second neural network, wherein the first neural network and the second neural network are separate networks prior to generating the first probability value and the second probability value; generate, by the first neural network and based on the first set of entity vectors and the first set of sentiment scores, a first probability value of a first potential future event; and generate, by the second neural network and based on the second set of vectors and the second set of sentiment scores, a second probability value of a second potential future event. 7. The computer program product of claim 6 , wherein the program instructions further cause the computer to obtain a third set of sentiment scores, wherein the third set of sentiment scores comprises an average sentiment score of the corpus of sources. 8. The computer program product of claim 6 , wherein the first set of entities is identified by a third neural net

Assignees

Inventors

Classifications

  • G06N3/045Primary

    Combinations of networks · CPC title

  • Supervised learning · CPC title

  • Feedforward networks · CPC title

  • G06F40/30Primary

    Semantic analysis · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11238339B2 cover?
A set of vectors may be obtained. The vectors may be multi-dimensional vectors that are associated with and describe tokens from a first set of tokens from a corpus of sources. The description may be based in part on the relationship of the token to at least a portion of the remainder of the corpus. A set of sentiment scores may be obtained. The sentiment scores in the set of sentiment scores m…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 01 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).