Performing sentiment analysis

US2016307114A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016307114-A1
Application numberUS-201615044351-A
CountryUS
Kind codeA1
Filing dateFeb 16, 2016
Priority dateOct 24, 2011
Publication dateOct 20, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

There is provided a computer-implemented method of performing sentiment analysis. An exemplary method comprises performing a first sentiment analysis on microblogging data based on a method using an opinion lexicon. The method also includes training a classifier using training data from the first sentiment analysis. Additionally, the method includes identifying a new opinion term in the microblogging data by performing a statistical test. The new opinion terms are not in the opinion lexicon. The method also includes identifying new microblogging data based on the new opinion term. Further, the method includes performing a second sentiment analysis on the new microblogging data using the classifier.

First claim

Opening claim text (preview).

1 . A method comprising: performing a first sentiment analysis on microblogging data using an opinion lexicon; adding a new opinion term to the opinion lexicon, the new opinion term identified in the microblogging data using results of the first sentiment analysis; identifying additional microblogging data, based on the opinion lexicon to which the new opinion term has been added; performing a second sentiment analysis on the additional microblogging data to generate training data; and training a classifier using the training data. 2 . The method of claim 1 , further comprising identifying the new opinion term using the results of the first sentiment analysis. 3 . The method of claim 2 , wherein identifying the new opinion term using the results of the first sentiment analysis comprises performing a statistical test on the results of the first sentiment analysis. 4 . The method of claim 1 , wherein the training data is second training data, wherein performing the first sentiment analysis generates first training data, and wherein the method further comprises: training the classifier using the first training data, wherein training the classifier using the second training data comprises retraining the classifier as has been trained using the first training data, using the second training data. 5 . The method of claim 1 , wherein the opinion lexicon that is used to perform the first sentiment analysis includes non-domain-specific opinion terms, and wherein the new opinion term is domain-specific to the microblogging data. 6 . A non-transitory machine-readable storage medium encoded with instructions executable by at least one processor, the machine-readable storage medium comprising instructions to: perform a first sentiment analysis on data using a lexicon; identify a new term in the data using results of the first sentiment analysis; add the new term to the lexicon; identify additional data, based on the lexicon to which the new term has been added; perform a second sentiment analysis on the additional data to generate training data; and train a classifier using the training data. 7 . The non-transitory machine-readable storage medium of claim 6 , wherein the instructions to identify the new term using the results of the first sentiment analysis comprise instructions to perform a statistical test on the results of the first sentiment analysis. 8 . The non-transitory machine-readable storage medium of claim 7 , wherein the statistic test comprises a Pearsons chi-square method, and wherein the new opinion term has a Pearsons chi-square value greater than a threshold. 9 . The non-transitory machine-readable storage medium of claim 6 , wherein the training data is second training data, wherein the instructions to perform the first sentiment analysis generate first training data, and wherein the instructions further comprises instructions to: train the classifier using the first training data, wherein the instructions to train the classifier using the second training data retrain the classifier as has been trained using the first training data, using the second training data. 10 . The non-transitory machine-readable storage medium of claim 6 , wherein the lexicon that is used to perform the first sentiment analysis includes non-domain-specific terms, and wherein the new term is domain-specific to the data. 11 . A system comprising: a processor; a memory device storing instructions executable by the processor to: identify new microblogging data, based on an opinion lexicon that has been improved by a new opinion term identified in existing microblogging data; perform a sentiment analysis on the new microblogging data to generate training data; and train a classifier using the training data. 12 . The system of claim 11 , wherein the sentiment analysis is a second sentiment analysis, and wherein the processor is to identify the new microblogging data by performing a first sentiment analysis on the existing microblogging data using the opinion lexicon prior to improvement. 13 . The system of claim 12 , wherein the processor is to identify the new microblogging data by further performing a statistical test on results of the first sentiment analysis. 14 . The system of claim 12 , wherein the training data is second training data, performing the first sentiment analysis generates first training data, and wherein the processor is further to: train the classifier using the first training data, wherein the processor is to train the classifier using the second training data by retraining the classifier as has been trained using the first training data, using the second training data. 15 . The system of claim 11 , wherein the lexicon prior to improvement includes non-domain-specific opinion terms, and wherein the new opinion term is domain-specific to the existing microblogging data.

Assignees

Inventors

Classifications

  • Marketing; Price estimation or determination; Fundraising · CPC title

  • Creation or modification of classes or clusters · CPC title

  • Semantic analysis · CPC title

  • Physics · mapped topic

  • G06N99/005Primary

    Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016307114A1 cover?
There is provided a computer-implemented method of performing sentiment analysis. An exemplary method comprises performing a first sentiment analysis on microblogging data based on a method using an opinion lexicon. The method also includes training a classifier using training data from the first sentiment analysis. Additionally, the method includes identifying a new opinion term in the microbl…
Who is the assignee on this patent?
Hewlett Packard Entpr Dev Lp
What technology area does this patent fall under?
Primary CPC classification G06N99/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 20 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).