Private-public context analysis for natural language content disambiguation

US9760627B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9760627-B1
Application numberUS-201615154162-A
CountryUS
Kind codeB1
Filing dateMay 13, 2016
Priority dateMay 13, 2016
Publication dateSep 12, 2017
Grant dateSep 12, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Mechanisms are provided for implementing a natural language content (NLC) disambiguation engine. The NLC disambiguation engine operates to identify, in a corpus of natural language content, a portion of natural language content (NLC) that is determined to be ambiguous with regard to the context of the portion of NLC. The NLC disambiguation engine compares the ambiguous content portion to private content information associated with a source of the ambiguous content portion. The NLC disambiguation engine identifies a domain of the ambiguous content portion based on a domain of a matching portion of the private content information. The NLC disambiguation engine performs a clarifying operation that clarifies the ambiguous content portion based on the identified domain of the matching portion of the private content information to thereby generate a clarified content portion for processing by a cognitive operation of a cognitive system.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, in a data processing system comprising a processor and a memory, the memory comprising instructions which are executed by the processor to configure the data processing system to implement a natural language content (NLC) disambiguation engine, the method comprising: identifying, by the NLC disambiguation engine, in a corpus of natural language content, a portion of natural language content (NLC) that is determined to be ambiguous with regard to a context of the portion of NLC, thereby identifying an ambiguous content portion; comparing, by the NLC disambiguation engine, the ambiguous content portion to private content information associated with a source of the ambiguous content portion, wherein the private content information is content information accessible to users and resources associated with the source and is not accessible to users or resources that are not associated with the source; identifying, by the NLC disambiguation engine, a domain of the ambiguous content portion based on a domain of a matching portion of the private content information; and performing, by the NLC disambiguation engine, a clarifying operation that clarifies the ambiguous content portion based on the identified domain of the ambiguous content portion to thereby generate a clarified content portion for processing by a cognitive operation of a cognitive system. 2. The method of claim 1 , wherein identifying the ambiguous content portion comprises determining that the portion of NLC is associated with a flag indicating that at least one user considers the portion of NLC to be ambiguous. 3. The method of claim 1 , wherein identifying the ambiguous content portion comprises comparing the natural language content in the corpus of natural language content to one or more patterns of natural language content corresponding to ambiguous content, and identifying the portion of natural language content to be the ambiguous content portion in response to the portion of natural language content matching at least one of the one or more patterns of natural language content. 4. The method of claim 1 , wherein comparing the ambiguous content portion to private content information comprises: comparing one or more extracted first features of the ambiguous content portion to one or more second features of the private content information; and identifying a matching portion of the private content information based on a degree of correspondence between the one or more extracted first features and the one or more second features, wherein the portion of the private content information comprises a portion having a second feature matching a first feature. 5. The method of claim 1 , wherein the clarifying operation comprises at least one of sending a clarifying question to the source, wherein the clarifying question specifies the identified domain of the ambiguous content portion or sending a notification to the source indicating a need to clarify the ambiguous content portion and specifying the identified domain of the ambiguous content portion. 6. The method of claim 1 , wherein the domain of the matching portion of the private content information comprises a domain, in a taxonomy data structure specifying domains corresponding to private content information associated with the source, which is indicated by the source to be able to be used by the NLC disambiguation engine to disambiguate ambiguous content. 7. The method of claim 6 , wherein the identified domain of the ambiguous content portion is part of a sub-portion of the taxonomy data structure that is registered by the source with the NLC disambiguation engine as being able to be used by the NLC disambiguation engine to disambiguate ambiguous content, and wherein other portions of the taxonomy data structure are not able to be used by the NLC disambiguation engine to disambiguate ambiguous content. 8. The method of claim 1 , wherein the private content information is not exposed to users or resources that are not associated with the source or authorized access by the source when performing the method. 9. The method of claim 1 , wherein the portion of NLC comprises a portion of natural language content of a corpus of natural language content ingested into a cognitive system, and wherein the cognitive operation is one of a cognitive search operation for searching for matching natural language content to a search request submitted to the cognitive system or a cognitive question answering operation for generating a natural language answer to a input natural language question. 10. The method of claim 1 , wherein the portion of NLC comprises a message posted to a collaborative website by a user that is the source, and wherein the private content information comprises a set of messages posted to the collaborative website by users that are part of a private group of the collaborative website. 11. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a computing device, causes the computing device to implement a natural language content (NLC) disambiguation engine, that operates to: identify, in a corpus of natural language content, a portion of natural language content (NLC) that is determined to be ambiguous with regard to a context of the portion of NLC, thereby identifying an ambiguous content portion; compare the ambiguous content portion to private content information associated with a source of the ambiguous content portion, wherein the private content information is content information accessible to users and resources associated with the source and is not accessible to users or resources that are not associated with the source; identify a domain of the ambiguous content portion based on a domain of a matching portion of the private content information; and perform a clarifying operation that clarifies the ambiguous content portion based on the identified domain of the ambiguous content portion to thereby generate a clarified content portion for processing by a cognitive operation of a cognitive system. 12. The computer program product of claim 11 , wherein the NLC disambiguation engine identifying the ambiguous content portion comprises determining that the portion of NLC is associated with a flag indicating that at least one user considers the portion of NLC to be ambiguous. 13. The computer program product of claim 11 , wherein the NLC disambiguation engine identifying the ambiguous content portion comprises comparing the natural language content in the corpus of natural language content to one or more patterns of natural language content corresponding to ambiguous content, and identifying the portion of natural language content to be the ambiguous content portion in response to the portion of natural language content matching at least one of the one or more patterns of natural language content. 14. The computer program product of claim 11 , wherein the NLC disambiguation engine comparing the ambiguous content portion to private content information comprises: comparing one or more extracted first features of the ambiguous content portion to one or more second features of the private content information; and identifying a matching portion of the private content information based on a degree of correspondence between the one or more extracted first features and the one or more second features, wherein the portion of the private content information comprises a portion having a second feature matching a first feature. 15. The computer program product of c

Assignees

Inventors

Classifications

  • using natural language analysis · CPC title

  • Parsing · CPC title

  • Filtering based on additional data, e.g. user or group profiles (filtering in web context G06F16/9535, G06F16/9536) · CPC title

  • G06F40/30Primary

    Semantic analysis · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9760627B1 cover?
Mechanisms are provided for implementing a natural language content (NLC) disambiguation engine. The NLC disambiguation engine operates to identify, in a corpus of natural language content, a portion of natural language content (NLC) that is determined to be ambiguous with regard to the context of the portion of NLC. The NLC disambiguation engine compares the ambiguous content portion to privat…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/3344. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 12 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).