Graph-based event schema induction for information retrieval

US11615152B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11615152-B2
Application numberUS-202117223774-A
CountryUS
Kind codeB2
Filing dateApr 6, 2021
Priority dateApr 6, 2021
Publication dateMar 28, 2023
Grant dateMar 28, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, devices, computer-implemented methods, and/or computer program products that facilitate event schema induction from unstructured or semi-structured data. In one example, a system can comprise a processor that executes computer executable components stored in memory. The computer executable components can comprise a schema component and a retrieval component. The schema component can derive an event schema for a document corpus using parsing results obtained from the document corpus. The retrieval component can populate a response to a query with a document of the document corpus using events extracted from the query and the document using the event schema.

First claim

Opening claim text (preview).

What is claimed is: 1. A system, comprising: a memory that stores computer executable components; and a processor that executes the computer-executable components stored in memory, wherein the computer executable components comprise: a schema component that derives an event schema for a document corpus using parsing results obtained from the document corpus; and a retrieval component that populates a response to a query with a document of the document corpus using events extracted from the query and the document using the event schema, wherein the events extracted from the query and the document include an extracted event that comprises a list of tuple representations, and wherein a tuple representation in the list of tuple representations is a vector formed by concatenating respective vector representations of an event type, an event trigger, an event argument, and an argument role. 2. The system of claim 1 , further comprising: an extraction component that extracts the events from the query and the document of the document corpus using the event schema. 3. The system of claim 2 , wherein the extraction component assigns a weight to an extracted event based on a usage frequency of the extracted event by the retrieval component, a context in which the extracted event appears, or a combination thereof. 4. The system of claim 1 , wherein the schema component derives the event schema for the document corpus by identifying candidate event triggers and event arguments from the parsing results to form proto-events. 5. The system of claim 1 , wherein the schema component derives the event schema for the document corpus by generating vector representations of events using a graph neural network. 6. The system of claim 1 , wherein the schema component derives the event schema for the document corpus by clustering vector representations of events into a plurality of clusters to identify event types. 7. The system of claim 1 , further comprising: a feedback component that adjusts the event schema based on feedback data obtained from usage logs. 8. The system of claim 1 , wherein the parsing results are obtained using a parser. 9. A computer-implemented method, comprising: deriving, by a system operatively coupled to a processor, an event schema for a document corpus using parsing results obtained from the document corpus; and populating, by the system, a response to a query with a document of the document corpus using events extracted from the query and the document using the event schema, wherein the events extracted from the query and the document include an extracted event that comprises a list of tuple representations, and wherein a tuple representation in the list of tuple representations is a vector formed by concatenating respective vector representations of an event type, an event trigger, an event argument, and an argument role. 10. The computer-implemented method of claim 9 , further comprising: extracting, by the system, the events from the query and the document of the document corpus using the event schema. 11. The computer-implemented method of claim 9 , wherein the system derives the event schema for the document corpus by identifying candidate event triggers and event arguments from the parsing results to form proto-events. 12. The computer-implemented method of claim 9 , wherein the system derives the event schema for the document corpus by generating vector representations of events using a graph neural network. 13. The computer-implemented method of claim 9 , wherein the system derives the event schema for the document corpus by clustering vector representations of events into a plurality of clusters to identify event types. 14. The computer-implemented method of claim 9 , further comprising: adjusting, by the system, the event schema based on feedback data obtained from usage logs. 15. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to: derive, by the processor, an event schema for a document corpus using parsing results obtained from the document corpus; and populate, by the processor, a response to a query with a document of the document corpus using events extracted from the query and the document using the event schema, wherein the events extracted from the query and the document include an extracted event that comprises a list of tuple representations, and wherein a tuple representation in the list of tuple representations is a vector formed by concatenating respective vector representations of an event type, an event trigger, an event argument, and an argument role. 16. The computer program product of claim 15 , wherein the processor derives the event schema for the document corpus by identifying candidate event triggers and event arguments from the parsing results to form proto-events. 17. The computer program product of claim 15 , wherein the processor derives the event schema for the document corpus by generating vector representations of events using a graph neural network. 18. The computer program product of claim 15 , wherein the processor derives the event schema for the document corpus by clustering vector representations of events into a plurality of clusters to identify event types. 19. The computer program product of claim 15 , wherein the processor adjusts the event schema based on feedback data obtained from usage logs. 20. The computer program product of claim 15 , wherein the parsing results are obtained using a parser.

Assignees

Inventors

Classifications

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Generative networks · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Active learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11615152B2 cover?
Systems, devices, computer-implemented methods, and/or computer program products that facilitate event schema induction from unstructured or semi-structured data. In one example, a system can comprise a processor that executes computer executable components stored in memory. The computer executable components can comprise a schema component and a retrieval component. The schema component can de…
Who is the assignee on this patent?
IBM, Univ Illinois
What technology area does this patent fall under?
Primary CPC classification G06F16/93. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 28 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).