Method and system of text synthesis based on extracted information in the form of an RDF graph making use of templates

US10210249B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10210249-B2
Application numberUS-201514717320-A
CountryUS
Kind codeB2
Filing dateMay 20, 2015
Priority dateMar 19, 2015
Publication dateFeb 19, 2019
Grant dateFeb 19, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed are system, method and computer program product for synthesis of natural-language text; receiving information objects; selecting among the received information objects information objects and an associated synthesis templates in a template library, each synthesis template including a template semantic-syntactic tree; generating for each selected information object a synthesis semantic-syntactic tree based on the template semantic-syntactic tree; and generating natural language text based on each generated synthesis semantic-syntactic tree.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving, by a hardware processor, a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data; selecting, by the hardware processor, at least one information object and a synthesis template associated with the selected information object, wherein the selecting of the at least one information object and the synthesis template associated with the selected information object comprises: selecting the at least one information object from the plurality of information objects; and selecting the synthesis template associated with the selected information object from a plurality of synthesis templates stored in a template library, the synthesis template comprising a sentence in a natural language, a plurality of substitutions comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic tree is generated based on the sentence in the natural language, wherein the selecting of the synthesis template associated with the selected information object comprises: for each synthesis template from the plurality of synthesis templates: comparing a property associated with a word in a plurality of substitutions of a respective synthesis template with one or more properties of the selected information object; responsive to determining that the property of the respective synthesis template matches one of the one or more properties of the selected information object, determining whether a natural language string value corresponding to the matching property of the selected information object is nominative or not; and responsive to determining that the natural language string value is not nominative, eliminating the respective synthesis template from the plurality of synthesis templates for selection; generating, by the hardware processor, for each selected information object, a synthesis semantic-syntactic tree based on the template semantic-syntactic tree and the plurality of substitutions of the synthesis template associated with the selected information object; and generating, by the hardware processor, a natural language text based on each generated synthesis semantic-syntactic tree to provide an annotation for the text data in the one or more source text documents. 2. The method of claim 1 , wherein generating for the selected information object a synthesis semantic-syntactic tree further comprises: responsive to determining that the template node is in the substitutions of the associated synthesis template and the associated property of the template node is a filled property of the selected information object, and the filled property of the selected information object has more than one natural-language string value, for each natural-language string value, generating in the synthesis semantic-syntactic tree a node or a sub-tree based on a semantic-syntactic analysis of the natural-language string value of the filled property; and connecting the generated nodes with a coordinating link. 3. The method of claim 1 , further comprising: forming at least one group of selected information objects associated with the same synthesis template; and generating for the at least one group the synthesis semantic-syntactic tree based on the template semantic-syntactic tree of the associated synthesis template. 4. A system comprising: a memory; and a hardware processor, coupled to the memory, to: receive a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data; select at least one information object and a synthesis template associated with the selected information object, wherein to select the at least one information object and the synthesis template associated with the selected information object, the hardware processor further to: select the at least one information object from the plurality of information objects; and select the synthesis template from a plurality of synthesis templates stored in a template library, the synthesis template comprising a sentence in a natural language, a plurality of substitutions parameters comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic tree is generated based on the sentence in the natural language wherein to select the synthesis template associated with the selected information object, the hardware processor further to: for each synthesis template from the plurality of synthesis templates: compare a property associated with a word in a plurality of substitutions of a respective synthesis template with one or more properties of the selected information object; responsive to determining that the property of the respective synthesis template matches one of the one or more properties of the selected information object, determine whether a natural language string value corresponding to the matching property of the selected information object is nominative or not; and responsive to determining that the natural language string value is not nominative, eliminate the respective synthesis template from the plurality of synthesis templates for selection; generate for each selected information object a synthesis semantic-syntactic tree based on the template semantic-syntactic tree and the plurality of substitutions of the synthesis template associated with the selected information object; and generate a natural language text based on each generated synthesis semantic-syntactic tree to provide an annotation for the text data in the one or more source text documents. 5. The system of claim 4 , is further to: form at least one group of selected information objects associated with the same synthesis template; and generate for the at least one group the synthesis semantic-syntactic tree based on the template semantic-syntactic tree of the associated synthesis template. 6. A computer program product stored on a non-transitory computer-readable storage medium, the computer program product comprising computer-executable instructions for synthesis of natural-language text, comprising instructions for: receiving, by a hardware processor, a plurality of information objects extracted from text data in one or more source text documents, the plurality of information objects having representations in a data model, wherein each information object comprises one or more properties and, for each property, a corresponding natural language string value from the text data; selecting, by the hardware processor, at least one information object and a synthesis template associated with the selected information object, wherein the selecting of the at least one information object and the synthesis template associated with the selected information object comprises: selecting the at least one information object from the plurality of information objects; and selecting the synthesis template from a plurality of synthesis templates stored in a template library, the synthesis template comprises a sentence in a natural language, a plurality of substitutions comprising one or more words from the sentence and their associated properties, and a template semantic-syntactic tree, and wherein the template semantic-syntactic

Assignees

Inventors

Classifications

  • G06F40/10Primary

    Text processing (natural language analysis G06F40/20; semantic analysis G06F40/30; processing or translation of natural language G06F40/40) · CPC title

  • Trees · CPC title

  • G06F16/367Primary

    Ontology · CPC title

  • Selection or weighting of terms for indexing · CPC title

  • Trees · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10210249B2 cover?
Disclosed are system, method and computer program product for synthesis of natural-language text; receiving information objects; selecting among the received information objects information objects and an associated synthesis templates in a template library, each synthesis template including a template semantic-syntactic tree; generating for each selected information object a synthesis semantic…
Who is the assignee on this patent?
Abbyy Infopoisk Llc, Abbyy Production Llc
What technology area does this patent fall under?
Primary CPC classification G06F40/10. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 19 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).