Generating parse trees of text segments using neural networks

US10409908B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10409908-B2
Application numberUS-201514976121-A
CountryUS
Kind codeB2
Filing dateDec 21, 2015
Priority dateDec 19, 2014
Publication dateSep 10, 2019
Grant dateSep 10, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating parse trees for input text segments. One of the methods includes obtaining an input text segment, processing the input text segment using a first long short term memory (LSTM) neural network to convert the input text segment into an alternative representation for the input text segment, and processing the alternative representation for the input text segment using a second LSTM neural network to generate a linearized representation of a parse tree for the input text segment.

First claim

Opening claim text (preview).

What is claimed is: 1. A method performed by one or more computers, the method comprising: obtaining an input text segment; processing the input text segment using a first long short term memory (LSTM) neural network to convert the input text segment into an alternative representation for the input text segment; and processing the alternative representation for the input text segment using a second LSTM neural network to generate a linearized representation of a hierarchical parse tree for the input text segment, including using the second LSTM neural network to sequentially select symbols for the linearized representation, the selected symbols including (i) first symbols that represent syntactic elements of the input text segment and (ii) second symbols that represent hierarchical relationships between particular first symbols in the hierarchical parse tree. 2. The method of claim 1 , further comprising: generating the hierarchical parse tree for the input text segment from the linearized representation of the hierarchical parse tree. 3. The method of claim 1 , wherein the linearized representation of the hierarchical parse tree for the input text segment is a sequence of symbols from a predetermined vocabulary of parse tree symbols. 4. The method of claim 1 , wherein the input text segment is a variable length input text segment. 5. The method of claim 1 , wherein the alternative representation is a vector of fixed dimensionality. 6. The method of claim 1 , wherein processing the input text segment comprises: adding an end-of-sentence token to the end of the input text segment to generate a modified input text segment; and processing the modified input text segment using the first LSTM neural network. 7. The method of claim 1 , wherein processing the alternative representation for the input text segment using the second LSTM neural network comprises initializing a hidden state of the second LSTM neural network to the alternative representation for the input text segment. 8. The method of claim 1 , wherein processing the alternative representation for the input text segment using the second LSTM neural network comprises: processing the alternative representation for the input text segment using the second LSTM neural network to generate a respective sequence score for each of a plurality of possible linearized representations of hierarchical parse trees; and selecting a possible linearized representation having a highest sequence score as the linearized representation of the hierarchical parse tree for the input text segment. 9. The method of claim 8 , wherein processing the alternative representation for the input text segment using the second LSTM neural network to generate a respective sequence score for each of a plurality of possible linearized representations of hierarchical parse trees comprises: processing the alternative representation using the second LSTM neural network using a left to right beam search decoding. 10. The method of claim 8 , wherein the set of possible linearized representations comprises possible linearized representations of varying lengths. 11. The method of claim 1 , further comprising: training the first LSTM neural network and the second LSTM neural network using Stochastic Gradient Descent. 12. The method of claim 1 , wherein one or more of the first LSTM neural network or the second LSTM neural network is a deep LSTM neural network. 13. The method of claim 1 , wherein the hierarchical parse tree is a tree that represents the syntactic structure of the text segment according to a context-free grammar, and wherein a linearized representation of a particular hierarchical parse tree is generated by traversing the particular hierarchical parse tree in a depth-first traversal order. 14. The method of claim 1 , comprising using the second LSTM neural network to sequentially select symbols for the linearized representation until an end-of-sentence token is selected that indicates the linearized representation is complete. 15. A system comprising one or more computers and one or more storage devices storing instructions that, when executed by the one or more computers, cause the one or more computers to perform operations comprising: obtaining an input text segment; processing the input text segment using a first long short term memory (LSTM) neural network to convert the input text segment into an alternative representation for the input text segment; and processing the alternative representation for the input text segment using a second LSTM neural network to generate a linearized representation of a hierarchical parse tree for the input text segment, including using the second LSTM neural network to sequentially select symbols for the linearized representation, the selected symbols including (i) first symbols that represent syntactic elements of the input text segment and (ii) second symbols that represent hierarchical relationships between particular first symbols in the hierarchical parse tree. 16. The system of claim 15 , the operations further comprising: generating the hierarchical parse tree for the input text segment from the linearized representation of the hierarchical parse tree. 17. The system of claim 15 , wherein the linearized representation of the hierarchical parse tree for the input text segment is a sequence of symbols from a predetermined vocabulary of parse tree symbols. 18. The system of claim 15 , wherein processing the alternative representation for the input text segment using the second LSTM neural network comprises initializing a hidden state of the second LSTM neural network to the alternative representation for the input text segment. 19. A computer program product encoded on one or more non-transitory computer storage media, the computer program product comprising instruction that, when executed by one or more computers, cause the one or more computers to perform operations comprising: obtaining an input text segment; processing the input text segment using a first long short term memory (LSTM) neural network to convert the input text segment into an alternative representation for the input text segment; and processing the alternative representation for the input text segment using a second LSTM neural network to generate a linearized representation of a hierarchical parse tree for the input text segment, including using the second LSTM neural network to sequentially select symbols for the linearized representation, the selected symbols including (i) first symbols that represent syntactic elements of the input text segment and (ii) second symbols that represent hierarchical relationships between particular first symbols in the hierarchical parse tree.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10409908B2 cover?
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating parse trees for input text segments. One of the methods includes obtaining an input text segment, processing the input text segment using a first long short term memory (LSTM) neural network to convert the input text segment into an alternative representation for the input text segment…
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06F40/205. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 10 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).