Language independent representations

US9990361B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9990361-B2
Application numberUS-201514878794-A
CountryUS
Kind codeB2
Filing dateOct 8, 2015
Priority dateOct 8, 2015
Publication dateJun 5, 2018
Grant dateJun 5, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Snippets can be represented in a language-independent semantic manner. Each portion of a snippet can be represented by a combination of a semantic representation and a syntactic representation, each in its own dimensional space. A snippet can be divided into portions by constructing a dependency structure based on relationships between words and phrases. Leaf nodes of the dependency structure can be assigned: A) a semantic representation according to pre-defined word mappings and B) a syntactic representation according to the grammatical use of the word. A trained semantic model can assign to each non-leaf node of the dependency structure a semantic representation based on a combination of the semantic and syntactic representations of the corresponding lower-level nodes. A trained syntactic model can assign to each non-leaf node a syntactic representation based on a combination of the syntactic representations of the corresponding lower-level nodes and the semantic representation of that node.

First claim

Opening claim text (preview).

We claim: 1. A method for generating improved machine translations using a language independent representation of a snippet that accounts for semantics of the snippet, the method comprising: receiving the snippet, wherein the snippet comprises a digital representation of one or more words or groups of characters from a first natural language; building a dependency structure based on the received snippet, the dependency structure comprising multiple nodes; obtaining semantic and syntactic representations of leaf nodes of the dependency structure; generating a first semantic representation corresponding to a selected non-leaf node of the dependency structure by applying a semantic model to semantic and syntactic representations of parent nodes of the selected non-leaf node; mapping the first semantic representation into a virtual language independent space; locating a second semantic representation that is the closest, in the virtual language independent space, to the first semantic representation; and providing, as a translation of the snippet, content in a second natural language that was a basis for creating the second semantic representation. 2. The method of claim 1 wherein: the syntactic representations are syntactic vectors; and the semantic representations are semantic vectors. 3. The method of claim 2 wherein the semantic model comprises: a tensor function that generates a tensor based on two syntactic vectors; a first matrix function that generates a first matrix based on two syntactic vectors; a second matrix function that generates a second matrix based on two syntactic vectors; and an offset function that generates an offset vector based on two syntactic vectors. 4. The method of claim 3 wherein generating the semantic representation corresponding to the selected non-leaf node of the dependency structure comprises: obtaining a first syntactic vector corresponding to a first parent node of the selected non-leaf node; obtaining a second syntactic vector corresponding to a second parent node of the selected non-leaf node; generating the tensor by applying the tensor function to the first syntactic vector and the second syntactic vector; generating the first matrix by applying the first matrix function to the first syntactic vector and the second syntactic vector; generating the second matrix by applying the second matrix function to the first syntactic vector and the second syntactic vector; and generating the offset vector by applying the offset function to the first syntactic vector and the second syntactic vector. 5. The method of claim 4 wherein generating the semantic representation corresponding to the selected non-leaf node of the dependency structure comprises: obtaining a first semantic vector corresponding to the first parent node of the selected non-leaf node; obtaining a second semantic vector corresponding to the second parent node of the selected non-leaf node; computing a first result by multiplying together: the tensor, the first semantic vector, and the second semantic vector; computing a second result by multiplying together: the first matrix with the first semantic vector; computing a third result by multiplying together: the second matrix with the second semantic vector; and computing the semantic representation corresponding to the selected non-leaf node as the sum of: the first result, the second result, the third result, and the offset vector. 6. The method of claim 2 further comprising: generating a syntactic representation corresponding to the selected non-leaf node of the dependency structure by applying a syntactic model to syntactic representations of the parent nodes of the selected non-leaf node and to the semantic representation corresponding to the selected non-leaf node. 7. The method of claim 6 wherein the syntactic model comprises: a tensor function that generates a tensor based on two syntactic vectors; a first matrix function that generates a first matrix based on two syntactic vectors; a second matrix function that generates a second matrix based on two syntactic vectors; an offset function that generates an offset vector based on two syntactic vectors; and a mapping matrix that is a linear mapping from semantic space to syntactic space. 8. The method of claim 7 wherein generating the syntactic representation corresponding to the selected non-leaf node comprises: obtaining a first syntactic vector corresponding to a first parent node of the selected non-leaf node; obtaining a second syntactic vector corresponding to a second parent node of the selected non-leaf node; generating the tensor by applying the tensor function to the first syntactic vector and the second syntactic vector; generating the first matrix by applying the first matrix function to the first syntactic vector and the second syntactic vector; generating the second matrix by applying the second matrix function to the first syntactic vector and the second syntactic vector; and generating the offset vector by applying the offset function to the first syntactic vector and the second syntactic vector. 9. The method of claim 8 wherein generating the syntactic representation corresponding to the selected non-leaf node of the dependency structure comprises: computing a first result by multiplying together: the first matrix with a first semantic vector; computing a second result by multiplying together: the second matrix with a second semantic vector; computing a third result by multiplying together: the mapping matrix with the semantic representation corresponding to the selected non-leaf node; and computing the syntactic representation corresponding to the selected non-leaf node as the sum of: the first result, the second result, the third result, the tensor, and the offset vector. 10. The method of claim 1 wherein the dependency structure is a binary tree structure. 11. The method of claim 1 wherein the selected non-leaf node of the dependency structure is the root node of the dependency structure, and wherein the method further comprises: generating, for a composition that includes the semantic representation corresponding to the selected non-leaf node of the dependency structure, a score; and adjusting parameters of the semantic model based on the score. 12. The method of claim 11 wherein generating the score comprises: applying, to the semantic representation corresponding the root node and to a syntactic representation corresponding the root node, a scoring neural network that is trained to receive a semantic vector and a syntactic vector and generate the score indicating how reliably the semantic vector maps into the language independent space. 13. The method of claim 12 wherein generating the score further comprises: applying the scoring neural network to multiple nodes of the dependency structure to compute corresponding scores for the multiple nodes of the dependency structure; and combining, as the score for the composition, the scores for the multiple nodes of the dependency structure. 14. The method of claim 13 wherein combining the scores for the multiple nodes of the dependency structure comprises: summing the scores for the multiple nodes for the dependency structure; or multiplying each selected score of the multiple nodes for the dependency structure by (½)^depth, wherein the depth is the maximum number of edges between the node corresponding to that selected score and the root node of the dependency structure, and summing the results of the multiplications. 15. The method of claim 1 wherein: at least tw

Assignees

Inventors

Classifications

  • Data-driven translation · CPC title

  • G06F40/58Primary

    Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

  • Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars · CPC title

  • G06F40/30Primary

    Semantic analysis · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9990361B2 cover?
Snippets can be represented in a language-independent semantic manner. Each portion of a snippet can be represented by a combination of a semantic representation and a syntactic representation, each in its own dimensional space. A snippet can be divided into portions by constructing a dependency structure based on relationships between words and phrases. Leaf nodes of the dependency structure c…
Who is the assignee on this patent?
Facebook Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/58. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 05 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).