Generating neighborhood convolutions according to relative importance

US11227014B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11227014-B2
Application numberUS-201916273969-A
CountryUS
Kind codeB2
Filing dateFeb 12, 2019
Priority dateMar 13, 2018
Publication dateJan 18, 2022
Grant dateJan 18, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for generating embeddings for nodes of a corpus graph are presented. More particularly, embedding information of a target node may be based on the node itself, as well as related, relevant nodes to the target node within a corpus graph. The information of various nodes among the relevant nodes to the target node can be used to weight or influence the embedding information. Disclosed systems and methods include generating neighborhood embedding information for a target node, where the neighborhood embedding information includes embedding information from neighborhood nodes of the target node's relevant neighborhood, and where certain nodes having more relevance to the target node can be weighted to influence the generation of the neighborhood embedding information over nodes having less relevance to the target node.

First claim

Opening claim text (preview).

What is claimed: 1. A computer-implemented method, comprising: under the control of one or more processors executing on a computer system: maintaining, in a data store, a corpus graph comprising a first plurality of nodes; conducting a random walk process among the first plurality of nodes originating with a target node and resulting in a subset of nodes of the first plurality of nodes, wherein each node of the subset of nodes is associated with an importance value to the target node according to a visit count to the respective node during the random walk process; determining a relevant neighborhood for the target node comprising a second plurality of nodes of the subset of nodes having highest associated importance values; generating neighborhood embedding information for the target node, the neighborhood embedding information comprising a plurality of embedding elements, each embedding element determined according to an embedding vector of a node of the relevant neighborhood and as a function of the associated importance value of the node to the target node; storing the neighborhood embedding information in the data store in association with the target node; and in response to determining that a query corresponds to the target node, providing at least one item of content corresponding to at least one node as responsive to the query, the at least one node determined based at least in part on the neighborhood embedding information. 2. The computer-implemented method of claim 1 , wherein generating the neighborhood embedding information for the target node comprises: for each element of the neighborhood embedding information: selecting a node of the relevant neighborhood; and selecting an embedding element of an embedding vector of the selected node as a current embedding element of the neighborhood embedding information. 3. The computer-implemented method of claim 2 , wherein selecting a node of the relevant neighborhood comprises selecting a node of the relevant neighborhood according to the importance values of the nodes of the relevant neighborhood. 4. The computer-implemented method of claim 1 , wherein the relevant neighborhood for the target node includes up to a predetermined number of nodes. 5. The computer-implemented method of claim 1 , wherein the second plurality of nodes comprises nodes of the subset of nodes having the highest associated importance values greater than a predetermined threshold value. 6. The computer-implemented method of claim 1 , further comprising: combining the embedding elements of the neighborhood embedding information with an embedding vector of the target node to form an aggregated embedding vector for the target node; and storing the aggregated embedding vector in the data store in association with the target node. 7. The computer-implemented method of claim 1 , wherein the nodes of the relevant neighborhood are further determined, at least in part, according to a frequency of relationships between at least some nodes of the relevant neighborhood. 8. A non-transitory computer-readable medium bearing computer executable instructions which, when executed on a computing system comprising at least a processor, carry out a method, comprising: maintaining, in a data store of the computing system, a corpus graph comprising a first plurality of nodes; conducting a random walk process among the first plurality of nodes originating with target node, the random walk process resulting in a subset of nodes of the first plurality of nodes, wherein each node of the subset of nodes is associated with an importance value to the target node according to a visit count to the respective node during the random walk process; determining a relevant neighborhood for the target node comprising a second plurality of nodes of the subset of nodes having highest associated importance values; generating neighborhood embedding information for the target node, the neighborhood embedding information comprising a plurality of embedding elements, each embedding element determined according to an embedding vector of a node of the relevant neighborhood and as a function of the associated importance value of the node to the target node; storing the neighborhood embedding information in the data store in association with the target node; and in response to determining that a query corresponds to the target node, providing at least one item of content corresponding to at least one node as responsive to the query, the at least one node determined based at least in part on the neighborhood embedding information. 9. The non-transitory computer-readable medium of claim 8 , wherein generating the neighborhood embedding information for the target node comprises: for each embedding element of the neighborhood embedding information: selecting a node of the relevant neighborhood; and selecting an embedding element of an embedding vector of the selected node as a current embedding element of neighborhood embedding information. 10. The non-transitory computer-readable medium of claim 9 , wherein selecting a node of the relevant neighborhood comprises selecting a node of the relevant neighborhood according to the importance values of the nodes of the relevant neighborhood. 11. The non-transitory computer-readable medium of claim 8 , wherein the relevant neighborhood for the target node includes up to a predetermined number of nodes. 12. The non-transitory computer-readable medium of claim 8 , wherein the second plurality of nodes comprises nodes of the subset of nodes having the highest associated importance values greater than a predetermined threshold value. 13. The non-transitory computer-readable medium of claim 8 , wherein the method carried out on the computing system further comprises: combining the embedding values of the neighborhood embedding information with an embedding vector of the target node to form an aggregated embedding vector for the target node; and storing the aggregated embedding vector in the data store in association with the target node. 14. The non-transitory computer-readable medium of claim 8 , wherein the nodes of the relevant neighborhood are further determined, at least in part, according to a frequency of relationships between at least some nodes of the relevant neighborhood. 15. A computer system comprising a processor and a memory, wherein the processor, in executing instructions stored in the memory, causes the computer system to at least: maintain in a data store a corpus graph comprising a first plurality of nodes; conduct a random walk process among the first plurality of nodes originating with a target node and resulting in a subset of nodes of the first plurality of nodes, wherein each node of the subset of nodes is associated with an importance value corresponding to the target node based, at least in part, on a visit count to the respective node during the random walk process; determine a relevant neighborhood for the target node comprising a second plurality of nodes of the subset of nodes having highest associated importance values; generate neighborhood embedding information for the target node, the neighborhood embedding information comprising a plurality of embedding elements, each embedding element determined according to an embedding vector of a node of the relevant neighborhood and as a function of the associated importance value of the node to the target node; store the neighborhood embedding information in the data store in association with the target node; and in response to determining that a query corresponds to the target node, provide at l

Assignees

Inventors

Classifications

  • Graphs; Linked lists (G06F16/9027 takes precedence) · CPC title

  • Combinations of networks · CPC title

  • Selection of the most significant subset of features · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Graphical models, e.g. Bayesian networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11227014B2 cover?
Systems and methods for generating embeddings for nodes of a corpus graph are presented. More particularly, embedding information of a target node may be based on the node itself, as well as related, relevant nodes to the target node within a corpus graph. The information of various nodes among the relevant nodes to the target node can be used to weight or influence the embedding information. D…
Who is the assignee on this patent?
Pinterest Inc, Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/9024. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 18 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).