Methods and systems for approximating embeddings of out-of-knowledge-graph entities for link prediction in knowledge graph

US12505358B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12505358-B2
Application numberUS-202117514512-A
CountryUS
Kind codeB2
Filing dateOct 29, 2021
Priority dateOct 29, 2021
Publication dateDec 23, 2025
Grant dateDec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure describes methods and systems for generating an approximated embedding of an out-of-knowledge-graph entity based on a knowledge graph. The method includes: receiving a target entity, a dataset associated with the target entity, and an embeddings space of a knowledge graph comprising a set of structured data, wherein the target entity is out of the knowledge graph and the embeddings space includes a set of vectors representing the set of structured data in the embeddings space; selecting a set of elements from the knowledge graph, each element being related to the target entity according to the dataset associated with the target entity; constructing a set of descriptory triples based on the target entity and the set of elements; obtaining an embedding matrix based on the descriptory triples and the embeddings space; and generating an approximated embedding for the target entity based on the embedding matrix.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computing device for generating an approximated embedding of an out-of-knowledge-graph entity based on a knowledge graph, the computing device comprising: a reception circuitry configured to receive a target entity, a dataset associated with the target entity, and an embeddings space of a knowledge graph comprising a set of structured data, wherein the set of structured data comprises a set of entities and links between the set of entities in the knowledge graph, wherein the target entity is out of the knowledge graph and the embeddings space includes a set of vectors representing the set of structured data associated with the knowledge graph in the embeddings space, and wherein the dataset associated with the target entity comprises data records of the target entity in triples stored in the knowledge graph; a selection circuitry configured to select a set of elements from the knowledge graph, each element being related to the target entity according to the dataset associated with the target entity; a construction circuitry configured to construct a set of descriptory triples based on the target entity and the set of elements, wherein the set of descriptory triples comprises the data records of the target entity, wherein each descriptory triple of the set of descriptory triples comprises the target entity, an entity selected from the set of elements, and a predicate obtained from the dataset associated with the target entity, and wherein the predicate defines a relational link between the target entity and the entity selected from the set of elements; an aggregation circuitry configured to obtain an embedding matrix based on the set of descriptory triples and the embeddings space, wherein the embedding matrix comprises a plurality of row vectors, wherein each row vector corresponds to a descriptory triple of the set of descriptory triples, wherein each row vector is formed by combining embeddings, retrieved from the embeddings space, of the target entity, the entity selected from the set of elements, and the predicate of the descriptory triple; an approximator circuitry configured to generate an approximated embedding for the target entity based on a pretrained embedding estimation model to the embedding matrix, wherein the approximated embedding indicates a vector representation corresponding to the target entity in the embeddings space of the knowledge graph; and an output circuitry configured to output a plausibility prediction of linkage between the target entity and one or more entities of the set of entities in the knowledge graph based on the approximated embedding for the target entity, wherein the plausibility prediction allows automatic integration of the target entity into the knowledge graph without retraining the embedding estimation model. 2 . The computing device according to claim 1 , wherein: the set of elements from the knowledge graph comprises a set of neighbor relations and a set of neighbor entities according to the dataset associated with the target entity; and each of the descriptory triples comprises one element from the set of neighbor relations, one element from the set of neighbor entities, and the target entity. 3 . The computing device according to claim 1 , wherein: the set of descriptory triples comprises a set of subjective descriptory triples and a set of objective descriptory triples; each triple of the set of subjective descriptory triples comprises the target entity as a subject in the triple and a subject along with a link belonging to the knowledge graph with known embeddings; and each triple of the set of objective descriptory triples comprises the target entity as the object in the triple and the subject along with a link belonging to the knowledge graph with known embeddings. 4 . The computing device according to claim 3 , wherein: the embedding matrix comprises a first set of rows and a second set of rows; each row of the first set of rows corresponds to each triple of the set of subjective descriptory triples and comprises an embedding of the predicate of each triple of the set of subjective descriptory triples and an embedding of the object of each triple of the set of subjective descriptory triples; and each row of the second set of rows corresponds to each triple of the set of objective descriptory triples and comprises an embedding of the subject of each triple of the set of objective descriptory triples and an embedding of the predicate of each triple of the set of objective descriptory triples. 5 . The computing device according to claim 1 , wherein: the embedding matrix comprising a matrix with m rows and n columns, m and n being positive integers; and a value of n is twice of a number of embedding dimensions of the embeddings space of the knowledge graph. 6 . The computing device according to claim 1 , wherein: the embedding estimation model is pretrained by: for each sample entity or a sample subset of entities in the knowledge graph: generating a modified embeddings space for the knowledge graph with the sample entity or the sample subset of entities excluded, identifying neighborhood elements of the sample entity or the sample subset of entities in the knowledge graph, generating a neighborhood embedding matrix according to the modified embeddings space and the neighborhood elements, processing the neighborhood embedding matrix using the pretrained embedding estimation model to generate an approximated embedding for the sample entity or the sample subset of entities, obtaining an estimation error based on the approximated embedding and a true embedding of the sample entity or the sample subset of entities in the embeddings space of the knowledge graph; and adjusting a set of model parameters of the pretrained embedding estimation model by minimizing an overall error aggregated over the estimation error of each sample entity or the sample subset of entities in the knowledge graph. 7 . A method for generating an approximated embedding of one or more out-of-knowledge-graph entity based on a knowledge graph, the method comprising: receiving, by a reception circuitry, a target entity, a dataset associated with the target entity, and an embeddings space of a knowledge graph comprising a set of structured data, wherein the set of structured data comprises a set of entities and links between the set of entities in the knowledge graph, wherein the target entity is out of the knowledge graph and the embeddings space includes a set of vectors representing the set of structured data associated with the knowledge graph in the embeddings space, and wherein the dataset associated with the target entity comprises data records of the target entity in triples stored in the knowledge graph; selecting, by a selection circuitry, a set of elements from the knowledge graph, each element being related to the target entity according to the dataset associated with the target entity; constructing, by a construction circuitry, a set of descriptory triples based on the target entity and the set of elements, wherein the set of descriptory triples comprises the data records of the target entity, wherein each descriptory triple of the set of descriptory triples comprises the target entity, an entity selected from the set of elements, and a predicate obtained from the dataset associated with the target entity, and wherein the predicate defines a relational link between the target entity and the entity selected from the set of elements; obtaining, by an aggregation circuitry, an embedding matrix based on the descriptory triples and the embeddings space, wherein the embedding matrix comprises a plurality of row vectors, wherein each row vector corresponds to a descriptory triple of the set

Assignees

Inventors

Classifications

  • based on approximation criteria, e.g. principal component analysis · CPC title

  • Adversarial learning · CPC title

  • G06N5/022Primary

    Knowledge engineering; Knowledge acquisition · CPC title

  • G06N5/02Primary

    Knowledge representation; Symbolic representation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12505358B2 cover?
The present disclosure describes methods and systems for generating an approximated embedding of an out-of-knowledge-graph entity based on a knowledge graph. The method includes: receiving a target entity, a dataset associated with the target entity, and an embeddings space of a knowledge graph comprising a set of structured data, wherein the target entity is out of the knowledge graph and the …
Who is the assignee on this patent?
Accenture Global Solutions Ltd
What technology area does this patent fall under?
Primary CPC classification G06N5/022. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).