Method, electronic device, and storage medium for expanding data

US12165072B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12165072-B2
Application numberUS-202117213952-A
CountryUS
Kind codeB2
Filing dateMar 26, 2021
Priority dateApr 8, 2020
Publication dateDec 10, 2024
Grant dateDec 10, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, apparatus, device, and storage medium for expanding data are disclosed. The method includes: acquiring a triplet from a knowledge graph; mining a relationship path equivalent to a relationship in the triplet from the knowledge graph, a subject in the triplet being used as a start point of the relationship path, and an object in the triplet being used as an end point of the relationship path; and expanding the triplet based on the relationship path to generate an expanded triplet. This implementation expands the triplet in the knowledge graph, and strengthens the association between the subject and the object in the triplet in a larger context, such that the association between the subject and the object in the triplet is more global.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for expanding data, comprising: acquiring a triplet from a knowledge graph; mining a relationship path equivalent to a relationship in the triplet from the knowledge graph, a subject in the triplet being used as a start point of the relationship path, and an object in the triplet being used as an end point of the relationship path; expanding the triplet based on the relationship path to generate an expanded triplet; adding the triplet and the expanded triplet into a training sample set as training samples; and obtaining, for the training samples in the training sample set, a prediction model by supervised training with a training sample having a missing subject or object as an input, and with the missing subject or object as an output. 2. The method according to claim 1 , wherein the mining the relationship path equivalent to the relationship in the triplet from the knowledge graph comprises: mining other triplet sequences that statistically cooccur with the subject and the object in the triplet from the knowledge graph; and sequentially combining relationships in the other triplet sequences to generate the relationship path. 3. The method according to claim 1 , wherein the expanding the triplet based on the relationship path to generate the expanded triplet comprises: replacing the relationship in the triplet with the relationship path to generate the expanded triplet. 4. The method according to claim 1 , wherein the expanding the triplet based on the relationship path to generate the expanded triplet comprises: traversing the relationship path with the subject in the triplet as the start point to obtain another object other than the object in the triplet; and replacing the relationship in the triplet with the relationship path, and replacing the object in the triplet with the another object, to generate the expanded triplet. 5. The method according to claim 1 , wherein the method further comprises: acquiring a predicted triplet, wherein the predicted triplet has a missing subject or object; determining a predicted expanded triplet corresponding to the predicted triplet based on the knowledge graph; inputting the predicted triplet into the prediction model to obtain a confidence degree of a predicted subject or object corresponding to the predicted triplet, and inputting the predicted expanded triplet into the prediction model to obtain a confidence degree of the predicted subject or object corresponding to the predicted expanded triplet; and computing an average of the confidence degree of the predicted subject or object corresponding to the predicted triplet and the predicted expanded triplet, to determine the missing subject or object of the predicted triplet. 6. An electronic device, comprising: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor, such that the at least one processor can perform operations comprising: acquiring a triplet from a knowledge graph; mining a relationship path equivalent to a relationship in the triplet from the knowledge graph, a subject in the triplet being used as a start point of the relationship path, and an object in the triplet being used as an end point of the relationship path; expanding the triplet based on the relationship path to generate an expanded triplet; adding the triplet and the expanded triplet into a training sample set as training samples; and obtaining, for the training samples in the training sample set, a prediction model by supervised training with a training sample having a missing subject or object as an input, and with the missing subject or object as an output. 7. The electronic device according to claim 6 , wherein the mining the relationship path equivalent to the relationship in the triplet from the knowledge graph comprises: mining other triplet sequences that statistically cooccur with the subject and the object in the triplet from the knowledge graph; and sequentially combining relationships in the other triplet sequences to generate the relationship path. 8. The electronic device according to claim 6 , wherein the expanding the triplet based on the relationship path to generate the expanded triplet comprises: replacing the relationship in the triplet with the relationship path to generate the expanded triplet. 9. The electronic device according to claim 6 , wherein the expanding the triplet based on the relationship path to generate the expanded triplet comprises: traversing the relationship path with the subject in the triplet as the start point to obtain another object other than the object in the triplet; and replacing the relationship in the triplet with the relationship path, and replacing the object in the triplet with the another object, to generate the expanded triplet. 10. The electronic device according to claim 6 , wherein the operations further comprise: acquiring a predicted triplet, wherein the predicted triplet has a missing subject or object; determining a predicted expanded triplet corresponding to the predicted triplet based on the knowledge graph; inputting the predicted triplet into the prediction model to obtain a confidence degree of a predicted subject or object corresponding to the predicted triplet, and inputting the predicted expanded triplet into the prediction model to obtain a confidence degree of the predicted subject or object corresponding to the predicted expanded triplet; and computing an average of the confidence degree of the predicted subject or object corresponding to the predicted triplet and the predicted expanded triplet, to determine the missing subject or object of the predicted triplet. 11. A non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions when executed by a computer, cause the computer to perform operations comprising: acquiring a triplet from a knowledge graph; mining a relationship path equivalent to a relationship in the triplet from the knowledge graph, a subject in the triplet being used as a start point of the relationship path, and an object in the triplet being used as an end point of the relationship path; expanding the triplet based on the relationship path to generate an expanded triplet; adding the triplet and the expanded triplet into a training sample set as training samples; and obtaining, for the training samples in the training sample set, a prediction model by supervised training with a training sample having a missing subject or object as an input, and with the missing subject or object as an output. 12. The non-transitory computer-readable storage medium according to claim 11 , wherein the mining the relationship path equivalent to the relationship in the triplet from the knowledge graph comprises: mining other triplet sequences that statistically cooccur with the subject and the object in the triplet from the knowledge graph; and sequentially combining relationships in the other triplet sequences to generate the relationship path. 13. The non-transitory computer-readable storage medium according to claim 11 , wherein the expanding the triplet based on the relationship path to generate the expanded triplet comprises: replacing the relationship in the triplet with the relationship path to generate the expanded triplet. 14. The non-transitory computer-readable storage medium according to claim 11 , wherein the expanding the triplet based on the relationship path to genera

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

  • Graphical models, e.g. Bayesian networks · CPC title

  • Knowledge representation; Symbolic representation · CPC title

  • Entity relationship models · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12165072B2 cover?
A method, apparatus, device, and storage medium for expanding data are disclosed. The method includes: acquiring a triplet from a knowledge graph; mining a relationship path equivalent to a relationship in the triplet from the knowledge graph, a subject in the triplet being used as a start point of the relationship path, and an object in the triplet being used as an end point of the relationshi…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/2465. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 10 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).