Knowledge graph construction method and device

US11720629B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11720629-B2
Application numberUS-201816034799-A
CountryUS
Kind codeB2
Filing dateJul 13, 2018
Priority dateJul 14, 2017
Publication dateAug 8, 2023
Grant dateAug 8, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present invention provides a knowledge graph construction method and device. The method includes: obtaining structured data, where the structured data includes a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information includes a first attribute and a first attribute value; performing, based on measurement of a similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, where the measurement of the similarity includes at least one of the following types: measurement of a character similarity, measurement of a structure similarity of a classification tree on which an entity is located, and measurement of an attribute similarity; and importing the structured data into the knowledge graph according to an entity alignment processing result. Embodiments may ensure correctness of data in the knowledge graph.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented knowledge graph construction method, comprising: obtaining structured data, wherein the structured data comprises a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information comprises a first attribute and a first attribute value; performing, based on a measurement of similarity between the first entity and a second entity in a knowledge graph, entity alignment on the first entity, wherein the measurement of similarity comprises at least one of the following types: measurement of a character similarity, and measurement of an attribute similarity; and importing, the structured data into the knowledge graph based on the entity alignment, wherein the importing comprises: when the entity alignment indicates that the first entity is aligned with the second entity, and attribute alignment is performed on the first attribute of the first entity and a second attribute of the second entity, determining whether the second attribute exists in the knowledge graph; if the second attribute does not exist in the knowledge graph, importing the first attribute and the first attribute value to the second entity; and if the second attribute exists in the knowledge graph: when the first attribute is a single-value attribute, determining whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and if the first attribute value does not conflict with the second attribute value, performing deduplication processing; if the first attribute value conflicts with the second attribute value, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, importing the first attribute value to the second entity, and deleting the second attribute value; or when the first attribute is a multi-value attribute, and comprises a plurality of first attribute values that do not conflict with the second attribute value, determining, in the plurality of first attribute values, an attribute value different from the second attribute value, and importing the determined attribute value to the second entity. 2. The computer-implemented knowledge graph construction method according to claim 1 , wherein the performing, based on the measurement of similarity between the first entity and the second entity in the knowledge graph, entity alignment processing on the first entity comprises: determining, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph; and performing entity alignment processing on the first entity according to the determined measurement type. 3. The computer-implemented knowledge graph construction method according to claim 2 , wherein the performing entity alignment processing on the first entity according to the determined measurement type comprises: determining whether a child node and a parent node of the first entity are the same as a child node and a parent node of the second entity; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned. 4. The computer-implemented knowledge graph construction method according to claim 2 , wherein the performing entity alignment processing on the first entity according to the determined measurement type comprises: determining whether a character similarity between the first entity name and the second entity name in the knowledge graph is greater than a preset threshold; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned. 5. The computer-implemented knowledge graph construction method according to claim 2 , wherein the first attribute comprises a key attribute and a non-key attribute; and the performing entity alignment processing on the first entity according to the determined measurement type comprises: determining whether the second attribute exists in the knowledge graph, and if yes, determining whether attribute values corresponding to the key attribute and the second attribute are the same; and if yes, determining that the entities are aligned, and if not, determining that the entities are not aligned. 6. The computer-implemented knowledge graph construction method according to claim 1 , wherein before the determining, according to a type of a data source of the structured data, a measurement type for performing similarity processing between the first entity and the second entity in the knowledge graph, the method further comprises: obtaining a description type of each piece of attribute information; and performing cleansing and normalization processing on each piece of attribute information according to a standard description statement corresponding to the description type, so that attribute information being semantically the same has the same description. 7. The computer-implemented knowledge graph construction method according to claim 1 , wherein the method further comprises: in the knowledge graph, for a second attribute used to represent a relationship between entities, determining an implied relationship between entities by using a preset chain rule, and mapping the implied relationship to the knowledge graph. 8. A knowledge graph construction device, comprising a processor and a non-transitory computer-readable storage medium storing instructions that, when execute by the processor, cause the processor to perform a method comprising: obtaining structured data, wherein the structured data comprises a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information comprises a first attribute and a first attribute value; performing, based on a measurement of similarity between the first entity and a second entity in a knowledge graph, entity alignment processing on the first entity, wherein the measurement of similarity comprises at least one of the following types: measurement of a character similarity, and measurement of an attribute similarity; and importing the structured data into the knowledge graph based on the entity alignment, wherein the importing comprises: when the entity alignment indicates that the first entity is aligned with the second entity, and attribute alignment is performed on the first attribute of the first entity and a second attribute of the second entity, determining whether the second attribute exists in the knowledge graph; if the second attribute does not exist in the knowledge graph, importing the first attribute and the first attribute value to the second entity; and if the second attribute exists in the knowledge graph: when the first attribute is a single-value attribute, determining whether the first attribute value corresponding to the first attribute conflicts with a second attribute value corresponding to the second attribute, and if the first attribute value does not conflict with the second attribute value, performing deduplication processing, or if the first attribute value conflicts with the second attribute value, when a reliability degree of the first attribute value is higher than a reliability degree of the second attribute value, importing the first attribute value to the second entity, and deleting the second attribute value; or when the first attribute is a multi-value attribute, and comprises a plurality of first attribute values that do not conflict with the second attribute value, determining, in the plurality of first attribute values, an attribute value different from the s

Assignees

Inventors

Classifications

  • Graphs; Linked lists (G06F16/9027 takes precedence) · CPC title

  • Pattern recognition · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

  • Tree-organised classifiers · CPC title

  • Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11720629B2 cover?
The present invention provides a knowledge graph construction method and device. The method includes: obtaining structured data, where the structured data includes a first entity name of a first entity and attribute information corresponding to the first entity name, and the attribute information includes a first attribute and a first attribute value; performing, based on measurement of a simil…
Who is the assignee on this patent?
Guangdong Shenma Search Tech Co Ltd, Alibaba Group Holding Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/9024. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 08 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).