Data complementing system and data complementing method

US2021286782A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2021286782-A1
Application numberUS-202117190185-A
CountryUS
Kind codeA1
Filing dateMar 2, 2021
Priority dateMar 10, 2020
Publication dateSep 16, 2021
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A data complementing system stores cell-region characteristic data that includes values of a plurality of data items regarding a cell region that is a region obtained by dividing the region into a mesh, information indicating a missing data item that is the data item of missing data being data missed in the cell-region characteristic data, external region characteristic data that includes values of a plurality of data items regarding an external region that is different from the region, and an external cell-region characteristic data that includes values of a plurality of data items regarding an external cell region obtained by dividing the external region into a mesh, generates a complement model for generating complement data indicating a value of the missing data item based on the external region characteristic data and the external cell-region characteristic data, and generates the complement data based on the complement model.

First claim

Opening claim text (preview).

What is claimed is: 1 . A data complementing system including an information processing apparatus, the system comprising: a storage unit configured to store region characteristic data that includes values of a plurality of data items regarding a predetermined region, cell-region characteristic data that includes values of a plurality of data items regarding a cell region that is a region obtained by dividing the region into a mesh, information indicating a missing data item that is the data item of missing data being data missed in the cell-region characteristic data, external region characteristic data that includes values of a plurality of data items regarding an external region that is different from the region, and an external cell-region characteristic data that includes values of a plurality of data items regarding an external cell region obtained by dividing the external region into a mesh; and a complement model generation unit configured to generate a complement model for generating complement data that is for complementing the missing data, based on the external region characteristic data and the external cell-region characteristic data. 2 . The data complementing system according to claim 1 , wherein the complement model generation unit is configured to generate a first regression equation for each of a plurality of the external regions, the first regression equation in which the missing data item is expressed with the data item in the external cell-region characteristic data, which corresponds to the missing data item, generate a second regression equation for all combinations obtained by selecting two of the plurality of the external regions, the second regression equation in which a difference between partial regression coefficients in the first regression equation for each of the external regions is used as an objective variable, and a difference between the values of the same data items in the external region characteristic data for the two selected external regions is used as an explanatory variable, and generate the complement model based on the first regression equation and the second regression equation, the complement model in which the missing data item is expressed with the data item included in an explanatory variable pattern, the partial regression coefficient in the first regression equation, and a partial regression coefficient in the second regression equation, the explanatory variable pattern being a combination of one or more data items in the cell-region characteristic data. 3 . The data complementing system according to claim 2 , wherein the storage unit is configured to store a plurality of the explanatory variable patterns, and the complement model generation unit is configured to generate the complement model for each of the plurality of the explanatory variable patterns. 4 . The data complementing system according to claim 3 , wherein the storage unit is configured to store a significance level used for determining significance of each of a plurality of the complement models, and the complement model generation unit is configured to determine the significance of the first regression equation and the significance of the second regression equation based on the significance level for each of the plurality of the complement models, and generate a first significance determination result indicating the significance of the first regression equation and a second significance determination result indicating the significance of the second regression equation, for each of the plurality of the complement models. 5 . The data complementing system according to claim 4 , further comprising: a complement model selection unit configured to obtain a deviation between the value of the data item corresponding to the missing data item in the region characteristic data and a sum of the values of the complement data for all cell regions belonging to the predetermined region, for each of the plurality of the complement models, output at least any of the deviation, the first significance determination result, and the second significance determination result, and receive a selection of the complement model corresponding to each of the plurality of the explanatory variable patterns. 6 . The data complementing system according to claim 5 , further comprising: a data complementing unit configured to generate the complement data based on the selected complement model. 7 . The data complementing system according to claim 6 , further comprising: a complement data correction unit configured to correct the generated complement data based on the deviation. 8 . The data complementing system according to claim 6 , further comprising: an output unit configured to output at least any of information on the selected complement model and complement data generated by the complement model. 9 . The data complementing system according to claim 1 , further comprising: a receiving unit configured to receive an input of at least any of the region characteristic data, the cell-region characteristic data, and the information indicating the missing data item. 10 . A data complementing method, the method comprising: by an information processing apparatus, storing region characteristic data that includes values of a plurality of data items regarding a predetermined region, cell-region characteristic data that includes values of a plurality of data items regarding a cell region that is a region obtained by dividing the region into a mesh, information indicating a missing data item that is the data item of missing data being data missed in the cell-region characteristic data, external region characteristic data that includes values of a plurality of data items regarding an external region that is different from the region, and an external cell-region characteristic data that includes values of a plurality of data items regarding an external cell region obtained by dividing the external region into a mesh; and generating a complement model for generating complement data that is for complementing the missing data, based on the external region characteristic data and the external cell-region characteristic data. 11 . The data complementing method according to claim 10 , further comprising: by the information processing apparatus, generating a first regression equation for each of a plurality of the external regions, the first regression equation in which the missing data item is expressed with the data item in the external cell-region characteristic data, which corresponds to the missing data item; generating a second regression equation for all combinations obtained by selecting two of the plurality of the external regions, the second regression equation in which a difference between partial regression coefficients in the first regression equation for each of the external regions is used as an objective variable, and a difference between the values of the same data items in the external region characteristic data for the two selected external regions is used as an explanatory variable; and generating the complement model based on the first regression equation and the second regression equation, the complement model in which the missing data item is expressed with the data item included in an explanatory variable pattern, the partial regression coefficient in the first regression equation, and a partial regression coefficient in the second regression equation, the explanatory variable pattern being a combination of one or more data items in the cell-region characteristic data. 12 . The data complementing method according to

Assignees

Inventors

Classifications

  • G06F16/29Primary

    Geographical information databases · CPC title

  • G06F16/215Primary

    Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title

  • Ensuring data consistency and integrity · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2021286782A1 cover?
A data complementing system stores cell-region characteristic data that includes values of a plurality of data items regarding a cell region that is a region obtained by dividing the region into a mesh, information indicating a missing data item that is the data item of missing data being data missed in the cell-region characteristic data, external region characteristic data that includes value…
Who is the assignee on this patent?
Hitachi Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/29. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Sep 16 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).