Data import and reconciliation

US11030224B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11030224-B2
Application number: US-201715684113-A
Country: US
Kind code: B2
Filing date: Aug 23, 2017
Priority date: Aug 23, 2017
Publication date: Jun 8, 2021
Grant date: Jun 8, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In one embodiment, the present disclosure pertains to data import and reconciliation. In one embodiment, a location field is compared against alternative geo-descriptors to link the location fields to areaIDs and geospatial shapes. A similarity search is performed against unmatched location fields. In one example embodiment, a table is generated with unique location IDs, areaIDs, and metadata describing the results of the comparison.
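The two-stage lookup described in the abstract (exact comparison first, similarity search for whatever is left) can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration: the sample descriptors, the areaID values, the `reconcile` function name, and the 0.8 threshold are hypothetical, and the standard-library `difflib.SequenceMatcher` ratio stands in for a similarity measure that the publication does not specify.

```python
from difflib import SequenceMatcher

# Hypothetical sample data: alternative geo-descriptor -> areaID.
ALT_DESCRIPTORS = {
    "germany": "A1",
    "deutschland": "A1",
    "allemagne": "A1",
    "france": "A2",
    "frankreich": "A2",
}


def normalize(text):
    """Trim whitespace and lowercase so comparison is case-insensitive."""
    return text.strip().lower()


def reconcile(location, threshold=0.8):
    """Return (area_id, status, candidates) for one location value.

    An exact match against the alternative geo-descriptors wins outright.
    Otherwise a similarity search returns every descriptor whose score is
    above the threshold, best first, for a user to choose from.
    """
    key = normalize(location)
    if key in ALT_DESCRIPTORS:
        return ALT_DESCRIPTORS[key], "matched", []

    # Assumed similarity measure: difflib ratio in the range [0, 1].
    scored = [
        (SequenceMatcher(None, key, alt).ratio(), alt)
        for alt in ALT_DESCRIPTORS
    ]
    candidates = [
        alt for score, alt in sorted(scored, reverse=True) if score > threshold
    ]
    return None, "unmatched", candidates


if __name__ == "__main__":
    for value in ["Deutschland", "Germny", "Atlantis"]:
        print(value, reconcile(value))
```

In this sketch an unmatched value is never assigned an areaID automatically; the candidate list is only a suggestion, which mirrors the user-selection step in the first claim.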

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: storing a plurality of unique area identifications each associated with a corresponding geospatial shape of geospatial shapes, wherein the plurality of unique area identifications are stored in a first field in one or more tables and the geospatial shapes are stored in a second field in the one or more tables; storing a plurality of master geo-descriptors each associated with one of the unique area identifications, wherein the plurality of master geo-descriptors are stored in a third field in the one or more tables and the unique area identifications are stored in a fourth field in the one or more tables; storing a plurality of alternative geo-descriptors in association with the unique area identifications, wherein each unique area identification is associated with a plurality of the alternative geo-descriptors, the plurality of alternative geo-descriptors are alternate representations of each of the master geo-descriptors, the plurality of alternative geo-descriptors and the master geo-descriptors are stored in a fifth field in the one or more tables, and the unique area identifications are stored in a sixth field in the one or more tables; receiving a data set comprising at least one location field to be mapped to the unique area identifications; joining the one or more tables; and comparing data elements in the at least one location field to the plurality of alternative geo-descriptors, wherein when particular data elements match one of the plurality of alternative geo-descriptors, determining the unique area identification associated with the matched one of the plurality of alternative geo-descriptors and associating the unique area identification with the matching data elements; and when a particular data element does not match any of the plurality of alternative geo-descriptors: performing a similarity search of the particular data element against the plurality alternative geo-descriptors to return a result set comprising one or more of the plurality alternative geo-descriptors having a likelihood greater than a threshold; presenting the result set to a user for selection; and determining the unique area identification associated with the user selected result.

2. The method of claim 1 wherein the plurality of master geo-descriptors and the associated plurality of unique area identifications are stored in an area description table, the unique area identifications and the associated geospatial shapes are stored in a shape table, and the plurality of alternative geo-descriptors and the associated unique area identifications are stored in an alternative description table.

3. The method of claim 1 further comprising, sending data elements that do not match any of the plurality of alternative geo-descriptors for display to a user, wherein the similarity search is performed only for one particular displayed data element when the user selects the particular displayed data element.

4. The method of claim 1 further comprising, when data elements in the at least one location field match multiple of the plurality of alternative geo-descriptors, then: presenting the multiple of the plurality alternative geo-descriptors matching the particular data elements to a user for selection; and determining the unique area identification associated with the user selected alternative geo-descriptor.

5. The method of claim 1 further comprising, based on said comparing data elements step, generating a table comprising a first field storing each unique location in the data set, a second field storing either none, one, or multiple unique area identifications, and a third field storing metadata describing a result of the comparing data elements step.

6. The method of claim 1 further comprising storing a locale in association with each of the plurality of alternative geo-descriptors, the locale specifying a language of a particular geo-descriptor, wherein the locale limits the scope of said comparing data elements step, and wherein the similarity search of the particular data element is further performed against at least one locale.

7. The method of claim 1 further comprising associating a geo-level descriptor with the unique area identifications, the geo-level descriptor specifying a geographic hierarchy, wherein the geo-level descriptor limits the scope of said comparing data elements step, and wherein the similarity search of the particular data element is further performed against at least one geo-level descriptor.

8. The method of claim 1 further comprising storing a plurality of ISO country codes each associated with one of the unique area identifications, wherein a user specifies that countries in the data set are represented as ISO country codes, said comparing data elements comprising: comparing data elements in a first location field of the at least one location field of the data set to the ISO country codes; comparing data elements in a second location field of the at least one location field of the data set to the plurality of alternative geo-descriptors; and wherein the similarity search is performed for the second location field when the second location field does not match any of the plurality of alternative geo-descriptors, and the similarity search is performed for the first location field against the ISO country codes when the first location field does not match any of the stored ISO country codes.

9. The method of claim 1 wherein: the plurality of master geo-descriptors are stored in a first field of a first table and the plurality of unique area identifications are stored in a second field of the first table; the unique area identifications are stored in a first field of a second table and the geospatial shapes are stored in a second field of the second table; and the plurality of alternative geo-descriptors and the master geo-descriptors are stored in a first field of a third table and the unique area identifications are stored in a second field of the third table; and the method further comprising, before said comparing step, joining the first table, the second table, and the third table.

10. The method of claim 9 wherein the joining step is performed during a data reconciliation phase, and during a data consumption phase occurring after the data reconciliation phase the method comprising joining the first table and the second table.

11. The method of claim 1 further comprising, based on said comparing data elements step, generating a table comprising a first field storing each unique location in the data set, a second field storing either none, one, or multiple unique area identifications, and a third field storing metadata describing a result of the comparing data elements step, identifying data values in the table; accessing formatting information for the data values; and generating one or more arrays of formatted data.

12. A non-transitory machine-readable medium storing a program executable by at least one processing unit of a device, the program comprising sets of instructions for: storing a plurality of unique area identifications each associated with a corresponding geospatial shape of geospatial shapes, wherein the plurality of unique area identifications are stored in a first field in one or more tables and the geospatial shapes are stored in a second field in the one or more tables; storing a plurality of master geo-descriptors each associated with one of the unique area identifications, wherein the plurality of master geo-descriptors are stored in a third field in the one or more tables and the unique area identifications are stored in a fourth field in the one or more tables; storing a plur
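The three-table layout of claims 2 and 9, the result table of claim 5, the multiple-match case of claim 4, and the locale restriction of claim 6 can be pictured with a small sketch using Python's standard `sqlite3` module. This is only one possible reading under stated assumptions: the table and column names, the sample rows, the placeholder shape strings, the status labels, and the `build_result_table` helper are all hypothetical and do not come from the publication.

```python
import sqlite3

# In-memory database with hypothetical table and column names.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE area_description (master_descriptor TEXT, area_id TEXT);
CREATE TABLE shape (area_id TEXT, geo_shape TEXT);
CREATE TABLE alt_description (descriptor TEXT, area_id TEXT, locale TEXT);

INSERT INTO area_description VALUES
  ('Germany', 'A1'), ('Georgia (country)', 'A2'), ('Georgia (US state)', 'A3');
-- Placeholder strings stand in for real geospatial shapes.
INSERT INTO shape VALUES
  ('A1', 'SHAPE_A1'), ('A2', 'SHAPE_A2'), ('A3', 'SHAPE_A3');
INSERT INTO alt_description VALUES
  ('Germany', 'A1', 'en'), ('Deutschland', 'A1', 'de'),
  ('Georgia', 'A2', 'en'), ('Georgia', 'A3', 'en');
""")

# Join the three tables, then compare one location value against the
# alternative descriptors. A NULL locale parameter means "no restriction".
MATCH_SQL = """
SELECT DISTINCT alt.area_id
FROM alt_description AS alt
JOIN area_description AS ad ON ad.area_id = alt.area_id
JOIN shape AS s ON s.area_id = alt.area_id
WHERE lower(alt.descriptor) = lower(?)
  AND (? IS NULL OR alt.locale = ?)
ORDER BY alt.area_id
"""


def build_result_table(conn, locations, locale=None):
    """Return one (location, area_ids, status) row per unique location."""
    rows = []
    for location in dict.fromkeys(locations):  # de-duplicate, keep order
        cursor = conn.execute(MATCH_SQL, (location, locale, locale))
        area_ids = [row[0] for row in cursor]
        if len(area_ids) == 1:
            status = "matched"
        elif area_ids:
            status = "ambiguous"   # several areaIDs: a user would choose
        else:
            status = "unmatched"   # candidate for a similarity search
        rows.append((location, area_ids, status))
    return rows


if __name__ == "__main__":
    data = ["Deutschland", "Georgia", "Atlantis", "Deutschland"]
    for row in build_result_table(conn, data):
        print(row)
```

Each output row holds a unique location, zero, one, or several areaIDs, and a status string playing the role of the comparison metadata. Passing `locale="en"` narrows the comparison to English descriptors, so "Deutschland" would come back unmatched in that case.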

Assignees

Inventors

Classifications

  • G06F16/29 (Primary)

    CPC title: Geographical information databases

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11030224B2 cover?
In one embodiment, the present disclosure pertains to data import and reconciliation. In one embodiment, a location field is compared against alternative geo-descriptors to link the location fields to areaIDs and geospatial shapes. A similarity search is performed against unmatched location fields. In one example embodiment, a table is generated with unique location IDs, areaIDs, and metadata d…
Who is the assignee on this patent?
SAP SE
What technology area does this patent fall under?
Primary CPC classification G06F16/29. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jun 8, 2021 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).