Querying spatial data in column stores using grid-order scans

US10380130B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10380130-B2
Application numberUS-201715635004-A
CountryUS
Kind codeB2
Filing dateJun 27, 2017
Priority dateMay 9, 2014
Publication dateAug 13, 2019
Grant dateAug 13, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A query of spatial data is received by a database comprising a columnar data store storing data in a column-oriented structure. Thereafter, a minimal bounding rectangle associated with the query is identified using a grid order scanning technique. The spatial data set corresponding to the received query is then mapped to physical storage in the database using the identified minimal bounding rectangle so that the spatial data set can be retrieved. Related apparatus, systems, techniques and articles are also described.

First claim

Opening claim text (preview).

What is claimed is: 1. A method implemented on one or more systems comprising one or more programmable processors, the method comprising: mapping spatial data stored in a database comprising a columnar data store storing the spatial data in a column-oriented structure, the mapping comprising preserving spatial proximity of a plurality of spatial point objects when the spatial data are physically stored using a grid ordering comprising: dividing a bounded space containing the plurality of spatial point objects into a grid having fixed boundaries and comprising rectangular cells, and indexing the cells of the grid, wherein the indexing comprises assigning each spatial point object of the plurality of spatial point objects to a particular cell of the grid, assigning one or more bounding boxes to one or more of the cells, maintaining a record of the assignment of the one or more bounding boxes to the one or more of the cells, and creating index vectors to represent the cells of the grid, wherein assigning one or more bounding boxes to one or more of the cells comprises assigning a bounding box to a cell if the bounding box intersects a predefined number of adjacent cells with respect to a single characteristic point of the bounding box, and assigning the bounding box to an overflow cell if the intersection of the bounding box exceeds the predefined cellsize; receiving a query of the spatial data and checking overflow cell for every received query; identifying a target bounding box based on the received query of the spatial data, the identifying comprising scanning the cells of the grid using the index vectors and return bit vector to check valid entries; determining, based on the mapping and the identified target bounding box, a spatial data set corresponding to the received query and a physical storage location in the database from which to retrieve the spatial data set; retrieving the spatial data set from the physical storage location based on the determining and in response to the received query of the spatial data; and providing the retrieved spatial data set in response to the received query of the spatial data. 2. The method of claim 1 , wherein the grid ordering comprises the grid of cells being of uniform length along each axis of a plurality of axes. 3. The method of claim 1 , wherein indexing the cells further comprises adding a separate vector that stores, for each replicated entry, an object identifier. 4. The method of claim 3 , wherein the one or more bounding boxes have a sequentially increasing integer object identifier. 5. The method of claim 4 , further comprising, upon retrieving the spatial data set for the received query, applying the mapping to identify the plurality of spatial point objects. 6. The method of claim 1 , wherein the grid ordering comprises the grid of cells having non-uniform dimensions. 7. A non-transitory computer program product storing instructions which, when executed by at least one data processor forming part of at least one computing system, result in operations comprising: mapping spatial data stored in a database comprising a columnar data store storing the spatial data in a column-oriented structure, the mapping comprising preserving spatial proximity of a plurality of spatial point objects when the spatial data are physically stored using a grid ordering comprising: dividing a bounded space containing the plurality of spatial point objects into a grid having fixed boundaries and comprising rectangular cells, and indexing the cells of the grid, wherein the indexing comprises assigning each spatial point object of the plurality of spatial point objects to a particular cell of the grid, assigning one or more bounding boxes to one or more of the cells, maintaining a record of the assignment of the one or more bounding boxes to the one or more of the cells, and creating index vectors to represent the cells of the grid, wherein assigning one or more bounding boxes to one or more of the cells comprises assigning a bounding box to a cell if the bounding box intersects a predefined number of adjacent cells with respect to a single characteristic point of the bounding box, and assigning the bounding box to an overflow cell if the intersection of the bounding box exceeds the predefined cellsize; receiving a query of the spatial data and checking overflow cell for every received query; identifying a target bounding box based on the received query of the spatial data, the identifying comprising scanning the cells of the grid using the index vectors and return bit vector to check valid entries; determining, based on the mapping and the identified target bounding box, a spatial data set corresponding to the received query and a physical storage location in the database from which to retrieve the spatial data set; retrieving the spatial data set from the physical storage location based on the determining and in response to the received query of the spatial data; and providing the retrieved spatial data set in response to the received query of the spatial data. 8. The computer program product of claim 7 , wherein the grid ordering comprises the grid of cells being of uniform length along each axis of a plurality of axes. 9. The computer program product of claim 7 , wherein indexing the cells further comprises adding a separate vector that stores, for each replicated entry, an object identifier. 10. The computer program product of claim 9 , wherein the one or more bounding boxes have a sequentially increasing integer object identifier. 11. The computer program product of claim 10 , wherein the operations further comprise, upon retrieving the spatial data set for the received query, applying the mapping to identify the plurality of spatial point objects. 12. The computer program product of claim 7 , wherein the grid ordering comprises the grid of cells having non-uniform dimensions. 13. A system comprising: a database comprising a columnar data store storing data in a column-oriented structure; at least one data processor; and memory storing instructions which, when executed by the at least one data processor, result in operations comprising: mapping spatial data stored in a database comprising a columnar data store storing the spatial data in a column-oriented structure, the mapping comprising preserving spatial proximity of a plurality of spatial point objects when the spatial data are physically stored using a grid ordering comprising: dividing a bounded space containing the plurality of spatial point objects into a grid having fixed boundaries and comprising rectangular cells, and indexing the cells of the grid, wherein the indexing comprises assigning each spatial point object of the plurality of spatial point objects to a particular cell of the grid, assigning one or more bounding boxes to one or more of the cells, maintaining a record of the assignment of the one or more bounding boxes to the one or more of the cells, and creating index vectors to represent the cells of the grid, wherein assigning one or more bounding boxes to one or more of the cells comprises assigning a bounding box to a cell if the bounding box intersects a predefined number of adjacent cells with respect to a single characteristic point of the bounding box, and assigning the bounding box to an overflow cell if the intersection of the bounding box exceeds the predefined cellsize; receiving a query of the spatial data and checking overflow cell for every received query; identifying a target bounding box based on the received query of the spatial data, the identifying comprising scanning the cells of the grid using the index vecto

Assignees

Inventors

Classifications

  • Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries · CPC title

  • Mapping to a database · CPC title

  • G06F16/29Primary

    Geographical information databases · CPC title

  • Column-oriented storage; Management thereof · CPC title

  • Spatial or temporal dependent retrieval, e.g. spatiotemporal queries · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10380130B2 cover?
A query of spatial data is received by a database comprising a columnar data store storing data in a column-oriented structure. Thereafter, a minimal bounding rectangle associated with the query is identified using a grid order scanning technique. The spatial data set corresponding to the received query is then mapped to physical storage in the database using the identified minimal bounding rec…
Who is the assignee on this patent?
Sap Se
What technology area does this patent fall under?
Primary CPC classification G06F16/2458. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 13 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).