Data segment-based indexing
US-2021209103-A1 · Jul 8, 2021 · US
US12360980B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12360980-B2 |
| Application number | US-202318469733-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 19, 2023 |
| Priority date | Oct 14, 2020 |
| Publication date | Jul 15, 2025 |
| Grant date | Jul 15, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A database system is operable to generate a first segment that includes a first subset of a plurality of rows. A first secondary indexing scheme for indexing the first segment is selected from a plurality of secondary indexing options, and first index data for the first segment is generated in accordance with the first secondary indexing scheme. A second segment is generated that includes a second subset of the plurality of rows. A second secondary indexing scheme for indexing the second segment is selected from the plurality of secondary indexing options, and second index data is generated for the second segment in accordance with the second secondary indexing scheme. Execution of a query is facilitated by utilizing the first index data and the second index data.
Opening claim text (preview).
What is claimed is: 1. A method for execution by at least one processor of a database system, comprising: generating a first segment that includes, for each of a first subset of a plurality of rows of a dataset, a first plurality of column values corresponding to all of a set of columns of the dataset; selecting, for the first segment, a first secondary indexing scheme for indexing the set of columns from a plurality of secondary indexing options, wherein the plurality of secondary indexing options includes a set of secondary indexing types, wherein a first one of the set of secondary indexing types includes a first set of configurable parameters, and wherein selecting the first secondary indexing scheme includes selecting the first one of the set of secondary indexing types and includes further selecting first parameter selections for each of the first set of configurable parameters for the first one of the set of secondary indexing types; generating first index data for the first segment in accordance with the first secondary indexing scheme; storing the first segment and the first index data for the first segment in memory; generating a second segment that includes, for each of a second subset of the plurality of rows of the dataset, a second plurality of column values corresponding to all of the set of columns of the dataset, wherein the second subset of the plurality of rows and the first subset of the plurality of rows are mutually exclusive; selecting, for the second segment, a second secondary indexing scheme for indexing the set of columns from the plurality of secondary indexing options, wherein the second secondary indexing scheme is different from the first secondary indexing scheme based on indexing of at least one column of the set of columns in the first secondary indexing scheme being different from the indexing of the at least one column in the second secondary indexing scheme, and wherein selecting the second secondary indexing scheme includes selecting the first one of the set of secondary indexing types and includes further selecting second parameter selections for each of the first set of configurable parameters for the first one of the set of secondary indexing types that are different from the first parameter selections; generating second index data for the second segment in accordance with the second secondary indexing scheme; storing the second segment and the second index data for the second segment in memory; and facilitating execution of a query against the dataset by utilizing the first index data and the second index data. 2. The method of claim 1 , further comprising: generating first local distribution information for the first segment, wherein the first secondary indexing scheme is selected for the first segment from the plurality of secondary indexing options based on the first local distribution information; and generating second local distribution information for the second segment, wherein the second secondary indexing scheme is selected for the second segment from the plurality of secondary indexing options based on the second local distribution information, and wherein the second secondary indexing scheme is different from the first secondary indexing scheme based on the second local distribution information being different from the first local distribution information. 3. The method of claim 1 , wherein the set of column includes at least a first column, a second column, and a third column, wherein plurality of secondary indexing options includes a set of secondary indexing options corresponding to different subsets of the set of columns, wherein the first secondary indexing scheme includes indexing a first subset of the set of columns, wherein the first subset of the set of columns includes at least the first column and the second column, wherein the first subset of the set of column excludes at least the third column, wherein the second secondary indexing scheme includes indexing a second subset of the set of columns, wherein the second subset of the set of columns includes at least the first column and the third column, and wherein the second subset of the set of column excludes at least the second column. 4. The method of claim 3 , wherein the first subset of the set of columns includes a first number of columns, wherein the first index data generated for the first segment correspond to indexes for exactly the first number of columns based on applying the first secondary indexing scheme, wherein the second subset of the set of columns includes a second number of columns, wherein the second index data generated for the second segment correspond to indexes for exactly the second number of columns based on applying the second secondary indexing scheme, and wherein the first number of columns is one of: a greater than the second number of columns, or less than the second number of columns. 5. The method of claim 1 , wherein the set of column includes a first column, wherein the plurality of secondary indexing options includes a set of secondary indexing options corresponding to a plurality of different indexing types for indexing the first column, wherein the first secondary indexing scheme includes indexing the first column via a first indexing type of the plurality of different indexing types, and wherein the second secondary indexing scheme includes indexing the first column via a second indexing type of the plurality of different indexing types that is different from the first indexing type. 6. The method of claim 5 , wherein the plurality of different indexing types includes at least two of: a bloom filter, a projection index, a data-backed index, a filtering index, a composite index, a zone map, a bit map, or a B-tree. 7. The method of claim 1 , wherein the set of column includes a first column, wherein the plurality of secondary indexing options includes corresponding to selection of whether to index each of the set of columns, wherein the second secondary indexing scheme includes indexing the first column, and wherein the second secondary indexing scheme excludes indexing of the first column. 8. The method of claim 1 , wherein the set of secondary indexing types includes at least one of: a bloom filter, a projection index, a data-backed index, a filtering index, a composite index, a zone map, a bit map, or a B-tree, wherein the first index data is generated in accordance with the first one of the set of secondary indexing types, further comprising: generating a third segment that includes, for each of a third subset of the plurality of rows of the dataset, a third plurality of column values corresponding to all of the set of columns of the dataset, wherein the third subset of the plurality of rows, the second subset of the plurality of rows, and the first subset of the plurality of rows are mutually exclusive; and selecting, for the third segment, a third secondary indexing scheme for indexing the set of columns from the plurality of secondary indexing options, wherein the third secondary indexing scheme is different from the first secondary indexing scheme and the second secondary indexing scheme, wherein selecting the third secondary indexing scheme includes selecting a second one of the set of secondary indexing types, and wherein third index data for the third segment is generated in accordance with the second one of the set of secondary indexing types. 9. The method of claim 1 , wherein the set of secondary indexing types includes at least two of: a bloom filter, a projection index, a data-backed index, a filtering index, a composite index, a zone map, a bit map, or a B-tree. 10. The method of claim 1 , wherein the first index data is distinct from at least one primary
Column-oriented storage; Management thereof · CPC title
Updates performed during online database operations; commit processing · CPC title
User-Defined Types; Storage management thereof · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.