Method, apparatus, and computer-readable medium for postal address identification
US-2024428099-A1 · Dec 26, 2024 · US
US2016203404A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016203404-A1 |
| Application number | US-201314917074-A |
| Country | US |
| Kind code | A1 |
| Filing date | Sep 14, 2013 |
| Priority date | Sep 14, 2013 |
| Publication date | Jul 14, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Example embodiments relate to predicting execution times of concurrent queries. In example embodiments, historical data is iteratively generated for a machine learning model by varying a concurrency level of query executions in a database, determining a query execution plan for a pending concurrent query, extracting query features from the query execution plan, and executing the pending concurrent query to determine a query execution time. The machine learning model may then be created based on the query features, variation in the concurrency level, and the query execution time. The machine learning model is used to generate an execution schedule for production queries, where the execution schedule satisfies service level agreements of the production queries.
Opening claim text (preview).
1 . A system for predicting execution times of concurrent queries, the system comprising of: a processor to: iteratively generate historic data for creating a machine learning model by: varying a concurrency level of query executions in a database; determining a query execution plan for a pending concurrent query; extracting a plurality of query features from the query execution plan; and executing the pending concurrent query to determine a query execution time; create the machine learning model based on the plurality of query features, variation in the concurrency level, and the query execution time; and use the machine learning model to generate an execution schedule for a plurality of production queries, wherein the execution schedule satisfies service level agreements of the plurality of production queries. 2 . The system of claim 1 , wherein the processor uses the machine learning model to generate the execution schedule for the plurality of production queries by: matching one of the plurality of production queries to a subset to the plurality of query features; determining a predicted execution time for the one of the plurality of production queries based on the subset; and determining an execution order for the plurality of production queries based on the predicted execution time. 3 . The system of claim 2 , wherein the processor is further to: identify significant features of the plurality of features that are statistically used more often in production, wherein the subset includes the significant features. 4 . The system of claim 1 , wherein the processor is further to: determine a production query execution plan for each of the plurality of production queries; extract a plurality of production query features from each of the production query execution plan; execute each of the plurality of production queries to determine a production query execution time; update the machine learning model based on the plurality of production query features and the production query execution time of each of the plurality of production queries. 5 . The system of claim 1 , wherein the concurrency level is in a range of two to a maximum value greater than two, wherein each value in the range is iteratively used as the concurrency level to generate the historic data. 6 . The system of claim 1 , wherein the machine learning model is created using a boosted trees technique that generates a group of decision trees based on the plurality of query features, variation in the concurrency level, and the query execution time. 7 . A method for predicting execution times of concurrent querues, comprising: receiving historic data associated with a database for creating a machine learning model, wherein the historic data includes query execution times for training queries that have been iteratively executed at varying concurrency levels and a plurality of query features that have been extracted from query execution plans of the training queries; using a boosted trees technique to create the machine learning model based on the plurality of query features, the varying concurrency levels, and the query execution times; and using the machine learning model to generate an execution schedule for a plurality of production queries, wherein the execution schedule satisfies service level agreements of the plurality of production queries. 8 . The method of claim 7 , wherein using the machine learning model to generate the execution schedule for the plurality of production queries comprises: matching one of the plurality of production queries to a subset of the plurality of query features; determining a predicted execution time for the one of the plurality of production queries based on the subset; and determining an execution order for the plurality of production queries based on the predicted execution time. 9 . The method of claim 8 , further comprising: identifying significant features of the plurality of features that are statistically used more often in production, wherein the subset includes the significant features. 10 . The method of claim 7 , further comprising: determining a production query execution plan for each of the plurality of production queries; extracting a plurality of production query features from each of the production query execution plan; executing each of the plurality of production queries to determine a production query execution time; updating the machine learning model based on the plurality of production query features and the production query execution time of each of the plurality of production queries. 11 . The method of claim 7 , wherein the varying concurrency levels are in a range of two to a maximum value greater than two, wherein each value in the range has been iteratively used to generate the historic data. 12 . A non-transitory machine-readable storage medium encoded with instructions executable by a processor for predicting execution times of concurrent queries, the machine-readable storage medium comprising instructions to: iteratively generate historic data for creating a machine learning model by: varying a concurrency level of query executions in a database, wherein the concurrency level is iteratively varied to values in a range of two to a maximum value greater than two; determining a query execution plan for pending concurrent query; extracting a plurality of query features from the query execution plan; and executing the pending concurrent query to determine a query execution time; create the machine learning model based on the plurality of query features, variation in the concurrency level, and the query execution time; and use the machine learning model to generate an execution schedule for a plurality of production queries, wherein the execution schedule satisfies service level agreements of the plurality of production queries. 13 . The non-transitory machine-readable storage medium of claim 12 , wherein using the machine learning model to generate the execution schedule for the plurality of production queries comprises: matching one of the plurality of production queries to a subset of the plurality of query features; determining a predicted execution time for the one of the pluralityy of production queries based on the subset; and determining an execution order for the plurality of production queries based on the predicted execution time. 14 . The non-transitory machine-readable storage medium of claim 13 , further comprising instructions to: identify significant features of the plurality of features that are statistically used more often in production, wherein the subset includes the significant features. 15 . The non-transitory machine-readable storage medium of claim 12 , further comprising instructions to: determine a production query execution plan for each of the plurality of production queries; extract a plurality of production query features from each of the production query execution plan; execute each of the plurality of production queries to determine a production query execution time; update the machine learning model based on the plurality of production query features and the production query execution time of each of the plurality of production queries.
Related publications grouped by family.
Answers are generated from the same data shown on this page.