What technology area does this patent fall under?

Primary CPC classification G06F12/0871. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Apr 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Automated time series forecasting pipeline generation

US11966340B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11966340-B2
Application number	US-202217654965-A
Country	US
Kind code	B2
Filing date	Mar 15, 2022
Priority date	Feb 18, 2021
Publication date	Apr 23, 2024
Grant date	Apr 23, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candidate machine learning pipelines. Predictions of each of the candidate machine learning pipelines using at least the one or more features may be evaluated. A ranked list of machine learning pipelines may be automatically generated from the candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for automated time series forecasting machine learning pipeline generation in a computing environment by one or more processors comprising: determining a data allocation size of time series data based on one or more characteristics of a time series data set; executing machine learning logic to allocate the time series data to one or more candidate machine learning pipelines based on the data allocation size, wherein a sequential order of the time series data set is used while allocating the time series data based on the data allocation size; executing the machine learning logic to train the one or more candidate machine learning pipelines using selected data associated with the time series data set; executing the machine learning logic to determine and cache one or more features for the time series data by the one or more candidate machine learning pipelines; evaluating predictions of each of the one or more candidate machine learning pipelines using at least the one or more features; and automatically generating a ranked list of machine learning pipelines from the one or more candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines. 2. The method of claim 1 , further including determining a holdout data set, a test data set, and a training data set from the time series data for allocating the time series data. 3. The method of claim 1 , further including evaluating the trained one or more candidate machine learning pipelines using the time series data, a hold data set, a test data set, and a training data set from the time series data. 4. The method of claim 1 , further including allocating the time series data backward in time. 5. The method of claim 1 , further including combining the one or more features with previously determined features for use by the one or more candidate machine learning pipelines. 6. The method of claim 1 , further including caching the one or more features at a final estimator of the one or more candidate machine learning pipelines. 7. A system for automated time series forecasting machine learning pipeline generation in a computing environment, comprising: one or more computers with executable instructions that when executed cause the system to: determine a data allocation size of time series data based on one or more characteristics of a time series data set; execute machine learning logic to allocate the time series data to one or more candidate machine learning pipelines based on the data allocation size, wherein a sequential order of the time series data set is used while allocating the time series data based on the data allocation size; execute the machine learning logic to train the one or more candidate machine learning pipelines using selected data associated with the time series data set; execute the machine learning logic to determine and cache one or more features for the time series data by the one or more candidate machine learning pipelines; evaluate predictions of each of the one or more candidate machine learning pipelines using at least the one or more features; and automatically generate a ranked list of machine learning pipelines from the one or more candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines. 8. The system of claim 7 , wherein the executable instructions when executed cause the system to determine a holdout data set, a test data set, and a training data set from the time series data for allocating the time series data. 9. The system of claim 7 , wherein the executable instructions when executed cause the system to evaluate the trained one or more candidate machine learning pipelines using the time series data, a hold data set, a test data set, and a training data set from the time series data. 10. The system of claim 7 , wherein the executable instructions when executed cause the system to allocate the time series data backward in time. 11. The system of claim 7 , wherein the executable instructions when executed cause the system to combine the one or more features with previously determined features for use by the one or more candidate machine learning pipelines. 12. The system of claim 7 , wherein the executable instructions when executed cause the system to cache the one or more features at a final estimator of the one or more candidate machine learning pipelines. 13. A computer program product for automated time series forecasting machine learning pipeline generation in a computing environment, the computer program product comprising: one or more computer non-transitory readable storage media, and program instructions collectively stored on the one or more non-transitory computer readable storage media, the program instructions comprising: program instructions to determine a data allocation size of time series data based on one or more characteristics of a time series data set; program instructions to execute machine learning logic to allocate the time series data to one or more candidate machine learning pipelines based on the data allocation size, wherein a sequential order of the time series data set is used while allocating the time series data based on the data allocation size; program instructions to execute the machine learning logic to train the one or more candidate machine learning pipelines using selected data associated with the time series data set; program instructions to execute the machine learning logic to determine and cache one or more features for the time series data by the one or more candidate machine learning pipelines; program instructions to evaluate predictions of each of the one or more candidate machine learning pipelines using at least the one or more features; and program instructions to automatically generate a ranked list of machine learning pipelines from the one or more candidate machine learning pipelines for time series forecasting based upon evaluating predictions of each of the one or more candidate machine learning pipelines. 14. The computer program product of claim 13 , further including program instructions to: determine a holdout data set, a test data set, and a training data set from the time series data for allocating the time series data; and evaluate the trained one or more candidate machine learning pipelines using the time series data, the hold data set, the test data set, and the training data set from the time series data. 15. The computer program product of claim 13 , further including program instructions to allocate the time series data backward in time. 16. The computer program product of claim 13 , further including program instructions to combine the one or more features with previously determined features for use by the one or more candidate machine learning pipelines. 17. The computer program product of claim 13 , further including program instructions to cache the one or more features at a final estimator of the one or more candidate machine learning pipelines.

Assignees

Inventors

Classifications

G06F12/0871Primary
Allocation or management of cache space · CPC title
G06N20/00
Machine learning · CPC title
G06F2212/604
Details relating to cache allocation · CPC title
G06N20/10Primary
using kernel methods, e.g. support vector machines [SVM] · CPC title
G06F12/0862
with prefetch · CPC title

Patent family

Related publications grouped by family.

View patent family 83510800

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11966340B2 cover?: To automate time series forecasting machine learning pipeline generation, a data allocation size of time series data may be determined based on one or more characteristics of a time series data set. The time series data may be allocated for use by candidate machine learning pipelines based on the data allocation size. Features for the time series data may be determined and cached by the candida…
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06F12/0871. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Apr 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

System and method for a personalized machine learning pipeline selection and result interpretation

Using hyperparameter predictors to improve accuracy of automatic machine learning model selection

Cognitive distributed network

Data transformation caching in an artificial intelligence infrastructure

Anomaly detection based on relationships between multiple time series

Frequently asked questions