Hierarchy driven time series forecasting

US2024394522A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024394522-A1
Application numberUS-202318200642-A
CountryUS
Kind codeA1
Filing dateMay 23, 2023
Priority dateMay 23, 2023
Publication dateNov 28, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for lightweight and efficient long sequence time-series forecasting and representation learning includes segmenting a time-series dataset from a plurality of sensors into a plurality of patches. The method further includes applying gated multilayer perceptron (MLP) mixing across different directions of the patched input time-series. The method further includes capturing local and global and interrelated correlations across the plurality of patches and within the plurality of patches. The method further includes applying a patch-time aggregated hierarchy to guide lowest-level predictions based on aggregated hierarchy signals at a patch-level. The method further includes chaining MLP-mixers in a patch length context aware hierarchy fashion to enhance time-series short and long-term correlation capture.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method comprising: segmenting a time-series dataset from a plurality of sensors into a plurality of patches; applying gated multilayer perceptron (MLP) mixing across different directions of the patched input time-series; capturing local and global and interrelated correlations across the plurality of patches and within the plurality of patches; and applying a patch-time aggregated hierarchy to guide lowest-level predictions based on aggregated hierarchy signals at a patch-level. 2 . The computer-implemented method of claim 1 , wherein the MLP mixing is channel independent. 3 . The computer-implemented method of claim 1 , wherein the MLP mixing uses layers that are stacked in linear fashion. 4 . The computer-implemented method of claim 1 , wherein the MLP mixing uses layers that are chained in a patch length context aware hierarchy fashion. 5 . The computer-implemented method of claim 1 , wherein the MLP mixing is mixed with respect to patches and features. 6 . The computer-implemented method of claim 1 , further comprising a pretraining task of masking random patches. 7 . The computer-implemented method of claim 4 , further comprising reconstructing the masked random patches. 8 . The computer-implemented method of claim 1 , further comprising a downstream task of forecasting values of the sensors. 9 . The computer-implemented method of claim 1 , further comprising a downstream task of executing regression analysis regarding values of the sensors. 10 . The computer-implemented method of claim 1 , further comprising a downstream task of classifying values of the sensors into one of a variety of predetermined classifications. 11 . A system comprising: a processor; and a memory in communication with the processor, the memory containing instructions that, when executed by the processor, cause the processor to: segment a time-series dataset from a plurality of sensors into a plurality of patches; apply gated multilayer perceptron (MLP) mixing across different directions of the patched input time-series; capture local and global and interrelated correlations across the plurality of patches and within the plurality of patches; and apply a patch-time aggregated hierarchy to guide lowest-level predictions based on aggregated hierarchy signals at a patch-level. 12 . The system of claim 11 , wherein MLP the mixing is channel independent. 13 . The system of claim 11 , wherein the MLP mixing is mixed with respect to patches and features. 14 . The system of claim 11 , wherein the MLP mixing uses layers that are either stacked in linear fashion or chained in a patch length context aware hierarchy fashion. 15 . The system of claim 11 , the memory containing additional instructions that, when executed by the processor, cause the processor to execute a pretraining task of masking random patches. 16 . The system of claim 15 , the memory containing additional instructions that, when executed by the processor, cause the processor to reconstruct the masked random patches. 17 . The system of claim 11 , the memory containing additional instructions that, when executed by the processor, cause the processor to execute a downstream task of forecasting values of the sensors. 18 . The system of claim 11 , the memory containing additional instructions that, when executed by the processor, cause the processor to execute a downstream task of executing regression analysis regarding values of the sensors. 19 . The system of claim 11 , the memory containing additional instructions that, when executed by the processor, cause the processor to execute a downstream task of classifying values of the sensors into one of a variety of predetermined classifications 20 . A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to: segment a time-series dataset from a plurality of sensors into a plurality of patches; apply gated multilayer perceptron (MLP) mixing across different directions of the patched input time-series; capture local and global and interrelated correlations across the plurality of patches and within the plurality of patches; and apply a patch-time aggregated hierarchy to guide lowest-level predictions based on aggregated hierarchy signals at a patch-level.

Assignees

Inventors

Classifications

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Combinations of networks · CPC title

  • G06N3/0499Primary

    Feedforward networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024394522A1 cover?
A method for lightweight and efficient long sequence time-series forecasting and representation learning includes segmenting a time-series dataset from a plurality of sensors into a plurality of patches. The method further includes applying gated multilayer perceptron (MLP) mixing across different directions of the patched input time-series. The method further includes capturing local and globa…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N3/0499. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 28 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).