Adaptive filtering and modeling via adaptive experimental designs to identify emerging data patterns from large volume, high dimensional, high velocity streaming data

US11443206B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11443206-B2
Application number  US-202016751051-A
Country             US
Kind code           B2
Filing date         Jan 23, 2020
Priority date       Mar 23, 2015
Publication date    Sep 13, 2022
Grant date          Sep 13, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system for identifying information in high dimensional, low latency streaming data having dynamically evolving data patterns. The system processes, continuously and in real-time, the streaming data. Processing includes filtering the data based on event data to identify diagnostic data points by comparing the event data with an experimental design matrix and performing a modeling operation using the identified diagnostic data points in order to identify efficiently any current and emerging patterns of relationships between at least one outcome variable and predictor variables. The at least one a-priori, pre-designed experimental design matrix is generated based on combinations of the predictor variables and at least one outcome variable. The experimental design matrix is also generated based on at least one of main effects, limitations, constraints, and interaction effects of the predictor variables and combinations.
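
The filter-then-model loop described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the two-level full factorial design in coded units (-1/+1), the Euclidean distance tolerance `tol`, the rule that a newer event replaces an older one at the same design row, and the function names `two_level_design`, `filter_stream`, and `main_effects`. The modeling step is reduced to a classical main-effects estimate (mean outcome at the high level minus mean outcome at the low level).

```python
import itertools
import math

def two_level_design(n_predictors):
    """Hypothetical a-priori design: every combination of -1 and +1 (coded units)."""
    return list(itertools.product((-1.0, 1.0), repeat=n_predictors))

def filter_stream(events, design, tol):
    """Keep, for each design row, the latest event within `tol` of that row.

    `events` is an iterable of (predictor_tuple, outcome) pairs.
    Events that are not close to any design row are discarded.
    """
    selected = {}
    for x, y in events:
        dists = [math.dist(x, row) for row in design]
        i = min(range(len(design)), key=dists.__getitem__)
        if dists[i] <= tol:
            selected[i] = (x, y)  # assumption: a newer event replaces an older one
    return selected

def main_effects(selected, design):
    """Main effect per predictor: mean outcome at +1 minus mean outcome at -1."""
    effects = []
    for j in range(len(design[0])):
        hi = [y for i, (_, y) in selected.items() if design[i][j] > 0]
        lo = [y for i, (_, y) in selected.items() if design[i][j] < 0]
        if not hi or not lo:
            effects.append(None)  # not enough diagnostic points yet
        else:
            effects.append(sum(hi) / len(hi) - sum(lo) / len(lo))
    return effects

# Toy stream generated from outcome = 3*x1 - 2*x2 (made-up data).
design = two_level_design(2)
stream = [
    ((-1.0, -1.0), -1.0),
    ((0.1, 0.2), 0.0),    # far from every design row: filtered out
    ((1.0, -1.0), 5.0),
    ((-1.0, 1.0), -5.0),
    ((0.5, 0.4), 0.7),    # filtered out
    ((1.0, 1.0), 1.0),
]
diagnostic = filter_stream(stream, design, tol=0.25)
effects = main_effects(diagnostic, design)
```

Only the events that land near a design row reach the modeling step, so the cost of modeling depends on the size of the design rather than on the volume of the stream. A production system would likely use a richer model and a windowing policy; this sketch only shows the data flow.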

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implementable method for identifying information in high dimensional data streams having dynamically evolving data patterns, the method comprising: loading at least one a-priori, pre-designed experimental design matrix and at least one modeling operation into memory; processing streaming data continuously, wherein processing comprises: filtering the streaming data based on event data to identify diagnostic data points by comparing the event data with the at least one a-priori, pre-designed experimental design matrix; and performing the modeling operation using the identified diagnostic data points to identify current and emerging patterns of relationships between at least one outcome variable and predictor variables, wherein the at least one a-priori, pre-designed experimental design matrix is generated based on combinations of the predictor variables, wherein the combinations are based on an outcome variable, wherein the at least one a-priori, pre-designed experimental design matrix is generated further based on at least one of: main effects of predictor variable values; limitations of the combinations of predictor variable values; constraints of the combination of predictor variable values; and interaction effects between selected predictor variables.

2. The computer-implementable method of claim 1 wherein the modeling operation is one of a prediction modeling operation and a clustering modeling operation.

3. The computer-implemented method of claim 1 wherein limitations of the predictor variable values are determined based on range values for the predictor variables, wherein the predictor variable values are continuous predictor variables.

4. The computer-implemented method of claim 1 wherein constraints of the combination of predictor variable values are based on a region of interest, wherein the predictor variable values are discrete predictor variable values.

5. The computer-implemented method of claim 1 wherein the at least one a-priori, pre-designed experimental design matrix is generated based on one of a space-filling design and an optimal experimental design.

6. The computer-implemented method of claim 1 wherein processing further comprises dynamically updating a visualization time window of the streaming data.

7. A system for identifying information in high dimensional data streams having dynamically evolving data patterns, the system comprises: one or more processors; a memory coupled to the one or more computer processors and comprising instructions, which when performed by the one or more computer processors, cause the one or more processors to perform operations to: load at least one a-priori, pre-designed experimental design matrix and at least one modeling operation into memory; filter streaming data based on event data continuously to identify diagnostic data points by comparing the event data with the at least one a-priori, pre-designed experimental design matrix; and perform the modeling operation using the identified diagnostic data points to identify current and emerging patterns of relationships between at least one outcome variable and predictor variables, wherein the at least one a-priori, pre-designed experimental design matrix is generated based on combinations of the predictor variables, wherein the combinations are based on an outcome variable, wherein the at least one a-priori, pre-designed experimental design matrix is generated further based on at least one of: main effects of predictor variable values; limitations of the combinations of predictor variable values; constraints of the combination of predictor variable values; and interaction effects between selected predictor variables.

8. The system of claim 7 wherein the modeling operation is one of a prediction modeling operation and a clustering modeling operation.

9. The system of claim 7 wherein limitations of the predictor variable values are determined based on range values for the predictor variables, wherein the predictor variable values are continuous predictor variables.

10. The system of claim 7 wherein constraints of the combination of predictor variable values are based on a region of interest, wherein the predictor variable values are discrete predictor variable values.

11. The system of claim 7 wherein the at least one a-priori, pre-designed experimental design matrix is generated based on one of a space-filling design and an optimal experimental design.

12. The system of claim 7 wherein the instructions further cause the at least one processor to perform operations to dynamically update a visualization time window of the streaming data.

13. At least one non-transitory computer readable medium comprising instructions for identifying information in high dimensional streaming data having dynamically evolving data patterns, when executed by at least one processor, cause the at least one processor to perform operations to: load at least one a-priori, pre-designed experimental design matrix and at least one modeling operation into memory; filter, continuously and real-time, streaming data based on event data to identify diagnostic data points by comparing the event data with the at least one a-priori, pre-designed experimental design matrix; and perform, continuously and in real-time, the modeling operation using the identified diagnostic data points to identify current and emerging patterns of relationships between at least one outcome variable and predictor variables, wherein the at least one a-priori, pre-designed experimental design matrix is generated based on combinations of the predictor variables, wherein the combinations are based on an outcome variable, wherein the at least one a-priori, pre-designed experimental design matrix is generated further based on at least one of: main effects of predictor variable values; limitations of the combinations of predictor variable values; constraints of the combination of predictor variable values; and interaction effects between selected predictor variables.

14. The at least one non-transitory computer readable medium of claim 13 wherein the modeling operation is one of a prediction modeling operation and a clustering modeling operation.
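
The claims mention a design matrix built from a space-filling design, with limitations given by range values and constraints given by a region of interest. One way this could look is sketched below, under stated assumptions: a Latin hypercube is used as the space-filling design (the claims do not name a specific one), the predictor ranges and the region-of-interest rule are made-up examples, and `latin_hypercube` and `apply_constraint` are hypothetical helper names. The claims tie constraints to discrete predictor values; the predicate here is applied to continuous rows purely to keep the example short.

```python
import random

def latin_hypercube(ranges, n_rows, seed=0):
    """Space-filling design: each predictor range is cut into n_rows equal
    strata, and every stratum is sampled exactly once per predictor."""
    rng = random.Random(seed)
    columns = []
    for low, high in ranges:
        width = (high - low) / n_rows
        # one random point inside each stratum, then shuffle the row order
        col = [low + (k + rng.random()) * width for k in range(n_rows)]
        rng.shuffle(col)
        columns.append(col)
    return [tuple(col[i] for col in columns) for i in range(n_rows)]

def apply_constraint(design, allowed):
    """Drop design rows that fall outside the region of interest."""
    return [row for row in design if allowed(row)]

# Hypothetical predictors: temperature in [20, 80], pressure in [1, 5].
ranges = [(20.0, 80.0), (1.0, 5.0)]
design = latin_hypercube(ranges, n_rows=6)

# Hypothetical region of interest: exclude hot, high-pressure combinations.
def in_region(row):
    temperature, pressure = row
    return temperature + 10.0 * pressure <= 100.0

constrained = apply_constraint(design, in_region)
```

The resulting rows could then serve as the reference points against which incoming events are compared. An optimal design (for example D-optimal) would be an alternative under the same claim language and is not shown here.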

Assignees

Tibco Software Inc

Inventors

Classifications

  • Sequence data queries, e.g. querying versioned data

  • Visual data mining; Browsing structured data

  • Clustering or classification

  • Machine learning

  • Filtering based on additional data, e.g. user or group profiles

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11443206B2 cover?
A system for identifying information in high dimensional, low latency streaming data having dynamically evolving data patterns. The system processes, continuously and in real-time, the streaming data. Processing includes filtering the data based on event data to identify diagnostic data points by comparing the event data with an experimental design matrix and performing a modeling operation usi…
Who is the assignee on this patent?
Tibco Software Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/24568. Mapped technology areas include Physics.
When was this patent published?
Publication date: Sep 13, 2022 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).