Online optimal control under constraints

US12554247B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12554247-B2
Application numberUS-202117386589-A
CountryUS
Kind codeB2
Filing dateJul 28, 2021
Priority dateJul 28, 2021
Publication dateFeb 17, 2026
Grant dateFeb 17, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present invention provide computer-implemented methods, computer program products and computer systems. Embodiments of the present invention can identify a plurality of constraints on states of data and actions of data associated with a data model. Embodiments of the present invention can then identify constraints on safety policy parameters associated with a computing device. Embodiments of the present invention can then convert the identified constraints into a uniform domain syntax that considers coupled and decoupled constraints and introduce buffer data within the converted constraints, wherein the buffer data filters outlier constraints within the plurality of constraints. Embodiments of the present invention can then dynamically generate optimal safety policies associated with the computing device based on the remaining constraints.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method comprising: identifying a plurality of constraints on states of data and actions of data associated with a data model; identifying constraints on safety policy parameters associated with a computing device; converting the identified constraints into a uniform domain syntax that considers coupled and decoupled constraints; introducing buffer data within the converted constraints, wherein the buffer data filters outlier constraints within the plurality of constraints; and dynamically generating optimal control policies and optimal safety policies associated with the computing device based on the remaining constraints, wherein the optimal control policies simultaneously minimizes the sum of adversarial varying costs and satisfy constraints despite disturbances by computing a feasibly policy class, computing a safety policy class, and utilizing an optimization tool. 2 . The computer-implemented method of claim 1 , wherein identifying the plurality of constraints on states of data and actions of data comprises: performing a query on an external data source for optimal performance for each state of data; storing at least one result of the performed query within a database on the computing device; and introducing approximate states and actions that correspond with at least one historical policy associated with the state and action of the data. 3 . The computer-implemented method of claim 1 , wherein identifying constraints on safety policy parameters comprises receiving preferences of a user associated with the safety policy parameters encompassing the computing device. 4 . The computer-implemented method of claim 1 , wherein converting the identified constraints comprises: using the identified constraints as input data; generating a safety policy configuration via a safety translation algorithm; and imposing constraints on the approximate states and actions of data that are dynamically translated as constraints on the at least one historical policy using the safety translation algorithm and a policy configuration data model. 5 . The computer-implemented method of claim 1 , wherein introducing buffer data comprises: adding buffer data to the generated safety policy configuration, wherein the buffer data filters outlier constraints within the plurality of constraints and reduce the range of the identified constraints associated with the state and action of data and the safety policy parameters of the computing device. 6 . The computer-implemented method of claim 1 , wherein dynamically generating optimal safety policies comprises: analyzing the generated safety policy configuration; generating a parallel subroutine to the generated safety policy configuration; predicting the cost output of the estimated data model; predicting the utility output of the estimated data model; and validating that the generated safety policy configuration does not meet or exceed the upper bounds of the identified constraints associated with the computing device. 7 . The computer-implemented method of claim 1 , further comprising: identifying the constraints associated with the approximate state and action data; identifying a temporal-coupled constraints associated with the policy parameters of the computing device; and determining the variations associated with the temporal-decoupled online policies. 8 . A computer program product comprising: one or more computer readable storage media and program instructions stored on the one or more computer readable storage media, the program instructions comprising: program instructions to identify a plurality of constraints on states of data and actions of data associated with a data model; program instructions to identify constraints on safety policy parameters associated with a computing device; program instructions to convert the identified constraints into a uniform domain syntax that considers coupled and decoupled constraints; program instructions to introduce buffer data within the converted constraints, wherein the buffer data filters outlier constraints within the plurality of constraints; and program instructions to dynamically generate optimal control policies and optimal safety policies associated with the computing device based on the remaining constraints, wherein the optimal control policies simultaneously minimizes the sum of adversarial varying costs and satisfy constraints despite disturbances by computing a feasibly policy class, computing a safety policy class, and utilizing an optimization tool. 9 . The computer program product of claim 8 , wherein the program instructions to identify the plurality of constraints on states of data and actions of data comprise: program instructions to perform a query on an external data source for optimal performance for each state of data; program instructions to store at least one result of the performed query within a database on the computing device; and program instructions to introduce approximate states and actions that correspond with at least one historical policy associated with the state and action of the data. 10 . The computer program product of claim 8 , wherein the program instructions to identify constraints on safety policy parameters comprise: program instructions to receive preferences of a user associated with the safety policy parameters encompassing the computing device. 11 . The computer program product of claim 8 , wherein the program instructions to convert the identified constraints comprises: program instructions to use the identified constraints as input data; program instructions to generate a safety policy configuration via a safety translation algorithm; and program instructions to impose constraints on the approximate states and actions of data that are dynamically translated as constraints on the at least one historical policy using the safety translation algorithm and a policy configuration data model. 12 . The computer program product of claim 8 , wherein the program instructions to introduce buffer data comprises: program instructions to add buffer data to the generated safety policy configuration, wherein the buffer data filters outlier constraints within the plurality of constraints and reduce the range of the identified constraints associated with the state and action of data and the safety policy parameters of the computing device. 13 . The computer program product of claim 8 , wherein the program instructions to dynamically generate optimal safety policies comprise: program instructions to analyze the generated safety policy configuration; program instructions to generate a parallel subroutine to the generated safety policy configuration; program instructions to predict the cost output of the estimated data model; program instructions to predict the utility output of the estimated data model; and program instructions to validate that the generated safety policy configuration does not meet or exceed the upper bounds of the identified constraints associated with the computing device. 14 . The computer-implemented method of claim 8 , wherein the program instructions stored on the one or more computer readable storage media further comprise: program instructions to identify the constraints associated with the approximate state and action data; program instructions to identify a temporal-coupled constraints associated with the policy parameters of the computing device; and program instructions to determine the variations associated with the temporal-decoupled online policies.

Assignees

Inventors

Classifications

  • Machine learning · CPC title

  • Computing arrangements using knowledge-based models · CPC title

  • Querying (for retrieval from the web G06F16/953) · CPC title

  • Temperature · CPC title

  • HVAC, heating, ventillation, climate control · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12554247B2 cover?
Embodiments of the present invention provide computer-implemented methods, computer program products and computer systems. Embodiments of the present invention can identify a plurality of constraints on states of data and actions of data associated with a data model. Embodiments of the present invention can then identify constraints on safety policy parameters associated with a computing device…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G05B19/4155. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 17 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).