Decision-making under selective labels

US12586683B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12586683-B2
Application numberUS-202117381141-A
CountryUS
Kind codeB2
Filing dateJul 20, 2021
Priority dateJul 20, 2021
Publication dateMar 24, 2026
Grant dateMar 24, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method of decision-making using selective labels, includes receiving a conditional success probability value of a feature associated with an entity. A confidence value of the received success probability value is received. A parameter value that is a trade-off between a short-term learning and a long-term utility is selected. A decision is rendered to accept or reject the feature associated with the entity according to a machine learning policy.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method of automatic decision-making using selective labels, the method comprising: receiving, at a success probability model of a computing device, a feature vector associated with an entity and generating, via the success probability model analyzing the feature vector, a conditional success probability value representing a likelihood of success of a feature associated with the entity if the entity is accepted; receiving, at a confidence model of the computing device, a sample size associated with prior accepted entities having similar features and generating, via the confidence model analyzing the sample size, a confidence value associated with the conditional success probability value; determining, at a discount factor model of the computing device, a tuning parameter value that is <1 and that represents a trade-off between a short-term learning cost and a long-term utility; rendering a decision, using a policy model of the computing device, to accept the entity based on a threshold function of the conditional success probability value, the confidence value, and the tuning parameter value; and iteratively updating the success probability model and the confidence model using an observed outcome that results from accepting the entity. 2 . The computer-implemented method of claim 1 , wherein: the machine learning policy for rendering the decision comprises an optimal homogeneous policy; and the feature associated with the entity is non-distinguishable from a population of other entities. 3 . The computer-implemented method of claim 1 , wherein the machine learning policy for rendering the decision comprises a homogeneous policy for rendering the decision to accept based on the feature associated with the entity. 4 . The computer-implemented method of claim 1 , wherein the machine learning policy for rendering the decision comprises a finite-domain case policy used for rendering the decision to accept based on the feature associated with the entity. 5 . The computer-implemented method of claim 1 , wherein the machine learning policy for rendering the decision comprises an infinite-domain case policy used for rendering the decision to accept the feature associated with the entity. 6 . The computer-implemented method of claim 1 , further comprising updating the policy model based on the observed outcome that results from accepting the entity. 7 . The computer-implemented method of claim 1 , further comprising training a machine learning model to render the decision to accept the feature associated with the entity. 8 . The computer-implemented method of claim 1 , wherein the entity includes multiple features, and the computer-implemented method further comprises training a machine learning model to render the decision to accept based on two or more of the multiple features associated with the entity. 9 . The computer-implemented method of claim 1 , wherein the machine learning policy is based on the conditional success probability value and the confidence value. 10 . A computing device configured for decision-making using selective labels, the computing device comprising: a processor, a memory coupled to the processor, the memory storing instructions to cause the processor to perform acts comprising: receiving, at a success probability model of a computing device, a feature vector associated with an entity and generating, via the success probability model analyzing the feature vector, a conditional success probability value representing a likelihood of success of a feature associated with the entity if the entity is accepted; receiving, at a confidence model of the computing device, a sample size associated with prior accepted entities having similar features and generating, via the confidence model analyzing the sample size, a confidence value associated with the conditional success probability value; determining, at a discount factor model of the computing device, a tuning parameter value that is <1 and that represents a trade-off between a short-term learning cost and a long-term utility; rendering a decision, using a policy model of the computing device, to accept the entity based on a threshold function of the conditional success probability value, the confidence value, and the tuning parameter value; and iteratively updating the success probability model and the confidence model using an observed outcome that results from accepting the entity. 11 . The computing device of claim 10 , wherein the instructions cause the processor to perform an additional act of training a machine learning model to render the decision to accept the feature associated with the entity. 12 . The computing device of claim 10 , wherein the instructions cause the processor to perform an additional act of training a machine learning model to render the decision to accept based on two or more of the features associated with the entity. 13 . The computing device of claim 10 , wherein the instructions cause the processor to perform an additional act of rendering the decision to accept based on an optimal homogeneous policy. 14 . A computing device configured to perform decision-making using selective labels, the computing device comprising: one or more processors including a dialog processor configured to process extracted text from a plurality of participants of a collaborative query; a memory coupled to the one or more processors; a plurality of models configured in the one or more processors, the plurality of models comprising: a success probability model configured to provide a conditional success probability value representing an empirical success rate of a feature associated with an entity; a confidence model configured to provide a confidence value of the conditional success probability value, based at least on a sample size of entities; a discount factor model configured to determine a tuning parameter value that is <1 and that represents a trade-off between a short-term learning cost and a long-term utility; and a policy model configured to render a decision to accept the feature associated with the entity based on a threshold function of the conditional success probability value, the confidence value, and the tuning parameter value; and iteratively update the success probability model and the confidence model with a resultant outcome when the decision is to accept. 15 . The computing device according to claim 14 , wherein the success probability model and the confidence model each comprise a predictive model. 16 . The computing device according to claim 14 , wherein the success probability model is configured for automatically rendering decisions for dispensing a requested pharmaceutical or biological treatment. 17 . The computing device according to claim 14 , wherein the success probability model is configured for rendering decisions regarding suspending an operating license.

Assignees

Inventors

Classifications

  • delivered from dispensers · CPC title

  • Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence · CPC title

  • Machine learning · CPC title

  • for remote operation · CPC title

  • for mining of medical data, e.g. analysing previous cases of other patients · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12586683B2 cover?
A computer-implemented method of decision-making using selective labels, includes receiving a conditional success probability value of a feature associated with an entity. A confidence value of the received success probability value is received. A parameter value that is a trade-off between a short-term learning and a long-term utility is selected. A decision is rendered to accept or reject the…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G16H50/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 24 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).