Probability density ratio estimation

US11093584B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11093584-B2
Application numberUS-201715797051-A
CountryUS
Kind codeB2
Filing dateOct 30, 2017
Priority dateMar 7, 2017
Publication dateAug 17, 2021
Grant dateAug 17, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Probability density ratios may be estimated by a method including obtaining a first sample set including a plurality of first samples and a second sample set including a plurality of second samples, wherein each of the first samples and the second samples is represented as a vector including a plurality of parameters, constructing at least one decision tree estimating a ratio of probability density p(x)/q(x) based on the first sample set and the second sample set, wherein p(x) is a probability density of the first samples corresponding to an input vector x and q(x) is a probability density of the second samples corresponding to the input vector x.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: obtaining a first sample set including a plurality of first samples and a second sample set including a plurality of second samples, wherein each of the first samples and the second samples is represented as a vector including a plurality of parameters, constructing at least one decision tree for estimating a ratio of probability density p(x)/q(x) based on the first sample set and the second sample set, wherein p(x) is a probability density of the first samples corresponding to an input vector x and q(x) is a probability density of the second samples corresponding to the input vector x such that computational resources needed for estimating the ratio of the probability density is reduced without impairing accuracy of estimation, the constructing at least one decision tree including using a single branching condition existing at the particular leaf node to generate a plurality of three or more new nodes therefrom and determining a parameter among the plurality of parameters and a threshold for a target node of the at least one decision tree that minimize an output of an objective function corresponding to a determined branching condition, the branching condition being determined using a greedy search. 2. The method of claim 1 , wherein the constructing at least one decision tree includes constructing a plurality of decision trees, each decision tree outputting a corresponding ratio of probability density p(x)/q(x). 3. The method of claim 2 , further comprising: sampling one or more first samples from the plurality of first samples to generate a first subset of the first sample set, and sampling one or more second samples from the plurality of second samples to generate a second subset of the second sample set, wherein at least one of the plurality of decision trees outputs a ratio of probability density p(x)/q(x) based on the first subset and the second subset. 4. The method of claim 3 , wherein the sampling the one or more first samples and the sampling the one or more second samples is performed based on a bootstrap algorithm. 5. The method of claim 1 , wherein the decision tree outputs a ratio of a rate of the first samples allocated to a leaf node of the decision tree and a rate of the second samples allocated to the leaf node of the decision tree. 6. The method of claim 1 , wherein the constructing at least one decision tree includes: determining a branching condition at a target node of the at least one decision tree, and branching the target node according to the branching condition to generate a plurality of new nodes by dividing the first samples of the target node according to the branching condition, and dividing the second samples of the target node according to the branching condition. 7. The method of claim 6 , wherein the objective function approximates a squared error of an estimation of the ratio of probability density. 8. The method of claim 7 , wherein the output of the objective function is positively correlated with the output of the probability density q(x). 9. The method of claim 8 , wherein the objective function is based on: rates of the first samples at nodes branched from the target node according to the branching condition to the first samples of the decision tree and, rates of the second samples at nodes branched from the target node according to the branching condition to the second samples of the decision tree. 10. The method of claim 9 , wherein the objective function is at least partially represented by: 1 M l ⁢ ∑ m = 1 M l ⁢ ⁢ r ^ ⁡ ( x q ( m ) ) 2 - 2 ⁢ 1 N l ⁢ ∑ n = 1 N l ⁢ ⁢ r ^ ⁡ ( x p ( n ) ) where N 1 is a number of the first samples at the target node of the decision tree, M 1 is a number of the second samples at the target node of the decision tree, x p (n) represents a vector corresponding to (n)-th sample in the N 1 first samples, x q (m) represents a vector corresponding to (m)-th sample in the M 1 second samples, and r{circumflex over ( )}(x) represents a density ratio function inputting the vector x and outputting the ratio of probability density at the target node corresponding to the vector x based on: the rates of the first samples at the nodes branched from the target nodes according to the branching condition to the first samples of the decision tree and, the rates of the second samples at the nodes branched from the target node according to the branching condition to the second samples of the decision tree. 11. The method of claim 6 , wherein the determining a branching condition includes determining a parameter and a threshold for the target node randomly. 12. The method of claim 2 , further comprising: obtaining a target vector x T , inputting the target vector x T into the plurality of decision trees, obtaining a plurality of outputs from the plurality of decision trees, and estimating a ratio of probability density p(x T )/q(x T ) based on the plurality of outputs from the plurality of decision trees. 13. The method of claim 12 , wherein the estimating the ratio of probability density p(x T )/q(x T ) includes calculating an average of the plurality of outputs from the plurality of decision trees. 14. A computer-implemented meth

Assignees

Inventors

Classifications

  • Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title

  • Ensemble learning · CPC title

  • G06F17/18Primary

    for evaluating statistical data {, e.g. average values, frequency distributions, probability functions, regression analysis (forecasting specially adapted for a specific administrative, business or logistic context G06Q10/04)} · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11093584B2 cover?
Probability density ratios may be estimated by a method including obtaining a first sample set including a plurality of first samples and a second sample set including a plurality of second samples, wherein each of the first samples and the second samples is represented as a vector including a plurality of parameters, constructing at least one decision tree estimating a ratio of probability den…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F17/18. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 17 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).