What technology area does this patent fall under?

Primary CPC classification G16H50/30. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Dec 01 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Comparatively-refined polygenic risk score generation machine learning frameworks

US2022383982A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2022383982-A1
Application number	US-202217804416-A
Country	US
Kind code	A1
Filing date	May 27, 2022
Priority date	May 28, 2021
Publication date	Dec 1, 2022
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Various embodiments of the present invention describe techniques for generating a polygenic risk score generation machine learning framework that integrates an optimal genetic variant refinement model without requiring brute-force traversal of potential parameter spaces defined by various distinct genetic variant sets. In response, various embodiments of the present invention use holistic Bayesian sampling routines to efficiently generate Bayesian evidence numerical estimates for various genetic variant refinement models and select an optimal genetic variant refinement model accordingly. This enables enhancing the accuracy of polygenic risk score generation machine learning frameworks without resorting to computationally resource-intensive traversals of potential parameter spaces defined by various distinct genetic variant sets. In doing so, various embodiments of the present invention enhance the computational efficiency of generating a polygenic risk score generation machine learning framework that integrates an optimal genetic variant refinement model in contrast to computationally-inefficient techniques that require brute-force traversal of potential parameter spaces.

First claim

Opening claim text (preview).

1 . A computer-implemented for generating a polygenic risk score for a target phenotype using a comparatively-refined polygenic risk score generation model, the computer-implemented method comprising: identifying, using one or more processors, the comparatively-refined polygenic risk score generation machine learning framework, wherein: the comparatively-refined polygenic risk score generation machine learning framework comprises an optimal genetic variant refinement model that is selected from a plurality of defined genetic variant refinement models, each defined genetic variant refinement model: (i) is associated with: (a) a distinct per-model genetic variant set of a group of genetic variants, and (b) a per-model parameter set comprising a per-model effect weight parameter set for the distinct per-model genetic variant set that is associated with the defined genetic variant refinement model, and (ii) is configured to generate a per-model polygenic risk score based at least in part on a per-model input feature vector corresponding to the distinct per-model genetic variant set for the defined genetic variant refinement model and the per-model parameter set for the defined genetic variant refinement model, and generating the optimal genetic variant refinement model comprises: (i) for each defined genetic variant refinement model, sampling from a per-model posterior probability distribution for the defined genetic variant refinement model given target genome-wide association data for the target phenotype and by using a holistic Bayesian sampling routine that is configured to generate: (a) a per-model parameter numerical estimate set for the per-model parameter set that is associated with the defined genetic variant refinement model, and (b) a Bayesian evidence numerical estimate for the defined genetic variant refinement model, and (ii) selecting the optimal genetic variant refinement model as the defined genetic variant refinement model with an optimal Bayesian evidence numerical estimate as generated by the holistic Bayesian sampling routine, generating, using the one or more processors, the polygenic risk score based at least in part on the per-model polygenic risk score for the optimal genetic variant refinement model; and performing, using the one or more processors, one or more prediction-based actions based at least in part on the polygenic risk score. 2 . The computer-implemented method of claim 1 , wherein the holistic Bayesian sampling routine comprises a nested sampling routine. 3 . The computer-implemented method of claim 1 , wherein the holistic Bayesian sampling routine comprises a dynamic nested sampling routine. 4 . The computer-implemented method of claim 1 , wherein: the holistic Bayesian sampling routine comprises a nested sampling sub-routine and a dynamic nested sampling sub-routine, and the Bayesian evidence numerical estimate for a particular defined genetic variant refinement model is generated based at least in part on a first Bayesian evidence numerical estimate for the particular defined genetic variant refinement model as generated by the nested sampling sub-routine and a second Bayesian evidence numerical estimate for the particular defined genetic variant refinement model as generated by the dynamic nested sampling sub-routine. 5 . The computer-implemented method of claim 4 , wherein: the Bayesian evidence numerical estimate for the particular defined genetic variant refinement model is generated based at least in part on a cross-estimate weighted combination of the first Bayesian evidence numerical estimate and the second Bayesian evidence numerical estimate, and the cross-estimate weighted combination is generated based at least in part on a first historical model performance quality weight for the nested sampling routine and a second historical model performance quality weight for the dynamic nested sampling routine. 6 . The computer-implemented method of claim 1 , wherein: the comparatively-refined polygenic risk score generation machine learning framework further comprises a cross-model refinement model that is configured to generate a cross-model weighted combination of each per-model polygenic risk score for the plurality of defined genetic variant refinement models, the cross-model weighted combination is generated based at least in part on a plurality of probabilistic model quality weights for the plurality of defined genetic variant refinement models, and each probabilistic model quality weight for a respective defined genetic variant refinement model is generated based at least in part on the Bayesian evidence numerical estimate for the respective defined genetic variant refinement model as generated by the holistic Bayesian sampling routine. 7 . The computer-implemented method of claim 6 , wherein generating the polygenic risk score comprises: adopting the cross-model weighted combination as the polygenic risk score. 8 . An apparatus for generating a polygenic risk score for a target phenotype using a comparatively-refined polygenic risk score generation model, the apparatus comprising at least one processor and at least one memory including program code, the at least one memory and the program code configured to, with the processor, cause the apparatus to at least: identify the comparatively-refined polygenic risk score generation machine learning framework, wherein: the comparatively-refined polygenic risk score generation machine learning framework comprises an optimal genetic variant refinement model that is selected from a plurality of defined genetic variant refinement models, each defined genetic variant refinement model: (i) is associated with: (a) a distinct per-model genetic variant set of a group of genetic variants, and (b) a per-model parameter set comprising a per-model effect weight parameter set for the distinct per-model genetic variant set that is associated with the defined genetic variant refinement model, and (ii) is configured to generate a per-model polygenic risk score based at least in part on a per-model input feature vector corresponding to the distinct per-model genetic variant set for the defined genetic variant refinement model and the per-model parameter set for the defined genetic variant refinement model, and generating the optimal genetic variant refinement model comprises: (i) for each defined genetic variant refinement model, sampling from a per-model posterior probability distribution for the defined genetic variant refinement model given target genome-wide association data for the target phenotype and by using a holistic Bayesian sampling routine that is configured to generate: (a) a per-model parameter numerical estimate set for the per-model parameter set that is associated with the defined genetic variant refinement model, and (b) a Bayesian evidence numerical estimate for the defined genetic variant refinement model, and (ii) selecting the optimal genetic variant refinement model as the defined genetic variant refinement model with an optimal Bayesian evidence numerical estimate as generated by the holistic Bayesian sampling routine, generate the polygenic risk score based at least in part on the per-model polygenic risk score for the optimal genetic variant refinement model; and perform one or more prediction-based actions based at least in part on the polygenic risk score. 9 . The apparatus of claim 8 , wherein the holistic Bayesian sampling routine comprises a nested sampling routine. 10 . The apparatus of claim 8 , wherein the holistic Bayesian sampling routine comprises a dynamic nested sampling routine. 11 . The apparatus of claim 8 , wherein: the holistic Bayesian sampling routine comprises

Assignees

Optum Services Ireland Ltd

Inventors

Classifications

G16H50/70
for mining of medical data, e.g. analysing previous cases of other patients · CPC title
G16H50/30Primary
for calculating health indices; for individual health risk assessment · CPC title
G16B40/20
Supervised data analysis · CPC title
G16B20/20
Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection · CPC title
G16B40/00Primary
ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding · CPC title

Patent family

Related publications grouped by family.

View patent family 84194366

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022383982A1 cover?: Various embodiments of the present invention describe techniques for generating a polygenic risk score generation machine learning framework that integrates an optimal genetic variant refinement model without requiring brute-force traversal of potential parameter spaces defined by various distinct genetic variant sets. In response, various embodiments of the present invention use holistic Bayes…
Who is the assignee on this patent?: Optum Services Ireland Ltd
What technology area does this patent fall under?: Primary CPC classification G16H50/30. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Dec 01 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Machine Learning Platform for Polygenic Models

Molecular breeding methods

Diagnostic and therapeutic methods for cancer

Bayesian Approach For Tumor Forecasting

Phenotype trait prediction with threshold polygenic risk score

Cancer polygenic risk score

Frequently asked questions