Feature selection and hyperparameter optimization using LDS

US12547930B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12547930-B2
Application numberUS-202117408650-A
CountryUS
Kind codeB2
Filing dateAug 23, 2021
Priority dateAug 23, 2021
Publication dateFeb 10, 2026
Grant dateFeb 10, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method for simultaneous feature selection and hyperparameter optimization of non-linear models of machine learning is provided including setting a first solution having first hyperparameters and a first set of features of a plurality of features of a training data set, initializing a weight table providing a score for each feature of the first set of features, and initializing a discrepancy. The method further includes performing a limited discrepancy search (LDS), according to an order based on the weight table, to obtain a second solution having second hyperparameters and a second set of features by swapping the first set of features and switching the first hyperparameters from the first solution with the discrepancy, while updating the weight table during LDS, comparing the second solution with the first solution, and obtaining a new solution with improved features and hyperparameters, as an optimized solution.

First claim

Opening claim text (preview).

The invention claimed is: 1 . A computer-implemented method for simultaneous feature selection and hyperparameter optimization of non-linear models of machine learning, the method comprising: setting a first solution having first hyperparameters and a first set of features selected from a plurality of features of a training data set; initializing a weight table providing a score for each feature of the first set of features; initializing a discrepancy; evaluating and screening chemical structures according to the first hyperparameters and the first set of features to identify the chemical structure that satisfies a set of target properties; performing a limited discrepancy search (LDS), according to an order based on the weight table, to obtain a second solution having second hyperparameters and a second set of features from the plurality of features by swapping the first set of features and switching the first hyperparameters from the first solution with the discrepancy, while updating the weight table during LDS; comparing the second solution with the first solution; obtaining a new solution with improved features and improved hyperparameters, as an optimized solution; and selecting an optimized chemical structure with predicted properties that satisfy the target properties. 2 . The computer-implemented method of claim 1 , wherein, if the second solution is better than the first solution, updating the first solution with the second solution and recurring the LDS up to given maximum discrepancy. 3 . The computer-implemented method of claim 1 , wherein, if the second solution is not better than the first solution, incrementing the discrepancy and recurring the LDS up to the given maximum discrepancy. 4 . The computer-implemented method of claim 1 , wherein the discrepancy is a maximum number of modified features and hyperparameters. 5 . The computer-implemented method of claim 1 , wherein the updating of the weight table during LDS is enabled by a weighted sum of a number of visits to each feature of the plurality of features and a number of successes for improving an objective value. 6 . The computer-implemented method of claim 5 , wherein the weighted sum is given by: T ( x )= w 1 ·v ( x )+ w 2 ·u ( x ), where x is a variable, w 1 and w 2 are constants, and v(x) and u(x) are the number of visits of successful improvements. 7 . The computer-implemented method of claim 1 , wherein the features of the plurality of features are ordered in an ascending order. 8 . A computer program product for simultaneous feature selection and hyperparameter optimization of non-linear models of machine learning, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to: setting a first solution having first hyperparameters and a first set of features selected from a plurality of features of a training data set, to identify a chemical structure satisfying target properties; initialize a weight table providing a score for each feature of the first set of features; initialize a discrepancy; evaluate and screen chemical structures according to the first hyperparameters and the first set of features to identify the chemical structure that satisfies a set of target properties; perform a limited discrepancy search (LDS), according to an order based on the weight table, to obtain a second solution having second hyperparameters and a second set of features from the plurality of features by swapping the first set of features and switching the first hyperparameters from the first solution with the discrepancy, while updating the weight table during LDS; compare the second solution with the first solution; obtaining a new solution with improved features and improved hyperparameters, as an optimized solution; and selecting an optimized chemical structure with predicted properties that satisfy the target properties. 9 . The computer program product of claim 8 , wherein, if the second solution is better than the first solution, updating the first solution with the second solution and recurring the LDS up to given maximum discrepancy. 10 . The computer program product of claim 8 , wherein, if the second solution is not better than the first solution, incrementing the discrepancy and recurring the LDS up to the given maximum discrepancy. 11 . The computer program product of claim 8 , wherein the discrepancy is a maximum number of modified features and hyperparameters. 12 . The computer program product of claim 8 , wherein the updating of the weight table during LDS is enabled by a weighted sum of a number of visits to each feature of the plurality of features and a number of successes for improving an objective value. 13 . The computer program product of claim 12 , wherein the weighted sum is given by: T ( x )= w 1 ·( x )+ w 2 ·u ( x ), where x is a variable, w 1 and w 2 are constants, and v(x) and u(x) are the number of visits of successful improvements. 14 . The computer program product of claim 8 , wherein the features of the plurality of features are ordered in an ascending order. 15 . A system for simultaneous feature selection and hyperparameter optimization of non-linear models of machine learning, the system comprising: a memory; and one or more processors in communication with the memory configured to: set a first solution having first hyperparameters and a first set of features selected from a plurality of features of a training data set, to identify a chemical structure satisfying target properties; initialize a weight table providing a score for each feature of the first set of features; initialize a discrepancy; evaluate and screen chemical structures according to the first hyperparameters and the first set of features to identify the chemical structure that satisfies a set of target properties: perform a limited discrepancy search (LDS), according to an order based on the weight table, to obtain a second solution having second hyperparameters and a second set of features from the plurality of features by swapping the first set of features and switching the first hyperparameters from the first solution with the discrepancy, while updating the weight table during LDS; compare the second solution with the first solution; obtaining a new solution with improved features and improved hyperparameters, as an optimized solution; and select an optimized chemical structure with predicted properties that satisfy the target properties. 16 . The system of claim 15 , wherein, if the second solution is better than the first solution, updating the first solution with the second solution and recurring the LDS up to given maximum discrepancy. 17 . The system of claim 15 , wherein, if the second solution is not better than the first solution, incrementing the discrepancy and recurring the LDS up to the given maximum discrepancy. 18 . The system of claim 15 , wherein the discrepancy is a maximum number of modified features and hyperparameters. 19 . The system of claim 15 , wherein the updating of the weight table during LDS is enabled by a weighted sum of a number of visits to each feature of the plurality of features and a number of successes for improving an objective value. 20 . The system of claim 19 , wherein the weighted sum is given by: T ( x )= w 1 ·v ( x )+ w 2 ·u ( x ), where x is a variable, w

Assignees

Inventors

Classifications

  • using chaos models or non-linear system models · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • G06N3/0985Primary

    Hyperparameter optimisation; Meta-learning; Learning-to-learn · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12547930B2 cover?
A computer-implemented method for simultaneous feature selection and hyperparameter optimization of non-linear models of machine learning is provided including setting a first solution having first hyperparameters and a first set of features of a plurality of features of a training data set, initializing a weight table providing a score for each feature of the first set of features, and initial…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).