Medical disease feature selection method based on improved salp swarm algorithm

US2023029947A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023029947-A1
Application numberUS-202217860077-A
CountryUS
Kind codeA1
Filing dateJul 7, 2022
Priority dateJul 23, 2021
Publication dateFeb 2, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The disclosure is a medical disease feature selection method based on an improved salp swarm algorithm. The improved salp swarm algorithm is used to optimize a feature selection problem, the accuracy of the mentioned method is estimated by means of a transfer function and classification by a K-nearest neighbor algorithm, and the salp swarm algorithm is improved by using a self-adapted control parameter and an elite grey wolf ruling policy, so that it helps the algorithm avoid premature convergence in the optimization process and jump out of local optimum, thereby achieving a target of the algorithm with the smallest selected feature quantity and the highest classification precision. The method has the advantages of high rate of convergence, higher classification precision and better robustness.

First claim

Opening claim text (preview).

What is claimed is: 1 . A medical disease feature selection method based on an improved salp swarm algorithm, the method comprises the following steps: Step S1, acquiring a microarray gene data set of medical diseases, marking a line number of the microarray gene data set of the medical diseases as m and a column number thereof as n, wherein the microarray gene data set of the medical diseases, which is obtained, is formed by arranging m*n gene feature data according to m lines and n columns; partitioning the microarray gene data set of the medical diseases into 10 subsets randomly by using a 10-cross validation function, wherein a line number of each of the 10 subsets is greater than or equal to 1 and column numbers are n; and selecting one subset from the 10 subsets as a validation set, the rest of the subsets being training sets; Step S2, defining a female salp population Y, wherein a size of the female salp population Y being M=20, i.e., there are M individuals in the female salp population Y, and each of the M individuals in the female salp population Yis respectively represented by a data matrix formed by arranging n dimensionality values in one line and n columns; and then performing initialized assignment to each of the dimensionality values of each individual in the female salp population Y by using a random number between 0 and 1 to obtain 0th of the female salp population Y°; Step S3, setting a global optimum fitness value is best, performing initialized assignment of best to positive infinity, setting a global optimal individual to be a bestposition, and initially setting the bestposition as a data matrix [0, 0, 0, . . . , 0] with one line and n columns; Step S4, setting a maximum time of an iteration of the female salp population T=50, setting an iterative time variable t and initially setting t as 1; Step S5, performing a t th iteration on the female salp population, an iteration process specifically including: Step S5.1, converting each dimensionality value of each individual in a (t−1) th generation female salp population Y t−1 into 0 or 1 via transfer functions shown in formulae (1)-(2) to obtain a t th generation binary salp population Bt; B i , j t = { 1 , S ⁡ ( Y i , j t - 1 ) ≥ r 0 , S ⁡ ( Y i , j t - 1 ) < r ( 1 ) S ⁡ ( Y i , j t - 1 ) = 1 1 + e - ( Y i , j t - 1 / 3 ) , ( 2 ) wherein Y i,j t−1 represents a dimensionality value in the j th column of the i th individual in the (t−1) th generation female salp population, i is equal to 1, 2, 3, . . . M, j is equal to 1, 2, 3, . . . n, B i,j t represents a dimensionality value in the j th column of the i th individual in the t th generation binary salp population, r is a random number between 0 and 1 and is generated a random function before operation every time, and e is a natural constant; Step S5.2, constructing feature subsets of each individual in the (t−1) th generation female salp population, the step S5.2 specifically including: judging whether the dimensionality value of eac

Assignees

Inventors

Classifications

  • based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO] · CPC title

  • for computer-aided diagnosis, e.g. based on medical expert systems · CPC title

  • G16H50/70Primary

    for mining of medical data, e.g. analysing previous cases of other patients · CPC title

  • for data related to laboratory analysis, e.g. patient specimen analysis · CPC title

  • G06N3/126Primary

    Evolutionary algorithms, e.g. genetic algorithms or genetic programming · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023029947A1 cover?
The disclosure is a medical disease feature selection method based on an improved salp swarm algorithm. The improved salp swarm algorithm is used to optimize a feature selection problem, the accuracy of the mentioned method is estimated by means of a transfer function and classification by a K-nearest neighbor algorithm, and the salp swarm algorithm is improved by using a self-adapted control p…
Who is the assignee on this patent?
Univ Wenzhou
What technology area does this patent fall under?
Primary CPC classification G16H50/70. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Feb 02 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).