Device and method for searching neural network architecture using supernet

US2025028959A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025028959-A1
Application numberUS-202318455183-A
CountryUS
Kind codeA1
Filing dateAug 24, 2023
Priority dateJul 18, 2023
Publication dateJan 23, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for searching a neural network architecture using supernets comprises the steps of: (a) searching for subnets that can be extracted from a set search space; (b) counting the number of non-linear activation functions included in each subnet for each of the searched subnets; (c) grouping the searched subnets based on the counted number of non-linear activation functions; (d) assigning the subnet groups to multiple supernets; (e) searching for a neural network having an optimal architecture based on operation blocks of the subnet groups assigned to each of the multiple supernets.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for searching a neural network architecture, the method comprising the steps of: (a) searching for subnets that can be extracted from a set search space; (b) counting the number of non-linear activation functions included in each subnet for each of the searched subnets; (c) grouping the searched subnets based on the counted number of non-linear activation functions; (d) assigning the subnet groups to multiple supernets; (e) searching for a neural network having an optimal architecture based on operation blocks of the subnet groups assigned to each of the multiple supernets. 2 . The method for searching a neural network architecture according to claim 1 , wherein the step (c) includes grouping subnets with the same number of non-linear activation functions into the same group. 3 . The method for searching a neural network architecture according to claim 1 , wherein the step (b) includes counting the number of non-linear activation functions by counting the number of operation blocks set to use the non-linear activation function among the operation blocks included in the extracted subnets. 4 . The method for searching a neural network architecture according to claim 3 , wherein when the extracted subnet is a neural network having a parallel architecture, the number of non-linear activation functions is counted for each path of the parallel architecture, and the number of non-linear activation functions of the path having the largest number of non-linear activation functions among a plurality of paths is determined as the number of non-linear activation functions of the corresponding subnet. 5 . The method for searching a neural network architecture according to claim 1 , wherein in step (d), subnets belonging to the same group are assigned to the same supernet. 6 . The method for searching a neural network architecture according to claim 1 , wherein the number of supernets is determined for each subnet group based on the variance value of the distribution after obtaining the distribution of the number of subnets included in the group. 7 . The method for searching a neural network architecture according to claim 1 , wherein in order to determine the number of supernets, groups including subnets greater than a preset critical value are searched among the plurality of groups grouped in step (c), and the number of groups including subnets greater than the preset critical value is determined as the number of supernets. 8 . The method for searching a neural network architecture according to claim 1 , wherein in step (e), each of the multiple supernets extracts subnets corresponding to the number of non-linear activation functions associated with the group assigned to the corresponding supernet, thereby searching for a neural network having an optimal architecture. 9 . A device for searching a neural network architecture, the device including: a processor; and at least one memory connected to the processor, wherein the processor executes the steps of: (a) searching for subnets that can be extracted from a set search space; (b) counting the number of non-linear activation functions included in each subnet for each of the searched subnets; (c) grouping the searched subnets based on the counted number of non-linear activation functions; (d) assigning the subnet groups to multiple supernets; (e) searching for a neural network having an optimal architecture based on operation blocks of the subnet groups assigned to each of the multiple supernets. 10 . The device for searching a neural network architecture according to claim 9 , wherein the step (c) includes grouping subnets with the same number of non-linear activation functions into the same group. 11 . The device for searching a neural network architecture according to claim 9 , wherein the step (b) includes counting the number of non-linear activation functions by counting the number of operation blocks set to use the non-linear activation function among the operation blocks included in the extracted subnets. 12 . The device for searching a neural network architecture according to claim 11 , wherein when the extracted subnet is a neural network having a parallel architecture, the number of non-linear activation functions is counted for each path of the parallel architecture, and the number of non-linear activation functions of the path having the largest number of non-linear activation functions among a plurality of paths is determined as the number of non-linear activation functions of the corresponding subnet. 13 . The device for searching a neural network architecture according to claim 9 , wherein in step (d), subnets belonging to the same group are assigned to the same supernet. 14 . The device for searching a neural network architecture according to claim 9 , wherein the number of supernets is determined for each subnet group based on the variance value of the distribution after obtaining the distribution of the number of subnets included in the group. 15 . The device for searching a neural network architecture according to claim 9 , wherein in order to determine the number of supernets, groups including subnets greater than a preset critical value are searched among the plurality of groups grouped in step (c), and the number of groups including subnets greater than the preset critical value is determined as the number of supernets. 16 . The device for searching a neural network architecture according to claim 9 , wherein in step (e), each of the multiple supernets extracts subnets corresponding to the number of non-linear activation functions associated with the group assigned to the corresponding supernet, thereby searching for a neural network having an optimal architecture.

Assignees

Inventors

Classifications

  • G06N3/048Primary

    Activation functions · CPC title

  • G06N3/045Primary

    Combinations of networks · CPC title

  • G06N3/082Primary

    modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025028959A1 cover?
A method for searching a neural network architecture using supernets comprises the steps of: (a) searching for subnets that can be extracted from a set search space; (b) counting the number of non-linear activation functions included in each subnet for each of the searched subnets; (c) grouping the searched subnets based on the counted number of non-linear activation functions; (d) assigning th…
Who is the assignee on this patent?
Uif Univ Industry Foundation Yonsei Univ
What technology area does this patent fall under?
Primary CPC classification G06N3/048. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jan 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).