Method for constructing design concept generation network (dcgn) and method for automatically generating conceptual scheme

US2024005130A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024005130-A1
Application numberUS-202318120434-A
CountryUS
Kind codeA1
Filing dateMar 13, 2023
Priority dateJul 4, 2022
Publication dateJan 4, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for constructing a design concept generation network (DCGN) and a method for automatically generating a conceptual scheme are provided. A DCGN includes a Transformer encoder, a Transformer decoder, an importance constraint matrix generation module, an importance constraint embedding layer, a cross-attention (CA) layer, and an optimization module. A word importance constraint is ingeniously introduced based on an attention mechanism of a Transformer to record input word constraint information contained in a generated text sequence. This can effectively ensure the reliability and effectiveness of a generated conceptual scheme and is conducive to capturing potential semantic importance information and implementing semantic knowledge reasoning.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for constructing a design concept generation network (DCGN), wherein the DCGN comprises a Transformer encoder, a Transformer decoder, an importance constraint matrix generation module, an importance constraint embedding layer, a cross-attention (CA) layer, and an optimization module; and the method comprises the following steps: S1: obtaining, by the Transformer encoder, a feature of a hidden layer of the Transformer encoder based on input words in a sample; S2: obtaining, by the Transformer decoder, a feature of a hidden layer of the Transformer decoder based on a target sequence in the sample; S3: obtaining, by the importance constraint matrix generation module, an importance constraint matrix based on the input words and the target sequence in the sample; S4: mapping, by the importance constraint embedding layer, the importance constraint matrix to a distributed vector space to obtain two input word importance embedding features; S5: obtaining, by the CA layer, a generated sequence based on the feature of the hidden layer of the Transformer encoder, the feature of the hidden layer of the Transformer decoder, and the two input word importance embedding features; and S6: constructing a loss function based on the generated sequence and the target sequence, and adjusting, by the optimization module, network parameters based on the loss function; and repeating S1 to S6 until the loss function meets a specified requirement to obtain the DCGN. 2 . The method according to claim 1 , wherein in S1, the Transformer encoder obtains the feature h e of the hidden layer of the Transformer encoder by using the following formula: h e =SA( W e K x,W e V x,W e Q x )  (1), wherein x represents the input words; SA( ) represents a spatial attention; and W e K , W e V , and W e Q represent weight matrices of a self-attention layer of the Transformer encoder. 3 . The method according to claim 1 , wherein in S2, the Transformer decoder maps a target sequence y :t-1 =[y 0 ,y 1 , . . . , y t-1 ] at a moment t−1 to a distributed feature representation through a self-attention layer to obtain the feature h d t of the hidden layer of the Transformer decoder: h d t =SA( W d K y :t-1 ,W d V y :t-1 ,W d Q y :t-1 )  (2), wherein SA( ) represents a spatial attention; and W d K , W d V , and W d Q represent weight matrices of the self-attention layer of the Transformer decoder. 4 . The method according to claim 3 , wherein in S3, f(x, w , y :t ) represents an input word importance constraint vector C :t contained in the target sequence Y :t ; f(x, w ,y :t ) is calculated as follows: f ( x, w ,y :t )= w ·c t   (4), wherein ⋅· represents a dot product operation of a vector or a matrix; and w =[ w 1 , w 2 , . . . , w i , . . . , w m ]∈ m represents a relative importance vector of the input words x in the target sequence y :t and is calculated as follows: w ¯ i = [ w i - w min w max - w min × ( M - 1 ) ] , ∀ i ∈ { 1 , 2 , ⋯ , m } , ( 5 ) wherein w i represents a relative importance of an i th input word in the target sequence y :t ; w i represents an absolute importance of the i th input word in the target sequence y :t ; w min represents a minimum absolute importance of the input word in the target sequence y :t ; w max represents a maximum absolute importance of the input word in the target sequence y :t ; [ ] represents a rounding operation; and M≥m>1 and M represents a maximum number of input words contained in samples in an entire training sample set; and c t ∈ m represents an input word constraint contained in the target sequence y :t ; when the target sequence y :t contains the i th input word, an i th element in the vector c t is 1, and the vector is calculated as follows: c t = ( c t i ) i = 1 m = { c t i = 0 , if ⁢

Assignees

Inventors

Classifications

  • G06N3/0455Primary

    Auto-encoder networks; Encoder-decoder networks · CPC title

  • Learning methods · CPC title

  • G06F17/16Primary

    Matrix or vector computation {, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization (matrix transposition G06F7/78)} · CPC title

  • Generative networks · CPC title

  • Probabilistic or stochastic networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024005130A1 cover?
A method for constructing a design concept generation network (DCGN) and a method for automatically generating a conceptual scheme are provided. A DCGN includes a Transformer encoder, a Transformer decoder, an importance constraint matrix generation module, an importance constraint embedding layer, a cross-attention (CA) layer, and an optimization module. A word importance constraint is ingenio…
Who is the assignee on this patent?
Univ Sichuan
What technology area does this patent fall under?
Primary CPC classification G06N3/0455. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jan 04 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).