Electronic device and control method therefor

US2023244441A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023244441-A1
Application numberUS-202318131164-A
CountryUS
Kind codeA1
Filing dateApr 5, 2023
Priority dateOct 5, 2020
Publication dateAug 3, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An electronic device and a control method therefor are disclosed. An electronic device of the present disclosure includes a processor, which quantizes weight data with a combination of sign data and scaling factor data to obtain quantized data, and may input the first input data into a first module to obtain second input data in which exponents of input values included in the first input data are converted to the same value; input the second input data and the sign data into a second module to determine the signs of input values and perform calculations between the input values of which signs are determined to obtain first output data; input the first output data into a third module to normalize output values included in the first output data; and perform a multiplication operation on data including the normalized output values and the scaling factor data to obtain second output data.

First claim

Opening claim text (preview).

What is claimed is: 1 . An electronic device, comprising: a memory configured to store first input data and weight data used for calculation of a neural network model; and a processor configured to quantize the weight data with a combination of sign data and scaling factor data to obtain quantized data, wherein the processor is further configured to: input the first input data to a first module to obtain second input data in which exponents of input values included in the first input data are converted into a same value; input the second input data and the sign data to a second module to determine signs of the input values included in the second input data, and perform calculations between the input values of which signs are determined to obtain first output data; input the first output data to a third module to normalize output values included in the first output data; and perform a multiplication operation on data including the normalized output values and the scaling factor data to obtain second output data. 2 . The electronic device of claim 1 , wherein the processor is further configured to: identify a minimum value among the exponents of the input values included in the first input data, and converts the exponents of the input values included in the first input data into the identified minimum value to obtain the second input data. 3 . The electronic device of claim 1 , wherein the processor is further configured to: determine a sign of an input value among the input values included in the second input data based on applying one of a −1 or a 1 included in the sign data to the input value among the input values included in the second input data. 4 . The electronic device of claim 1 , wherein the second module includes a plurality of calculation modules having a systolic array, and each of the plurality of calculation modules includes a sign determination circuit that determines a sign of the second input data using the sign data and a calculation circuit that performs a sum calculation between the input values included in the second input data of which the signs are determined. 5 . The electronic device of claim 4 , wherein the processor is further configured to: input a first input value among the second input data and a sign value corresponding to the first input value among the sign data to a first sign determination circuit included in a first calculation module among the plurality of calculation modules to determine the sign of the first input value. 6 . The electronic device of claim 5 , wherein the processor is further configured to: input the first input value of which the signs are determined and a second input value output from a second calculation module disposed above the first calculation module on the systolic array to a first calculation circuit among the first calculation module to obtain a sum of the first input value and the input values included in the second input data. 7 . The electronic device of claim 1 , wherein the processor is further configured to: change a first digit of a mantissa of the output value included in the first output data to be a one-digit natural number smaller than a base to normalize the output value included in the first output data. 8 . The electronic device of claim 1 , wherein at least one of the scaling factor data and the first input data is implemented as data in a floating-point format. 9 . The electronic device of claim 1 , wherein the processor is further configured to: quantize the weight data by summing k products of the sign data and the scaling factor data, and a size of k is determined based on an accuracy level required when performing the calculation of the neural network model. 10 . A method of controlling an electronic device including a memory that stores first input data and weight data used for calculation of a neural network model, the method comprising: quantizing the weight data with a combination of sign data and scaling factor data to obtain quantized data; inputting the first input data to a first module to obtain second input data in which exponents of input values included in the first input data are converted into a same value; inputting the second input data and the sign data to a second module to determine signs of the input values included in the second input data, and performing calculation between the input values of which signs are determined to obtain first output data; inputting the first output data to a third module to normalize output values included in the first output data; and performing a multiplication operation on data including the normalized output values and the scaling factor data to obtain second output data. 11 . The method of claim 10 , wherein the obtaining of the second input data comprises identifying a minimum value among the exponents of the input values included in the first input data, and converting the exponents of the input values included in the first input data into the identified minimum value to obtain the second input data. 12 . The method of claim 10 , wherein the obtaining of the first output data comprises applying one of a −1 or a 1 included in the sign data to a input value among the input values included in the second input data to determine a sign of the input value among the input values included in the second input data. 13 . The method of claim 10 , wherein the second module comprises a plurality of calculation modules having a systolic array, and each of the plurality of calculation modules comprises a sign determination circuit that determines a sign of the second input data using the sign data and a calculation circuit that performs a sum calculation between the input values included in the second input data of which the signs are determined. 14 . The method of claim 13 , wherein the obtaining of the first output data comprises inputting a first input value among the second input data and a sign value corresponding to the first input value among the sign data to a first sign determination circuit included in a first calculation module among the plurality of calculation modules to determine the sign of the first input value. 15 . The method of claim 14 , wherein the obtaining of the first output data comprises inputting the first input value of which the signs are determined and a second input value output from a second calculation module disposed above the first calculation module on the systolic array to a first calculation circuit among the first calculation module to obtain a sum of the first input value and the input values included in the second input data. 16 . The method of claim 10 , wherein the inputting the first output data to the third module to normalize output values included in the first output data comprises changing a first digit of a mantissa of the output value included in the first output data to be a one-digit natural number smaller than a base to normalize the output value included in the first output data. 17 . The method of claim 10 , wherein at least one of the scaling factor data and the first input data is implemented as data in a floating-point format. 18 . The method of claim 10 , wherein the quantizing the weight data comprises quantizing the weight data by summing k products of the sign data and the scaling factor data, and wherein a size of k is determined based on an accuracy level required when performing the calculation of the neural network model. 19 . A non-transitory computer readable recording medium stori

Assignees

Inventors

Classifications

  • Quantised networks; Sparse networks; Compressed networks · CPC title

  • G06F5/01Primary

    for shifting, e.g. justifying, scaling, normalising {(digital stores in which the information is moved stepwise, e.g. shift-registers G11C19/00; digital stores in which the information circulates G11C21/00)} · CPC title

  • Multiplying · CPC title

  • G06F7/5443Primary

    Sum of products (for applications thereof, see the relevant places, e.g. G06F17/10, H03H17/00) · CPC title

  • Adding; Subtracting {(G06F7/4833, G06F7/4836 take precedence)} · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023244441A1 cover?
An electronic device and a control method therefor are disclosed. An electronic device of the present disclosure includes a processor, which quantizes weight data with a combination of sign data and scaling factor data to obtain quantized data, and may input the first input data into a first module to obtain second input data in which exponents of input values included in the first input data a…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F5/01. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Aug 03 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).