Methods and apparatus of motion vector rounding, clipping and storage for inter prediction
US-2024333960-A1 · Oct 3, 2024 · US
US2023244441A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2023244441-A1 |
| Application number | US-202318131164-A |
| Country | US |
| Kind code | A1 |
| Filing date | Apr 5, 2023 |
| Priority date | Oct 5, 2020 |
| Publication date | Aug 3, 2023 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An electronic device and a control method therefor are disclosed. An electronic device of the present disclosure includes a processor, which quantizes weight data with a combination of sign data and scaling factor data to obtain quantized data, and may input the first input data into a first module to obtain second input data in which exponents of input values included in the first input data are converted to the same value; input the second input data and the sign data into a second module to determine the signs of input values and perform calculations between the input values of which signs are determined to obtain first output data; input the first output data into a third module to normalize output values included in the first output data; and perform a multiplication operation on data including the normalized output values and the scaling factor data to obtain second output data.
Opening claim text (preview).
What is claimed is: 1 . An electronic device, comprising: a memory configured to store first input data and weight data used for calculation of a neural network model; and a processor configured to quantize the weight data with a combination of sign data and scaling factor data to obtain quantized data, wherein the processor is further configured to: input the first input data to a first module to obtain second input data in which exponents of input values included in the first input data are converted into a same value; input the second input data and the sign data to a second module to determine signs of the input values included in the second input data, and perform calculations between the input values of which signs are determined to obtain first output data; input the first output data to a third module to normalize output values included in the first output data; and perform a multiplication operation on data including the normalized output values and the scaling factor data to obtain second output data. 2 . The electronic device of claim 1 , wherein the processor is further configured to: identify a minimum value among the exponents of the input values included in the first input data, and converts the exponents of the input values included in the first input data into the identified minimum value to obtain the second input data. 3 . The electronic device of claim 1 , wherein the processor is further configured to: determine a sign of an input value among the input values included in the second input data based on applying one of a −1 or a 1 included in the sign data to the input value among the input values included in the second input data. 4 . The electronic device of claim 1 , wherein the second module includes a plurality of calculation modules having a systolic array, and each of the plurality of calculation modules includes a sign determination circuit that determines a sign of the second input data using the sign data and a calculation circuit that performs a sum calculation between the input values included in the second input data of which the signs are determined. 5 . The electronic device of claim 4 , wherein the processor is further configured to: input a first input value among the second input data and a sign value corresponding to the first input value among the sign data to a first sign determination circuit included in a first calculation module among the plurality of calculation modules to determine the sign of the first input value. 6 . The electronic device of claim 5 , wherein the processor is further configured to: input the first input value of which the signs are determined and a second input value output from a second calculation module disposed above the first calculation module on the systolic array to a first calculation circuit among the first calculation module to obtain a sum of the first input value and the input values included in the second input data. 7 . The electronic device of claim 1 , wherein the processor is further configured to: change a first digit of a mantissa of the output value included in the first output data to be a one-digit natural number smaller than a base to normalize the output value included in the first output data. 8 . The electronic device of claim 1 , wherein at least one of the scaling factor data and the first input data is implemented as data in a floating-point format. 9 . The electronic device of claim 1 , wherein the processor is further configured to: quantize the weight data by summing k products of the sign data and the scaling factor data, and a size of k is determined based on an accuracy level required when performing the calculation of the neural network model. 10 . A method of controlling an electronic device including a memory that stores first input data and weight data used for calculation of a neural network model, the method comprising: quantizing the weight data with a combination of sign data and scaling factor data to obtain quantized data; inputting the first input data to a first module to obtain second input data in which exponents of input values included in the first input data are converted into a same value; inputting the second input data and the sign data to a second module to determine signs of the input values included in the second input data, and performing calculation between the input values of which signs are determined to obtain first output data; inputting the first output data to a third module to normalize output values included in the first output data; and performing a multiplication operation on data including the normalized output values and the scaling factor data to obtain second output data. 11 . The method of claim 10 , wherein the obtaining of the second input data comprises identifying a minimum value among the exponents of the input values included in the first input data, and converting the exponents of the input values included in the first input data into the identified minimum value to obtain the second input data. 12 . The method of claim 10 , wherein the obtaining of the first output data comprises applying one of a −1 or a 1 included in the sign data to a input value among the input values included in the second input data to determine a sign of the input value among the input values included in the second input data. 13 . The method of claim 10 , wherein the second module comprises a plurality of calculation modules having a systolic array, and each of the plurality of calculation modules comprises a sign determination circuit that determines a sign of the second input data using the sign data and a calculation circuit that performs a sum calculation between the input values included in the second input data of which the signs are determined. 14 . The method of claim 13 , wherein the obtaining of the first output data comprises inputting a first input value among the second input data and a sign value corresponding to the first input value among the sign data to a first sign determination circuit included in a first calculation module among the plurality of calculation modules to determine the sign of the first input value. 15 . The method of claim 14 , wherein the obtaining of the first output data comprises inputting the first input value of which the signs are determined and a second input value output from a second calculation module disposed above the first calculation module on the systolic array to a first calculation circuit among the first calculation module to obtain a sum of the first input value and the input values included in the second input data. 16 . The method of claim 10 , wherein the inputting the first output data to the third module to normalize output values included in the first output data comprises changing a first digit of a mantissa of the output value included in the first output data to be a one-digit natural number smaller than a base to normalize the output value included in the first output data. 17 . The method of claim 10 , wherein at least one of the scaling factor data and the first input data is implemented as data in a floating-point format. 18 . The method of claim 10 , wherein the quantizing the weight data comprises quantizing the weight data by summing k products of the sign data and the scaling factor data, and wherein a size of k is determined based on an accuracy level required when performing the calculation of the neural network model. 19 . A non-transitory computer readable recording medium stori
Quantised networks; Sparse networks; Compressed networks · CPC title
for shifting, e.g. justifying, scaling, normalising {(digital stores in which the information is moved stepwise, e.g. shift-registers G11C19/00; digital stores in which the information circulates G11C21/00)} · CPC title
Multiplying · CPC title
Sum of products (for applications thereof, see the relevant places, e.g. G06F17/10, H03H17/00) · CPC title
Adding; Subtracting {(G06F7/4833, G06F7/4836 take precedence)} · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.