Data processing apparatus and related products

US12346697B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12346697-B2
Application numberUS-202318531734-A
CountryUS
Kind codeB2
Filing dateDec 7, 2023
Priority dateApr 4, 2019
Publication dateJul 1, 2025
Grant dateJul 1, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides a data processing apparatus and related products. The products include a control module including an instruction caching unit, an instruction processing unit, and a storage queue unit. The instruction caching unit is configured to store computation instructions associated with an artificial neural network operation; the instruction processing unit is configured to parse the computation instructions to obtain a plurality of operation instructions; and the storage queue unit is configured to store an instruction queue, where the instruction queue includes a plurality of operation instructions or computation instructions to be executed in the sequence of the queue. By adopting the above-mentioned method, the present disclosure can improve the operation efficiency of related products when performing operations of a neural network model.

First claim

Opening claim text (preview).

The invention claimed is: 1. A data synchronization method performed by a first processor, comprising: determining synchronization information of tensor data according to a descriptor of the tensor data to be synchronized, wherein the descriptor indicates a shape of the tensor data to be synchronized; generating a synchronization instruction according to the synchronization information of the tensor data; and sending the synchronization instruction to a second processor, wherein the synchronization instruction is used to instruct the second processor to obtain the tensor data to be synchronized according to the synchronization instruction. 2. The data synchronization method of claim 1 , wherein the synchronization information includes a storage address of the tensor data to be synchronized, wherein the generating a synchronization instruction according to the synchronization information of the tensor data includes: when the storage address of the tensor data to be synchronized is in a shared storage space, generating the synchronization instruction according to the storage address of the tensor data to be synchronized, wherein the synchronization instruction is used to instruct the second processor to obtain the tensor data to be synchronized from the shared storage space. 3. The data synchronization method of claim 1 , wherein the synchronization information includes a storage address of the tensor data to be synchronized, wherein the generating a synchronization instruction according to the synchronization information of the tensor data includes: when the storage address of the tensor data to be synchronized is in a non-shared storage space, storing the tensor data to be synchronized in the shared storage space; and according to the storage address of the tensor data to be synchronized in the shared storage space, generating the synchronization instruction to instruct the second processor to obtain the tensor data to be synchronized from the shared storage space. 4. The data synchronization method of claim 1 , further comprising: determining the descriptor of the tensor data to be synchronized according to a synchronization request instruction from the second processor. 5. The data synchronization method of claim 4 , wherein the synchronization request instruction includes data characteristics of the tensor data to be synchronized, wherein the determining the descriptor of the tensor data to be synchronized according to the synchronization request instruction from the second processor includes: parsing the synchronization request instruction to obtain the data characteristics of the tensor data to be synchronized; and determining the descriptor of the tensor data to be synchronized according to the data characteristics of the tensor data to be synchronized. 6. A data synchronization method performed by a second processor, comprising: parsing a synchronization instruction from a first processor to obtain synchronization information of tensor data to be synchronized; determining a descriptor of the tensor data to be synchronized according to the synchronization information of the tensor data to be synchronized, wherein the descriptor indicates a shape of the tensor data to be synchronized; and obtaining the tensor data to be synchronized according to the descriptor of the tensor data to be synchronized. 7. The data synchronization method of claim 6 , wherein the synchronization information includes a storage address of the tensor data to be synchronized, wherein the determining a descriptor of the tensor data to be synchronized according to the synchronization information of the tensor data to be synchronized includes: determining an identifier of the descriptor of the tensor data to be synchronized or content of the descriptor according to the storage address of the tensor data to be synchronized; and the obtaining the tensor data to be synchronized according to the descriptor of the tensor data to be synchronized includes: according to the content of the descriptor of the tensor data to be synchronized, obtaining the tensor data to be synchronized from a shared storage space. 8. A data synchronization method performed by a second processor, comprising: when there is tensor data to be synchronized, generating a synchronization request instruction, wherein the synchronization request instruction is used to instruct a first processor to determine a descriptor of the tensor data to be synchronized, and the descriptor indicates a shape of the tensor data to be synchronized; and sending the synchronization request instruction to the first processor. 9. The data synchronization method of claim 8 , wherein the synchronization request instruction includes data characteristics of the tensor data to be synchronized. 10. The data synchronization method of claim 8 , further comprising: parsing a synchronization instruction from the first processor to obtain synchronization information of the tensor data to be synchronized; determining the descriptor of the tensor data to be synchronized according to the synchronization information of the tensor data to be synchronized; and obtaining the tensor data to be synchronized according to the descriptor of the tensor data to be synchronized. 11. The data synchronization method of claim 10 , wherein the synchronization information includes a storage address of the tensor data to be synchronized, wherein the determining the descriptor of the tensor data to be synchronized according to the synchronization information of the tensor data to be synchronized includes: determining an identifier of the descriptor of the tensor data to be synchronized or content of the descriptor according to the storage address of the tensor data to be synchronized; and the obtaining the tensor data to be synchronized according to the descriptor of the tensor data to be synchronized includes: according to the content of the descriptor of the tensor data to be synchronized, obtaining the tensor data to be synchronized from a shared storage space. 12. A data synchronization apparatus, comprising: a first processor: configured to determine synchronization information of tensor data according to a descriptor of the tensor data to be synchronized, wherein the descriptor indicates a shape of the tensor data to be synchronized; generate a synchronization instruction according to the synchronization information of the tensor data; and send the synchronization instruction to a second processor, where the synchronization instruction is used to instruct the second processor to obtain the tensor data to be synchronized according to the synchronization instruction. 13. The data synchronization apparatus of claim 12 , wherein the synchronization information includes a storage address of the tensor data to be synchronized, wherein when the storage address of the tensor data to be synchronized is in a shared storage space, the first processor is further configured to: generate the synchronization instruction according to the storage address of the tensor data to be synchronized, wherein the synchronization instruction is used to instruct the second processor to obtain the tensor data to be synchronized from the shared storage space. 14. The data synchronization apparatus of claim 12 , wherein the synchronization information includes a storage address of the tensor data to be synchronized, wherein the first processor is further configured to: when the storage address of the tensor data to be synchronized is in a non-shared storage space, store the tensor data to be synchronized in the shared storage space; and acc

Assignees

Inventors

Classifications

  • Instructions to perform operations on packed data, e.g. vector, tile or matrix operations · CPC title

  • using electronic means · CPC title

  • Dependency mechanisms, e.g. register scoreboarding · CPC title

  • Operand accessing · CPC title

  • Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12346697B2 cover?
The present disclosure provides a data processing apparatus and related products. The products include a control module including an instruction caching unit, an instruction processing unit, and a storage queue unit. The instruction caching unit is configured to store computation instructions associated with an artificial neural network operation; the instruction processing unit is configured t…
Who is the assignee on this patent?
Cambricon Tech Corp Ltd
What technology area does this patent fall under?
Primary CPC classification G06F9/30036. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).