Communication between dataflow processing units and memories

US10564929B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10564929-B2
Application numberUS-201715665631-A
CountryUS
Kind codeB2
Filing dateAug 1, 2017
Priority dateSep 1, 2016
Publication dateFeb 18, 2020
Grant dateFeb 18, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A combination of memory units and dataflow processing units is disclosed for computation. A first memory unit is interposed between a first dataflow processing unit and a second dataflow processing unit. Operations for a dataflow graph are allocated across the first dataflow processing unit and the second dataflow processing unit. The first memory unit passes data between the first dataflow processing unit and the second dataflow processing unit to execute the dataflow graph. The first memory unit is a high bandwidth, shared memory device including a hybrid memory cube. The first dataflow processing unit and second dataflow processing unit include a plurality of circular buffers containing instructions for controlling data transfer between the first dataflow processing unit and second dataflow processing unit. Additional dataflow processing units and additional memory units are included for additional functionality and efficiency.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus for computation comprising: a first dataflow processing unit; a second dataflow processing unit; a first plurality of circular buffers, wherein the first plurality of circular buffers is included in the first dataflow processing unit and contains instructions for controlling data transfer between the first dataflow processing unit and the second dataflow processing unit; and a first memory unit interposed between the first dataflow processing unit and the second dataflow processing unit wherein operations for a dataflow graph are allocated across the first dataflow processing unit and the second dataflow processing unit and wherein the first memory unit passes data between the first dataflow processing unit and the second dataflow processing unit to execute the dataflow graph. 2. The apparatus of claim 1 wherein the first memory unit comprises a shared memory device. 3. The apparatus of claim 1 wherein the first memory unit comprises a hybrid memory cube. 4. The apparatus of claim 1 wherein the second dataflow processing unit comprises a second plurality of circular buffers containing instructions for controlling data transfer between the first dataflow processing unit and the second dataflow processing unit using the first memory unit. 5. The apparatus of claim 1 wherein the first dataflow processing unit is coupled to the first memory unit via a first link. 6. The apparatus of claim 5 wherein the first link accesses a plurality of FIFOs within the first memory unit. 7. The apparatus of claim 6 wherein the plurality of FIFOs is configured using DRAM memory. 8. The apparatus of claim 7 wherein the plurality of FIFOs each have an address pointer to sequence through a FIFO from the plurality of FIFOs. 9. The apparatus of claim 6 wherein the plurality of FIFOs is statically defined. 10. The apparatus of claim 9 wherein definition for the plurality of FIFOs is accomplished at compile time based on the dataflow graph. 11. The apparatus of claim 5 wherein the first dataflow processing unit uses a write port within the first link to send data to the first memory unit. 12. The apparatus of claim 1 wherein the second dataflow processing unit is coupled to the first memory unit via a second link. 13. The apparatus of claim 12 wherein the second dataflow processing unit uses a read port within the second link to receive data from the first memory unit. 14. The apparatus of claim 12 wherein the second link accesses a plurality of FIFOs within the first memory unit. 15. The apparatus of claim 1 further comprising a third dataflow processing unit coupled to the first memory unit wherein the third dataflow processing unit accesses the first memory unit through a third link. 16. The apparatus of claim 15 further comprising a fourth dataflow processing unit coupled to the first memory unit wherein the fourth dataflow processing unit accesses the first memory unit through a fourth link. 17. The apparatus of claim 1 further comprising a second memory unit interposed between the first dataflow processing unit and the second dataflow processing unit. 18. The apparatus of claim 17 further comprising a third memory unit interposed between the first dataflow processing unit and the second dataflow processing unit. 19. The apparatus of claim 18 further comprising a fourth memory unit interposed between the first dataflow processing unit and the second dataflow processing unit. 20. The apparatus of claim 19 further comprising a third dataflow processing unit and a fourth dataflow processing unit. 21. The apparatus of claim 20 wherein each of the first memory unit, the second memory unit, the third memory unit, and the fourth memory unit access each of the first dataflow processing unit, the second dataflow processing unit, the third dataflow processing unit, and the fourth dataflow processing unit on different links. 22. The apparatus of claim 1 wherein the first dataflow processing unit, the second dataflow processing unit, and the first memory unit comprise a deep learning machine. 23. A processor-implemented method for computation comprising: obtaining data from a first dataflow processing unit, wherein the first dataflow processing unit includes a plurality of circular buffers containing instructions to control dataflow; sending the data from the first dataflow processing unit through a first memory unit interposed between the first dataflow processing unit and a second dataflow processing unit wherein operations for a dataflow graph are allocated across the first dataflow processing unit and the second dataflow processing unit and wherein the first memory unit passes data between the first dataflow processing unit and the second dataflow processing unit to execute the dataflow graph; and receiving the data into the second dataflow processing unit.

Assignees

Inventors

Classifications

  • using switching circuits, e.g. switching matrix, connection or expansion network (G06F13/4009 takes precedence) · CPC title

  • data or demand driven · CPC title

  • using buffers · CPC title

  • Access to shared memory · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10564929B2 cover?
A combination of memory units and dataflow processing units is disclosed for computation. A first memory unit is interposed between a first dataflow processing unit and a second dataflow processing unit. Operations for a dataflow graph are allocated across the first dataflow processing unit and the second dataflow processing unit. The first memory unit passes data between the first dataflow pro…
Who is the assignee on this patent?
Wave Computing Inc
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 18 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).