Neural network accelerator in DIMM form factor

US12443837B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12443837-B2
Application numberUS-202016994990-A
CountryUS
Kind codeB2
Filing dateAug 17, 2020
Priority dateAug 17, 2020
Publication dateOct 14, 2025
Grant dateOct 14, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The technology relates to a neural network dual in-line memory module (NN-DIMM), a microelectronic system comprising a CPU and a plurality of the NN-DIMMs, and a method of transferring information between the CPU and the plurality of the NN-DIMMS. The NN-DIMM may include a module card having a plurality of parallel edge contacts adjacent to an edge of a slot connector thereof and configured to have the same command and signal interface as a standard dual in-line memory module (DIMM). The NN-DIMM may also include a deep neural network (DNN) accelerator affixed to the module card, and a bridge configured to transfer information between the DNN accelerator and the plurality of parallel edge contacts via a DIMM external interface.

First claim

Opening claim text (preview).

The invention claimed is: 1. A neural network dual in-line memory module (NN-DIMM), comprising: a module card having a plurality of parallel edge contacts adjacent an edge of a slot connector thereof and configured to have the same command and signal interface as a standard dual in-line memory module (DIMM); a deep neural network (DNN) accelerator affixed to the module card; and a bridge configured to transfer information between the DNN accelerator and the plurality of parallel edge contacts via a DIMM external interface, and wherein the bridge includes control registers and on-chip memory are accessible as a standard DRAM to a host CPU, wherein the bridge is configured to map the control registers and the on-chip memory to one row of a plurality of banks of the NN-DIMM, and wherein access of the control registers or the on-chip memory is limited to the one row of the plurality of banks. 2. The NN-DIMM of claim 1 , comprising a DNN accelerator chip affixed to the module card, the DNN accelerator chip comprising the DNN accelerator, the bridge, and the DIMM external interface. 3. The NN-DIMM of claim 2 , wherein the DNN accelerator chip is an off-the-shelf DNN accelerator chip. 4. The NN-DIMM of claim 1 , further comprising one or more memory chips affixed to the module card and connected with the bridge via an interface. 5. The NN-DIMM of claim 4 , further comprising a memory controller connected with the one or more memory chips and the bridge. 6. The NN-DIMM of claim 1 , wherein the bridge includes a controller connected with the DNN accelerator, the control registers, the on-chip memory, and the DIMM external interface. 7. The NN-DIMM of claim 1 , further comprising a power connector affixed to the module card and configured to connect to a power cable from a motherboard. 8. The NN-DIMM of claim 1 , wherein the plurality of parallel edge contacts includes 288 parallel edge contacts comprising a standard DDR4 contact configuration. 9. A microelectronic system comprising a CPU affixed to a motherboard, the CPU having a plurality of memory channels each including a respective neural network dual in-line memory module (NN-DIMM) electronically connected with the CPU, each NN-DIMM comprising: a module card having a plurality of parallel edge contacts adjacent an edge of a slot connector thereof and configured to have the same command and signal interface as a standard dual in-line memory module (DIMM); a deep neural network (DNN) accelerator affixed to the module card; and a bridge configured to transfer information between the DNN accelerator and the plurality of parallel edge contacts via a DIMM external interface, and wherein the bridge includes control registers and on-chip memory are configured to be accessed as a standard DRAM to a host CPU, wherein the bridge is configured to map the control registers and the on-chip memory to one row of a plurality of banks of the NN-DIMM, and wherein the control registers or the on-chip memory access is limited to the one row in the plurality of banks. 10. The microelectronic system of claim 9 , wherein the slot connector of each NN-DIMM is mounted inside a corresponding socket connector affixed to the motherboard. 11. The microelectronic system of claim 9 , wherein each of the memory channels further includes a standard DIMM. 12. The microelectronic system of claim 9 , wherein the plurality of memory channels includes 16 DDR4 memory channels. 13. The microelectronic system of claim 9 , comprising a DNN accelerator chip affixed to the module card, the DNN accelerator chip comprising the DNN accelerator, the bridge, and the DIMM external interface. 14. The microelectronic system of claim 9 , wherein the DNN accelerator chip is an off-the-shelf DNN accelerator chip. 15. A method of transferring information between a host system CPU and a plurality of neural network dual in-line memory modules (NN-DIMMs), the method comprising: electrically connecting each of the NN-DIMMs to the host system CPU, the CPU having a plurality of memory channels each including a respective one of the NN-DIMMs, each of the NN-DIMMs comprising a deep neural network (DNN) accelerator and a bridge configured to transfer the information between the DNN accelerator and the host system CPU via a DIMM external interface, wherein the bridge of each of the NN-DIMMs includes control registers and on-chip memory that are seen as a standard DRAM to the host system CPU and wherein the bridge is configured to map the control registers and the on-chip memory to one row of a plurality of memory banks of the NN-DIMM; accessing, by the control registers or the on-chip memory, only the one row in the plurality of banks; and transferring the information between the host system CPU and the plurality of NN-DIMMs. 16. The method of claim 15 , further comprising the host system controlling the NN-DIMMs through memory read and write operations. 17. The method of claim 15 , wherein each of the NN-DIMMs comprises a module card having a plurality of parallel edge contacts adjacent an edge of a slot connector thereof and having the same command and signal interface as a standard dual in-line memory module (DIMM), the slot connector of each NN-DIMM is mounted inside a corresponding socket connector affixed to a motherboard, and the host system CPU is affixed to the motherboard. 18. The method of claim 15 , wherein the plurality of memory channels includes 16 DDR4 memory channels. 19. The NN-DIMM of claim 1 , wherein the control registers or the on-chip memory accesses the predetermined row in the plurality of banks using only column read or write commands. 20. The NN-DIMM of claim 1 , wherein the plurality of banks comprise memory banks formed from the control registers or on-chip memory and that are accessed by the host CPU using column read or write commands.

Assignees

Inventors

Classifications

  • Learning methods · CPC title

  • using bus bridges (G06F13/4022 takes precedence) · CPC title

  • using a secondary processor, e.g. coprocessor (peripheral processor G06F13/12) · CPC title

  • Register arrangements · CPC title

  • System on board, i.e. computer system on one or more PCB, e.g. motherboards, daughterboards or blades · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12443837B2 cover?
The technology relates to a neural network dual in-line memory module (NN-DIMM), a microelectronic system comprising a CPU and a plurality of the NN-DIMMs, and a method of transferring information between the CPU and the plurality of the NN-DIMMS. The NN-DIMM may include a module card having a plurality of parallel edge contacts adjacent to an edge of a slot connector thereof and configured to …
Who is the assignee on this patent?
Google Llc
What technology area does this patent fall under?
Primary CPC classification G06N3/065. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 14 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).