Processing for multiple input data sets

US11475306B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11475306-B2
Application numberUS-201815933201-A
CountryUS
Kind codeB2
Filing dateMar 22, 2018
Priority dateMar 22, 2018
Publication dateOct 18, 2022
Grant dateOct 18, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein are techniques for performing multi-layer neural network processing for multiple contexts. In one embodiment, a computing engine is set in a first configuration to implement a second layer of a neural network and to process first data related to a first context to generate first context second layer output. The computing engine can be switched from the first configuration to a second configuration to implement a first layer of the neural network. The computing engine can be used to process second data related to a second context to generate second context first layer output. The computing engine can be set to a third configuration to implement a third layer of the neural network to process the first context second layer output and the second context first layer output to generate a first processing result of the first context and a second processing result of the second context.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of processing data, comprising: setting a computing engine in a first configuration to implement a first layer of a neural network; processing first data related to a first context using the computing engine in the first configuration to generate a first layer output of the first context; setting the computing engine in a second configuration to implement a second layer of the neural network; processing the first layer output of the first context using the computing engine in the second configuration to generate a second layer output of the first context; switching the computing engine from the second configuration back to the first configuration; processing second data related to a second context using the computing engine in the first configuration to generate a first layer output of the second context; setting the computing engine back to the second configuration to implement the second layer of the neural network; processing the first layer output of the second context in the second configuration to generate a second layer output of the second context; setting the computing engine in a third configuration to implement a third layer of the neural network; and processing the second layer output of the first context and the second layer output of the second context using the computing engine in the third configuration to generate a third layer output of the first context and a third layer output of the second context. 2. The method of claim 1 , further comprising: processing third data related to a third context using the computing engine in the first configuration to generate a first layer output of the third context; and processing the first layer output of the third context and the first layer output of the first context using the computing engine in the second configuration to generate, respectively, a second layer output of the third context and the second layer output of the first context. 3. The method of claim 2 , further comprising: storing the first layer output of the first context and the first layer output of the third context in a memory device; configuring a first portion of the computing engine to receive the first layer output of the first context from the memory device as input; configuring a second portion of the computing engine to receive the first layer output of the third context from the memory device as input; processing the first layer output of the first context using the first portion of the computing engine; and processing the first layer output of the third context using the second portion of the computing engine. 4. The method of claim 3 , wherein the processing of the first layer output of the first context using the first portion of the computing engine and the processing of the first layer output of the third context using the second portion of the computing engine are performed in parallel. 5. The method of claim 1 , further comprising: processing fourth data related to a fourth context using the computing engine in the first configuration to generate a first layer output of the fourth context; and processing the first layer output of the second context and the first layer output of the fourth context using the computing engine in the second configuration to generate, respectively, the second layer output of the second context and a second layer output of the fourth context. 6. The method of claim 5 , further comprising: storing the first layer output of the second context and the first layer output of the fourth context in a memory device; configuring a first portion of the computing engine to receive the first layer output of the second context from the memory device as input; configuring a second portion of the computing engine to receive the first layer output of the fourth context from the memory device as input; processing the first layer output of the second context using the first portion of the computing engine; and processing the first layer output of the fourth context using the second portion of the computing engine. 7. The method of claim 6 , wherein the processing of the first layer output of the second context using the first portion of the computing engine and the processing of the first layer output of the fourth context using the second portion of the computing engine are performed substantially in parallel. 8. The method of claim 1 , further comprising: storing, at a memory device, the second layer output of the first context; storing, at the memory device in addition to the second layer output of the first context, the second data related to the second context; and storing, at the memory device in addition to the second layer output of the first context and the second data related to the second context, the first layer output of the second context. 9. The method of claim 8 , further comprising: overwriting at least a part of the second data related to the second context or the first layer output of the second context stored at the memory device with the second layer output of the second context. 10. The method of claim 9 , further comprising: configuring a third portion of the computing engine to receive the second layer output of the first context from the memory device as input; configuring a fourth portion of the computing engine to receive the second layer output of the second context from the memory device as input; processing the second layer output of the first context using the third portion of the computing engine; and processing the second layer output of the second context using the fourth portion of the computing engine. 11. The method of claim 10 , wherein the processing of the second layer output of the first context using the third portion of the computing engine and the processing of the second layer output of the second context using the fourth portion of the computing engine are performed substantially in parallel. 12. The method of claim 1 , wherein processing the first data related to the first context using the computing engine in the first configuration to generate the first layer output of the first context comprises: performing one or more convolution computations between the first data and a set of weights associated with the first layer of the neural network. 13. The method of claim 12 , further comprising: processing results of the one or more convolution computations by an activation function engine to generate the first layer output of the first context.

Assignees

Inventors

Classifications

  • Activation functions · CPC title

  • Combinations of networks · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

  • G06N3/082Primary

    modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • Single storage device · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11475306B2 cover?
Disclosed herein are techniques for performing multi-layer neural network processing for multiple contexts. In one embodiment, a computing engine is set in a first configuration to implement a second layer of a neural network and to process first data related to a first context to generate first context second layer output. The computing engine can be switched from the first configuration to a …
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 18 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).