Method and apparatus for artificial intelligence acceleration

US10467527B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10467527-B1
Application numberUS-201815885665-A
CountryUS
Kind codeB1
Filing dateJan 31, 2018
Priority dateJan 31, 2018
Publication dateNov 5, 2019
Grant dateNov 5, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for artificial intelligence acceleration is provided. The apparatus includes a storage and compute system having a distributed, redundant key value store for metadata. The storage and compute system having distributed compute resources configurable to access, through a plurality of authorities, data in the solid-state memory, run inference with a deep learning model, generate vectors for the data and store the vectors in the key value store.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus for artificial intelligence acceleration, comprising: a storage and compute system having a distributed, redundant key value store for metadata; and the storage and compute system having distributed compute resources configurable to access, through a plurality of authorities, data in the solid-state memory, run inference with a deep learning model, generate vectors for the data and store the vectors in the key value store, the distributed compute resources configurable to store metadata for one or more buckets containing the data and create bucket views containing links to objects within the data that are contained in the one or more buckets and stored in the solid-state memory. 2. The apparatus of claim 1 , further comprising: the storage and compute system having at least one network port, and configurable to receive an object or file, run the inference for the object or the file with the deep learning model, based on the vectors in the key value store, and output a response based on the inference. 3. The apparatus of claim 1 , wherein storage capacity of the solid-state memory is sufficient to hold all of the data for the inference with the deep learning model throughout the processing of an entire data pipeline. 4. The apparatus of claim 1 , wherein the plurality of authorities are configurable for distributed file deletion in accordance with the data identifiers owned by each authority. 5. The apparatus of claim 1 , wherein the links satisfy a similarity predicate, based on the vectors in the key value store. 6. The apparatus of claim 1 , further comprising: the distributed compute resources configurable to compute a vector of an object for a search, perform a hash operation on the vector, search for vectors in the key value store that are within a specified distance of the vector, based on a result of the hash operation, and retrieve one or more full vectors for objects stored in the plurality of solid-state memory storages based on a result of the search, wherein the vectors are stored in the key value store using a locality-preserving hash operation. 7. The apparatus of claim 1 , further comprising: the distributed compute resources configurable to receive tags associated with the data, and generate a list or count of data entities having tags that satisfy a predicate. 8. A method for artificial intelligence acceleration, performed by a storage and compute system, the method comprising: accessing data in the system through a plurality of authorities; performing inference with a deep learning model; generating vectors for the data based on the performing; storing the vectors in a key value store of the system; storing metadata for one or more buckets containing examples in the unstructured data; and generating bucket views containing links to the examples. 9. The method of claim 8 , further comprising: receiving an object or file; and performing the inference for the object or file with the deep learning model based on vectors in the key value store. 10. The method of claim 8 , further comprising: outputting a response based on the inference. 11. The method of claim 8 , wherein the system is configurable to write and receive unstructured data into solid state memory of the system. 12. The method of claim 8 , wherein the key value store is a distributed key value store, across storage nodes of the system. 13. The apparatus of claim 9 , wherein the links to the examples satisfy a similarity predicate, based on the search of the vectors. 14. A tangible, non-transitory, computer-readable media having instructions thereupon which, when executed by a processor, cause the processor to perform a method comprising: accessing data in the system through a plurality of authorities; performing inference with a deep learning model; generating vectors for the data based on the performing; storing the vectors in a key value store of the system; storing metadata for one or more buckets containing examples in the unstructured data; and generating bucket views containing links to the examples. 15. The computer readable media of claim 14 , wherein the method further comprises: receiving an object or file; and performing the inference for the object or file with the deep learning model based on vectors in the key value store. 16. The computer readable media of claim 14 , wherein the method further comprises: outputting a response based on the inference. 17. The method of claim 14 , wherein the system is configurable to write and receive unstructured data into solid state memory of the system. 18. The method of claim 14 , wherein the key value store is a distributed key value store, across storage nodes of the system. 19. The computer readable media of claim 14 , wherein the links to the examples satisfy a similarity predicate, based on the search of the vectors.

Assignees

Inventors

Classifications

  • Databases characterised by their database models, e.g. relational or object models · CPC title

  • using electronic means · CPC title

  • Management of blocks · CPC title

  • Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10467527B1 cover?
An apparatus for artificial intelligence acceleration is provided. The apparatus includes a storage and compute system having a distributed, redundant key value store for metadata. The storage and compute system having distributed compute resources configurable to access, through a plurality of authorities, data in the solid-state memory, run inference with a deep learning model, generate vecto…
Who is the assignee on this patent?
Pure Storage Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 05 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).