Two-phase distributed neural network training system

US10360500B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10360500-B2
Application numberUS-201815946331-A
CountryUS
Kind codeB2
Filing dateApr 5, 2018
Priority dateApr 20, 2017
Publication dateJul 23, 2019
Grant dateJul 23, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computing system provides distributed training of a neural network model. Explore phase options, exploit phase options, a subset of a training dataset, and a validation dataset are distributed to a plurality of computing devices. (a) Execution of the model by the computing devices is requested using the subset stored at each computing device. (b) A first result of the execution is received from a computing device. (c) Next configuration data for the neural network model is selected based on the first result and distributed to the computing device. (a) to (c) is repeated until an exploration phase is complete. (d) Execution of the neural network model is requested. (e) A second result is received. (f) Next configuration data is computed based on the second result and distributed to the computing device. (d) to (f) is repeated until an exploitation phase is complete. The next configuration data defines the model.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer-readable medium having stored thereon computer-readable instructions that when executed by a first computing device cause the first computing device to: distribute explore phase optimization parameter values to each computing device of a plurality of computing devices; distribute a subset of a training dataset to each computing device of the plurality of computing devices; distribute a validation dataset to each computing device of the plurality of computing devices; request initialization of a neural network model by the plurality of computing devices; (a) request execution of the initialized neural network model by the plurality of computing devices using the subset of the training dataset stored at each computing device of the plurality of computing devices; (b) receive a first result of the execution from a computing device of the plurality of computing devices, wherein the first result is an accuracy value computed using the distributed validation dataset; (c) identify a computing device of the plurality of computing devices that has an extremum value for the accuracy value in comparison to other computing devices of the plurality of computing devices; (d) request next configuration data for the neural network model from the identified computing device; (e) receive the requested next configuration data from the identified computing device; (f) distribute the received next configuration data to the plurality of computing devices except the identified computing device; repeat (a) to (f) until an exploration phase is complete based on the explore phase optimization parameter values, wherein in (a), the neural network model is reinitialized with the distributed next configuration data at each computing device of the plurality of computing devices; distribute exploit phase optimization parameter values to each computing device of the plurality of computing devices; (g) request execution of the neural network model by the plurality of computing devices using the subset of the training dataset stored at each computing device of the plurality of computing devices; (h) receive a second result of the execution from the computing device of the plurality of computing devices; (i) compute next configuration data for the neural network model based on the received second result; (j) distribute the computed next configuration data to the computing device; repeat (g) to (j) until an exploitation phase is complete based on the exploit phase optimization parameter values; and output at least a portion of the computed next configuration data to define a trained neural network model. 2. The non-transitory computer-readable medium of claim 1 , wherein each distributed subset is different. 3. The non-transitory computer-readable medium of claim 1 , wherein the validation dataset distributed to each computing device is identical. 4. The non-transitory computer-readable medium of claim 1 , wherein initialization of the neural network model at each computing device is identical. 5. The non-transitory computer-readable medium of claim 1 , wherein the first computing device is not included in the plurality of computing devices. 6. The non-transitory computer-readable medium of claim 1 , wherein the explore phase optimization parameter values include at least one value that is not equal to a corresponding value of the exploit phase optimization parameter values. 7. The non-transitory computer-readable medium of claim 1 , wherein (a) to (f) are repeated asynchronously for each computing device of the plurality of computing devices. 8. The non-transitory computer-readable medium of claim 7 , wherein (g) to (j) are repeated synchronously for the plurality of computing devices. 9. The non-transitory computer-readable medium of claim 1 , wherein (a) to (f) are repeated synchronously for the plurality of computing devices. 10. The non-transitory computer-readable medium of claim 9 , wherein (g) to (j) are repeated asynchronously for each computing device of the plurality of computing devices. 11. The non-transitory computer-readable medium of claim 1 , wherein the received second result is a gradient vector computed using the distributed validation dataset. 12. The non-transitory computer-readable medium of claim 11 , wherein computing the next configuration data for the neural network model based on the received second result comprises: solving an optimization problem defined by the exploit phase optimization parameter values using the gradient vector from each computing device of the plurality of computing devices; performing a collective reduce operation to aggregate the gradient vector from each computing device of the plurality of computing devices; and computing new neural network weight values from the aggregated gradient vector, wherein the next configuration data includes the new neural network weight values. 13. The non-transitory computer-readable medium of claim 1 , wherein distributing the selected next configuration data in (f) comprises sending, by the first computing device, the configuration data for the neural network model computed by the identified computing device to remaining devices of the plurality of computing devices. 14. The non-transitory computer-readable medium of claim 1 , wherein distributing the selected next configuration data in (f) comprises requesting that the identified computing device send configuration data for the neural network model computed by the identified computing device to remaining devices of the plurality of computing devices. 15. The non-transitory computer-readable medium of claim 1 , wherein identifying the computing device in (c) comprises: determining if the accuracy value is better than a current accuracy value; and when the accuracy value is better than a current accuracy value, the next configuration data for the neural network model is requested from the computing device and stored as the next configuration data and the current accuracy value is set equal to the accuracy value. 16. The non-transitory computer-readable medium of claim 15 , wherein distributing the selected next configuration data in (f) comprises sending the stored next configuration data to the computing device when the accuracy value is not better than the current accuracy value. 17. The non-transitory computer-readable medium of claim 15 , wherein the accuracy value is better than the current accuracy value when the accuracy value is greater than the current accuracy value. 18. The non-transitory computer-readable medium of claim 15 , wherein after the exploration phase is complete and before (g), the computer-readable instructions further cause the first computing device to send the stored next configuration data to each computing device of the plurality of computing devices to reinitialize the neural network model at each computing device of the plurality of computing devices. 19. The non-transitory computer-readable medium of claim 1 , wherein execution of the initialized neural network model by the plurality of computing devices in (a) comprises second computer-readable instructions that when executed by each computing device of the plurality of computing devices cause each computing device of the plurality of computing devices to: (aa) select a set of observation vectors from the subset of the training dataset stored at each computing device; (ab) execute the neural network model using the distributed explore phase optimization parameter values with the selecte

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Multiprogramming arrangements · CPC title

  • G06N3/084Primary

    Backpropagation, e.g. using gradient descent · CPC title

  • Architecture, e.g. interconnection topology · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10360500B2 cover?
A computing system provides distributed training of a neural network model. Explore phase options, exploit phase options, a subset of a training dataset, and a validation dataset are distributed to a plurality of computing devices. (a) Execution of the model by the computing devices is requested using the subset stored at each computing device. (b) A first result of the execution is received fr…
Who is the assignee on this patent?
Sas Inst Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 23 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).