Training multiple neural networks with different accuracy

US2016379113A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016379113-A1
Application numberUS-201615260460-A
CountryUS
Kind codeA1
Filing dateSep 9, 2016
Priority dateMay 23, 2014
Publication dateDec 29, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a deep neural network. One of the methods includes generating a plurality of feature vectors that each model a different portion of an audio waveform, generating a first posterior probability vector for a first feature vector using a first neural network, determining whether one of the scores in the first posterior probability vector satisfies a first threshold value, generating a second posterior probability vector for each subsequent feature vector using a second neural network, wherein the second neural network is trained to identify the same key words and key phrases and includes more inner layer nodes than the first neural network, and determining whether one of the scores in the second posterior probability vector satisfies a second threshold value.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system comprising: one or more computers and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising: training a first neural network to identify a set of features using a first training set, the first neural network comprising a first quantity of nodes; training a second neural network to identify the set of features using a second training set, the second neural network comprising a second quantity of nodes, greater than the first quantity of nodes; and providing the first neural network, and the second neural network to a user device that uses both the first neural network and the second neural network to analyze a data set and determine whether the data set comprises a digital representation of a feature from the set of features. 2 . The system of claim 1 , wherein: the set of features comprises key words and key phrases; and the user device uses both the first neural network and the second neural network to analyze an audio waveform and determine whether a digital representation of one of the key words or the key phrases from the set of features is included in the audio waveform. 3 . The system of claim 2 , the operations comprising: providing a feature extraction module, a first posterior handling module, and a second posterior handling module to the user device with the first neural network and the second neural network, wherein the user device uses the feature extraction module, the first posterior handling module, and the second posterior handling module to perform the analysis of the data set. 4 . The system of claim 3 , wherein the first posterior handling module and the second posterior handling module comprise the same posterior handling module. 5 . The system of claim 1 , wherein the set of features comprise computer vision features, handwriting recognition features, text classification features, or authentication features. 6 . The system of claim 1 , wherein the first training set and the second training set comprise the same training set. 7 . The system of claim 6 , the operations comprising: training the first neural network for a first quantity of iterations; and training the second neural network for a second quantity of iterations, greater than the first quantity of iterations. 8 . The system of claim 1 , wherein a ratio between the first quantity of nodes and the second quantity of nodes identifies a performance cost savings of the user device when the user device analyzes a particular portion of the data set with the first neural network and not the second neural network. 9 . A method comprising: training a first neural network to identify a set of features using a first training set, the first neural network comprising a first quantity of nodes; training a second neural network to identify the set of features using a second training set, the second neural network comprising a second quantity of nodes, greater than the first quantity of nodes; and providing the first neural network, and the second neural network to a user device that uses both the first neural network and the second neural network to analyze a data set and determine whether the data set comprises a digital representation of a feature from the set of features. 10 . The method of claim 9 , wherein: the set of features comprises key words and key phrases; and the user device uses both the first neural network and the second neural network to analyze an audio waveform and determine whether a digital representation of one of the key words or the key phrases from the set of features is included in the audio waveform. 11 . The method of claim 10 , comprising: providing a feature extraction module, a first posterior handling module, and a second posterior handling module to the user device with the first neural network and the second neural network, wherein the user device uses the feature extraction module, the first posterior handling module, and the second posterior handling module to perform the analysis of the data set. 12 . The method of claim 11 , wherein the first posterior handling module and the second posterior handling module comprise the same posterior handling module. 13 . The method of claim 9 , wherein the set of features comprise computer vision features, handwriting recognition features, text classification features, or authentication features. 14 . The method of claim 9 , wherein the first training set and the second training set comprise the same training set. 15 . The method of claim 14 , comprising: training the first neural network for a first quantity of iterations; and training the second neural network for a second quantity of iterations, greater than the first quantity of iterations. 16 . The method of claim 9 , wherein a ratio between the first quantity of nodes and the second quantity of nodes identifies a performance cost savings of the user device when the user device analyzes a particular portion of the data set with the first neural network and not the second neural network. 17 . A non-transitory computer-readable medium storing software comprising instructions executable by one or more computers which, upon such execution, cause the one or more computers to perform operations comprising: training a first neural network to identify a set of features using a first training set, the first neural network comprising a first quantity of nodes; training a second neural network to identify the set of features using a second training set, the second neural network comprising a second quantity of nodes, greater than the first quantity of nodes; and providing the first neural network, and the second neural network to a user device that uses both the first neural network and the second neural network to analyze a data set and determine whether the data set comprises a digital representation of a feature from the set of features. 18 . The computer-readable medium of claim 17 , wherein: the set of features comprises key words and key phrases; and the user device uses both the first neural network and the second neural network to analyze an audio waveform and determine whether a digital representation of one of the key words or the key phrases from the set of features is included in the audio waveform. 19 . The computer-readable medium of claim 18 , the operations comprising: providing a feature extraction module, a first posterior handling module, and a second posterior handling module to the user device with the first neural network and the second neural network, wherein the user device uses the feature extraction module, the first posterior handling module, and the second posterior handling module to perform the analysis of the data set. 20 . The computer-readable medium of claim 19 , wherein the first posterior handling module and the second posterior handling module comprise the same posterior handling module.

Assignees

Inventors

Classifications

  • Probabilistic or stochastic networks · CPC title

  • Combinations of networks · CPC title

  • using artificial neural networks · CPC title

  • Ontology · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016379113A1 cover?
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a deep neural network. One of the methods includes generating a plurality of feature vectors that each model a different portion of an audio waveform, generating a first posterior probability vector for a first feature vector using a first neural network, determining whether one of the s…
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 29 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).