Normalizing electronic communications using a neural-network normalizer and a neural-network flagger

US9552547B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9552547-B2
Application numberUS-201514937810-A
CountryUS
Kind codeB2
Filing dateNov 10, 2015
Priority dateMay 29, 2015
Publication dateJan 24, 2017
Grant dateJan 24, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Electronic communications can be normalized using neural networks. For example, an electronic representation of a noncanonical communication can be received. A normalized version of the noncanonical communication can be determined using a normalizer including a neural network. The neural network can receive a single vector at an input layer of the neural network and transform an output of a hidden layer of the neural network into multiple values that sum to a total value of one. Each value of the multiple values can be a number between zero and one and represent a probability of a particular character being in a particular position in the normalized version of the noncanonical communication. The neural network can determine the normalized version of the noncanonical communication based on the multiple values. Whether the normalized version should be output can be determined based on a result from a flagger including another neural network.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer readable medium comprising program code executable by a processor for causing the processor to: receive an electronic representation of a noncanonical communication; feed a vector that is representative of the noncanonical communication as input to a normalizer, wherein the normalizer is operable for implementing a first neural network to: receive the vector at an input layer of the first neural network, perform matrix operations on the vector using a plurality of hidden layers to generate hidden values, provide hidden values from a final hidden layer of the plurality of hidden layers to an output layer of the first neural network, perform, at the output layer, a softmax operation on the hidden values to generate a plurality of values representing probabilities of particular characters being in particular positions in a normalized version of the noncanonical communication, and determine the normalized version of the noncanonical communication based on the plurality of values, wherein the normalized version of the noncanonical communication is different from the noncanonical communication; feed the vector as input to a flagger for operating a second neural network that is trained separately from the first neural network; determine, based on a result from the flagger, that the normalized version of the noncanonical communication should be outputted; and output the normalized version of the noncanonical communication or a corrected version of the normalized version of the noncanonical communication, wherein the corrected version of the noncanonical communication is different from the noncanonical communication. 2. The non-transitory computer readable medium of claim 1 , further comprising program code executable by the processor for causing the processor to: in response to determining that the normalized version of the noncanonical communication should be outputted based on the result from the flagger: output the normalized version of the noncanonical communication in response to determining that the normalized version of the noncanonical communication is present in a database; and output the corrected version of the normalized version of the noncanonical communication in response to determining that the normalized version of the noncanonical communication is not present in the database. 3. The non-transitory computer readable medium of claim 2 , further comprising program code executable by the processor for causing the processor to: determine the corrected version using a conformer, the conformer configured to: determine a Levenshtein distance between the normalized version of the noncanonical communication and each word of a plurality of words in the database; and select as the corrected version a word from the plurality of words in the database associated with a smallest Levenshtein distance. 4. The non-transitory computer readable medium of claim 1 , further comprising program code executable by the processor for causing the processor to: preprocess the noncanonical communication prior to determining the normalized version of the noncanonical communication by: determining a plurality of vectors associated with the noncanonical communication by transforming each character in the noncanonical communication into a vector comprising a predetermined length; and concatenating the plurality of vectors together into the vector, the vector comprising another predetermined length. 5. The non-transitory computer readable medium of claim 1 , wherein each hidden layer of the plurality of hidden layers comprises a layer of units between the input layer and the output layer of the first neural network. 6. The non-transitory computer readable medium of claim 1 , wherein the first neural network is configured so that every unit of the first neural network only propagates an output value to a subsequent layer of the first neural network. 7. The non-transitory computer readable medium of claim 1 , wherein the input layer of the first neural network is a first input layer, the output layer of the first neural network is a first output layer, the softmax operation is a first softmax operation, and wherein the second neural network of the flagger is configured to: receive the vector at a second input layer of the second neural network; perform a plurality of matrix operations on the vector using at least two hidden layers of the second neural network to generate a plurality of hidden values, wherein each hidden layer of the at least two hidden layers comprises a layer of units between the second input layer and a second output layer of the second neural network; provide one or more hidden values of the plurality of hidden values to the second output layer of the second neural network; perform, at the second output layer, a second softmax operation on the one or more hidden values to generate a first value indicating a first probability that the normalized version of the noncanonical communication should be output and a second value indicating a second probability that the normalized version of the noncanonical communication should not be output; and determine that the normalized version of the noncanonical communication should be output in response to the first value being greater than the second value, and that the normalized version of the noncanonical communication should not be output in response to the second value being greater than the first value. 8. The non-transitory computer readable medium of claim 1 , further comprising program code executable by the processor for causing the processor to: determine the normalized version of the noncanonical communication using the normalizer simultaneously and in parallel to determining that the normalized version of the noncanonical communication should be outputted using the flagger. 9. The non-transitory computer readable medium of claim 1 , further comprising program code executable by the processor for causing the processor to: include the normalized version of the noncanonical communication in a data set for use in textual analysis; and perform textual analysis on the data set to determine one or more trends indicated by the data set. 10. A method comprising: receiving an electronic representation of a noncanonical communication; feeding a vector that is representative of the noncanonical communication as input to a normalizer, wherein the normalizer implements a first neural network to: receive the vector at an input layer of the first neural network, perform matrix operations on the vector using a plurality of hidden layers to generate hidden values, provide hidden values from a final hidden layer of the plurality of hidden layers to an output layer of the first neural network, perform, at the output layer, a softmax operation on the hidden values to generate a plurality of values representing probabilities of particular characters being in particular positions in a normalized version of the noncanonical communication, and determine the normalized version of the noncanonical communication based on the plurality of values, wherein the normalized version of the noncanonical communication is different from the noncanonical communication; feeding the vector as input to a flagger that operates a second neural network that is trained separately from the first neural network; determining, based on a result from the flagger, that the normalized version of the noncanonical communication should be outputted; and outputting the normalized version of the noncanonical communication or a corrected version of the normalized version of the noncanonical communication, wherein the corrected version of

Assignees

Inventors

Classifications

  • G06N3/084Primary

    Backpropagation, e.g. using gradient descent · CPC title

  • Combinations of networks · CPC title

  • Orthographic correction, e.g. spell checking or vowelisation · CPC title

  • Semantic analysis · CPC title

  • Content adaptation, e.g. replacement of unsuitable content · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9552547B2 cover?
Electronic communications can be normalized using neural networks. For example, an electronic representation of a noncanonical communication can be received. A normalized version of the noncanonical communication can be determined using a normalizer including a neural network. The neural network can receive a single vector at an input layer of the neural network and transform an output of a hid…
Who is the assignee on this patent?
Sas Inst Inc, Univ North Carolina State
What technology area does this patent fall under?
Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 24 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).