Method and apparatus for classifying class, to which sentence belongs, using deep neural network

US11568240B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11568240-B2
Application numberUS-201816613317-A
CountryUS
Kind codeB2
Filing dateMay 16, 2018
Priority dateMay 16, 2017
Publication dateJan 31, 2023
Grant dateJan 31, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided are a method and apparatus for classifying a sentence into a class by using a deep neural network. The method includes respectively training first and second sentences by using first and second neural networks, obtaining a contrastive loss based on first and second feature vectors generated as output data of the training, and information about whether classes to which the first and second sentences belong are the same, and repeating the training in such a manner that the contrastive loss has a maximum value.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method of classifying a sentence into a class by using a deep neural network, the method comprising: training a first feature vector by using a first neural network and using a first sentence comprising one or more words, and a first class to which the first sentence belongs, as input data; training a second feature vector by using a second neural network and using a second sentence and a second class to which the second sentence belongs, as input data; obtaining a contrastive loss quantifying a representational similarity between the first and second sentences by using a dot product of the first feature vector and the second feature vector, and an equation for indicating, as a number, information about whether the first class is the same as the second class; and repeating the trainings using the first and second neural networks, in such a manner that the contrastive loss has a maximum value. 2. The method of claim 1 , further comprising: receiving an utterance input from a user; recognizing the received utterance as a sentence; and extracting one or more words comprised in the recognized sentence, and transforming the one or more words into one or more word vectors, wherein the training of the first feature vector comprises: generating a sentence vector by arranging the one or more word vectors in a form of a matrix, and training the first feature vector by inputting the sentence vector to the first neural network as input data. 3. The method of claim 1 , wherein a plurality of sentences and a plurality of classes to which the plurality of sentences belong are stored in a database, and wherein the second sentence and the second class are randomly extracted from the database. 4. The method of claim 1 , wherein the equation outputs 1 when the first class is the same as the second class, and outputs 0 when the first class is not the same as the second class. 5. The method of claim 1 , wherein the training using the first neural network and the training using the second neural network are simultaneously performed. 6. An electronic device for classifying a sentence into a class by using a deep neural network, the electronic device comprising a processor configured to perform training by using a neural network, wherein the processor is further configured to: train a first feature vector by using a first neural network and using a first sentence comprising one or more words, and a first class to which the first sentence belongs, as input data, train a second feature vector by using a second neural network and using a second sentence and a second class to which the second sentence belongs, as input data, obtain a contrastive loss quantifying a representational similarity between the first and second sentences by using a dot product of the first feature vector and the second feature vector, and an equation for indicating, as a number, information about whether the first class is the same as the second class, and repeat the trainings using the first and second neural networks, in such a manner that the contrastive loss has a maximum value. 7. The electronic device of claim 6 , further comprising an utterance inputter configured to receive an utterance input from a user, wherein the processor is further configured to recognize the received utterance as a sentence, and extract one or more words comprised in the recognized sentence, and transform the one or more words into one or more word vectors. 8. The electronic device of claim 7 , wherein the processor is further configured to generate a sentence vector by arranging the one or more word vectors in a form of a matrix, and train the first feature vector by inputting the sentence vector to the first neural network as input data. 9. The electronic device of claim 6 , further comprising: a database storing a plurality of sentences and a plurality of classes to which the plurality of sentences belong, wherein the processor is further configured to randomly extract the second sentence and the second class from the database and input the second sentence and the second class to the second neural network as input data. 10. The electronic device of claim 6 , wherein the equation outputs 1 when the first class is the same as the second class, and outputs 0 when the first class is not the same as the second class. 11. The electronic device of claim 6 , wherein the processor is further configured to transform the first sentence into a matrix comprising one or more word vectors, input the transformed matrix to a convolutional neural network (CNN) as input data, generate feature maps by applying a plurality of filters, and extract the first feature vector by passing the feature maps through a max pooling layer. 12. The electronic device of claim 6 , wherein the processor is further configured to simultaneously perform the training using the first neural network and the training using the second neural network. 13. A computer program product comprising a non-transitory computer-readable recording medium, the non-transitory computer-readable recording medium comprising instructions that are executed to: train a first feature vector by using a first neural network and using a first sentence comprising one or more words, and a first class to which the first sentence belongs, as input data; train a second feature vector by using a second neural network and using a second sentence and a second class to which the second sentence belongs, as input data, obtain a contrastive loss quantifying a representational similarity between the first and second sentences by using a dot product of the first feature vector and the second feature vector, and an equation for indicating, as a number, information about whether the first class is the same as the second class, and repeat the trainings using the first and second neural networks, in such a manner that the contrastive loss has a maximum value.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11568240B2 cover?
Provided are a method and apparatus for classifying a sentence into a class by using a deep neural network. The method includes respectively training first and second sentences by using first and second neural networks, obtaining a contrastive loss based on first and second feature vectors generated as output data of the training, and information about whether classes to which the first and sec…
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 31 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).