Cognitive function estimation device, cognitive function estimation method, and storage medium
US-2024138750-A1 · May 2, 2024 · US
US2025246185A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2025246185-A1 |
| Application number | US-202519184526-A |
| Country | US |
| Kind code | A1 |
| Filing date | Apr 21, 2025 |
| Priority date | Nov 6, 2023 |
| Publication date | Jul 31, 2025 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method and apparatus for classifying generated speech are disclosed. The method for classifying generated speech includes: applying a one-dimensional convolution operation to raw speech data to embed the raw speech data into a feature space and extract a feature vector; quantizing the feature vector by applying it to a residual vector quantizer; and applying the quantized result to a classifier model including a natural language processing model to output a classification label.
Opening claim text (preview).
What is claimed is: 1 . A method for classifying generated speech, comprising: applying a one-dimensional convolution operation to raw speech data to embed the raw speech data into a feature space and extract a feature vector; quantizing the feature vector by applying the feature vector to a residual vector quantizer; and applying the quantized result to a classifier model comprising a natural language processing model to output a classification label. 2 . The method for classifying generated speech according to claim 1 , wherein the natural language processing model comprises a BERT (Bidirectional Encoder Representations from Transformers) language model, and wherein the classifier model is configured to output the classification label indicating one of generated speech and real speech based on an output of the BERT language model. 3 . The method for classifying generated speech according to claim 2 , wherein the quantized result is represented as a vector reflecting overall contextual structure through the BERT language model, and the vector is passed through a fully connection layer and a softmax activation function of the classifier model to output the classification label indicating one of generated speech and real speech. 4 . The method for classifying generated speech according to claim 1 , wherein the residual vector quantizer is configured to quantize the feature vector, which is a one-dimension array of real values, into positive integer values. 5 . The method for classifying generated speech according to claim 1 , wherein the residual vector quantizer is configured to quantize the feature vector differently according to a length of the raw speech data. 6 . A non-transitory computer-readable recording medium storing a program code for executing the method of claim 1 . 7 . An apparatus for classifying generated speech, comprising; a feature extractor configured to apply a one-dimensional convolution operation to raw speech data to embed the raw speech data into a feature space and to extract a feature vector; a quantizer disposed downstream of the feature extractor and configured to quantize the feature vector; and a classifier model configured to receive an output of the quantizer and to output a speech classification result with contextual awareness. 8 . The apparatus for classifying generated speech according to claim 7 , wherein the quantizer is a residual vector quantizer, and wherein the quantizer is configured to quantize the feature vector, which is a one-dimension array of real values, into positive integer values. 9 . The apparatus for classifying generated speech according to claim 7 , wherein the residual vector quantizer is configured to quantize the feature vector differently according to a length of the raw speech data. 10 . The apparatus for classifying generated speech according to claim 7 , wherein the classifier model comprises a natural language processing model including a BERT (Bidirectional Encoder Representations from Transformers) language model at a front end, wherein a fully connected layer and a softmax activation layer are disposed at a rear end of the BERT language model, and wherein the output of the quantizer is represented as a vector reflecting overall contextual structure through the BERT language model, and is passed through the fully connected layer and the softmax activation layer to output a classification label indicating one of generated speech and real speech.
Semantic analysis · CPC title
Artificial neural networks; Connectionist approaches · CPC title
for comparison or discrimination · CPC title
using neural networks · CPC title
Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.