Method and apparatus for classifying generated speech

US2025246185A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025246185-A1
Application numberUS-202519184526-A
CountryUS
Kind codeA1
Filing dateApr 21, 2025
Priority dateNov 6, 2023
Publication dateJul 31, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and apparatus for classifying generated speech are disclosed. The method for classifying generated speech includes: applying a one-dimensional convolution operation to raw speech data to embed the raw speech data into a feature space and extract a feature vector; quantizing the feature vector by applying it to a residual vector quantizer; and applying the quantized result to a classifier model including a natural language processing model to output a classification label.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for classifying generated speech, comprising: applying a one-dimensional convolution operation to raw speech data to embed the raw speech data into a feature space and extract a feature vector; quantizing the feature vector by applying the feature vector to a residual vector quantizer; and applying the quantized result to a classifier model comprising a natural language processing model to output a classification label. 2 . The method for classifying generated speech according to claim 1 , wherein the natural language processing model comprises a BERT (Bidirectional Encoder Representations from Transformers) language model, and wherein the classifier model is configured to output the classification label indicating one of generated speech and real speech based on an output of the BERT language model. 3 . The method for classifying generated speech according to claim 2 , wherein the quantized result is represented as a vector reflecting overall contextual structure through the BERT language model, and the vector is passed through a fully connection layer and a softmax activation function of the classifier model to output the classification label indicating one of generated speech and real speech. 4 . The method for classifying generated speech according to claim 1 , wherein the residual vector quantizer is configured to quantize the feature vector, which is a one-dimension array of real values, into positive integer values. 5 . The method for classifying generated speech according to claim 1 , wherein the residual vector quantizer is configured to quantize the feature vector differently according to a length of the raw speech data. 6 . A non-transitory computer-readable recording medium storing a program code for executing the method of claim 1 . 7 . An apparatus for classifying generated speech, comprising; a feature extractor configured to apply a one-dimensional convolution operation to raw speech data to embed the raw speech data into a feature space and to extract a feature vector; a quantizer disposed downstream of the feature extractor and configured to quantize the feature vector; and a classifier model configured to receive an output of the quantizer and to output a speech classification result with contextual awareness. 8 . The apparatus for classifying generated speech according to claim 7 , wherein the quantizer is a residual vector quantizer, and wherein the quantizer is configured to quantize the feature vector, which is a one-dimension array of real values, into positive integer values. 9 . The apparatus for classifying generated speech according to claim 7 , wherein the residual vector quantizer is configured to quantize the feature vector differently according to a length of the raw speech data. 10 . The apparatus for classifying generated speech according to claim 7 , wherein the classifier model comprises a natural language processing model including a BERT (Bidirectional Encoder Representations from Transformers) language model at a front end, wherein a fully connected layer and a softmax activation layer are disposed at a rear end of the BERT language model, and wherein the output of the quantizer is represented as a vector reflecting overall contextual structure through the BERT language model, and is passed through the fully connected layer and the softmax activation layer to output a classification label indicating one of generated speech and real speech.

Assignees

Inventors

Classifications

  • Semantic analysis · CPC title

  • Artificial neural networks; Connectionist approaches · CPC title

  • for comparison or discrimination · CPC title

  • using neural networks · CPC title

  • G10L17/26Primary

    Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025246185A1 cover?
A method and apparatus for classifying generated speech are disclosed. The method for classifying generated speech includes: applying a one-dimensional convolution operation to raw speech data to embed the raw speech data into a feature space and extract a feature vector; quantizing the feature vector by applying it to a residual vector quantizer; and applying the quantized result to a classifi…
Who is the assignee on this patent?
Foundation Soongsil Univ Industry Cooperation
What technology area does this patent fall under?
Primary CPC classification G10L17/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jul 31 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).