What technology area does this patent fall under?

Primary CPC classification G10L21/028. Mapped technology areas include Physics.

When was this patent published?

Publication date: Jul 8, 2025 (kind code B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Signal processing device, signal processing method, signal processing program, learning device, learning method, and learning program

US12354620B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12354620-B2
Application number	US-202018020084-A
Country	US
Kind code	B2
Filing date	Aug 13, 2020
Priority date	Aug 13, 2020
Publication date	Jul 8, 2025
Grant date	Jul 8, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A signal processing device includes processing circuitry configured to receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes, and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal, with a neural network by using a feature value of the mixture audio signal and the extraction target information.
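The data flow described in the abstract can be illustrated with a toy sketch. This is not the patented implementation: the weights are random rather than trained, the integration step is assumed to be an element-wise product, and the output is assumed to be a mask applied to the mixture features. The names `extract`, `embed_w`, `mask_w`, `NUM_CLASSES`, and `FEAT_DIM` are hypothetical and do not appear in the patent.

```python
import math
import random

NUM_CLASSES = 3   # hypothetical number of audio classes (e.g. speech, dog bark, siren)
FEAT_DIM = 4      # hypothetical per-frame feature size


def make_matrix(rows, cols, rng):
    """Random weights standing in for trained network parameters."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vec):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def extract(mixture_frames, target_class_vector, embed_w, mask_w):
    """Return masked frames for the classes flagged in target_class_vector."""
    # Embed the extraction target information into the feature space.
    embedding = matvec(embed_w, target_class_vector)
    extracted = []
    for frame in mixture_frames:
        # Integrate mixture features and target embedding
        # (assumption: element-wise product).
        integrated = [f * e for f, e in zip(frame, embedding)]
        # Predict a mask in (0, 1) per feature and apply it to the mixture frame.
        mask = [sigmoid(z) for z in matvec(mask_w, integrated)]
        extracted.append([m * f for m, f in zip(mask, frame)])
    return extracted


rng = random.Random(0)
embed_w = make_matrix(FEAT_DIM, NUM_CLASSES, rng)
mask_w = make_matrix(FEAT_DIM, FEAT_DIM, rng)
# Toy non-negative "magnitude" features for five frames of a mixture.
mixture = [[abs(rng.gauss(0.0, 1.0)) for _ in range(FEAT_DIM)] for _ in range(5)]
target = [0.0, 1.0, 0.0]  # one-hot: extract the class at index 1
estimate = extract(mixture, target, embed_w, mask_w)
```

Changing `target` changes the embedding and therefore the mask, so a single network can be steered toward different audio classes at inference time. That steering is the core idea the abstract describes.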

First claim

Opening claim text (preview).

The invention claimed is:

1. A signal processing device comprising: processing circuitry configured to: receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrate information of the mixture audio signal and the extraction target information; and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information.

2. The signal processing device according to claim 1, wherein the extraction target information is a target class vector indicating, by a vector, which audio class of the audio signal is to be extracted from the mixture audio signal, the processing circuitry is further configured to perform processing of embedding the target class vector by using a neural network, and output a result of extracting the audio signal of the audio class indicated by the target class vector from the mixture audio signal with the neural network by using a feature value obtained by integrating a feature value of the mixture audio signal and the target class vector after the embedding processing.

3. The signal processing device according to claim 1, wherein the processing circuitry is further configured to receive an input of a target class vector indicating, by a vector, which audio class of the audio signal is to be removed from the mixture audio signal, and output a result of removing the audio signal of the audio class indicated by the target class vector from the mixture audio signal with the neural network by using a feature value obtained by integrating the target class vector after an embedding processing to a feature value of the mixture audio signal.

4. A signal processing method executed by a signal processing device, the signal processing method comprising: receiving an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrate information of the mixture audio signal and the extraction target information; and outputting a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information.

5. A non-transitory computer-readable recording medium storing therein a signal processing program that causes a computer to execute a process comprising: receiving an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrating information of the mixture audio signal and the extraction target information; and outputting a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information.

6. The signal processing device according to claim 1, wherein the information of the mixture audio signal includes a feature value of the mixture audio signal, and the processing circuitry is configured to: obtain a feature value by integrating the feature value of the mixture audio signal and the extraction target information; and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using the feature value obtained by the integrating and the feature value of the mixture audio signal.

7. The signal processing device according to claim 1, wherein the processing circuitry is configured to perform processing of embedding the extraction target information by using a neural network.

8. The signal processing device according to claim 1, wherein the extraction target information is a target class vector indicating, by a vector, which audio class of the audio signal is to be extracted from the mixture audio signal, the processing circuitry is further configured to perform processing of embedding the target class vector by using a neural network.

9. The signal processing device according to claim 1, wherein the processing circuitry is configured to output the result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using the result of the integration and an intermediate feature value of the mixture audio signal.

10. The signal processing device according to claim 1, wherein the processing circuitry is configured to output the result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using an intermediate feature value derived based on the result of the integration and the intermediate feature value of the mixture audio signal.
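Claims 2 and 3 describe a target class vector that selects which audio class to extract or to remove. As a rough illustration only, the sketch below builds such a vector and contrasts extraction with removal using a complementary mask. The class inventory, the n-hot encoding, the hand-written mask values, and the complement trick are all assumptions made for this example. The claims instead have the neural network produce the removal result directly from the embedded vector.

```python
CLASSES = ["speech", "dog_bark", "siren"]  # hypothetical class inventory


def target_class_vector(selected):
    """n-hot vector: 1.0 for each selected class, 0.0 otherwise."""
    unknown = set(selected) - set(CLASSES)
    if unknown:
        raise ValueError(f"unknown audio classes: {sorted(unknown)}")
    return [1.0 if name in selected else 0.0 for name in CLASSES]


def apply_mask(frame, mask):
    """Scale each feature of a frame by the matching mask value."""
    return [m * x for m, x in zip(mask, frame)]


def complement(mask):
    """Mask that keeps whatever the extraction mask suppresses."""
    return [1.0 - m for m in mask]


frame = [0.8, 0.5, 0.2, 0.9]  # toy mixture magnitudes for one frame
mask = [0.9, 0.1, 0.0, 0.5]   # toy values; a trained network would predict these

vector = target_class_vector({"siren"})        # which class to act on
extracted = apply_mask(frame, mask)            # keep the selected class
removed = apply_mask(frame, complement(mask))  # keep everything else
```

In this toy setup the extracted and removed outputs sum back to the mixture frame, which makes the two modes easy to compare. A trained system need not satisfy that identity.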

Assignees

Nippon Telegraph & Telephone

Inventors

Classifications

G10L25/30
using neural networks · CPC title
G10L25/51
for comparison or discrimination · CPC title
G10L21/028 · Primary
using properties of sound source · CPC title
G10L21/0272 · Primary
Voice signal separating · CPC title

Patent family

Related publications grouped by family.

View patent family 80247110

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12354620B2 cover?: A signal processing device includes processing circuitry configured to receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes, and output a result of extracting the audio signal of the audio class indicated by the extraction target i…
Who is the assignee on this patent?: Nippon Telegraph & Telephone