Signal processing device, signal processing method, signal processing program, learning device, learning method, and learning program

US12354620B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12354620-B2
Application numberUS-202018020084-A
CountryUS
Kind codeB2
Filing dateAug 13, 2020
Priority dateAug 13, 2020
Publication dateJul 8, 2025
Grant dateJul 8, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A signal processing device includes processing circuitry configured to receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes, and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal, with a neural network by using a feature value of the mixture audio signal and the extraction target information.

First claim

Opening claim text (preview).

The invention claimed is: 1. A signal processing device comprising: processing circuitry configured to: receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrate information of the mixture audio signal and the extraction target information; and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information. 2. The signal processing device according to claim 1 , wherein the extraction target information is a target class vector indicating, by a vector, which audio class of the audio signal is to be extracted from the mixture audio signal, the processing circuitry is further configured to perform processing of embedding the target class vector by using a neural network, and output a result of extracting the audio signal of the audio class indicated by the target class vector from the mixture audio signal with the neural network by using a feature value obtained by integrating a feature value of the mixture audio signal and the target class vector after the embedding processing. 3. The signal processing device according to claim 1 , wherein the processing circuitry is further configured to receive an input of a target class vector indicating, by a vector, which audio class of the audio signal is to be removed from the mixture audio signal, and output a result of removing the audio signal of the audio class indicated by the target class vector from the mixture audio signal with the neural network by using a feature value obtained by integrating the target class vector after an embedding processing to a feature value of the mixture audio signal. 4. A signal processing method executed by a signal processing device, the signal processing method comprising: receiving an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrate information of the mixture audio signal and the extraction target information; and outputting a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information. 5. A non-transitory computer-readable recording medium storing therein a signal processing program that causes a computer to execute a process comprising: receiving an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrating information of the mixture audio signal and the extraction target information; and outputting a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information. 6. The signal processing device according to claim 1 , wherein the information of the mixture audio signal includes a feature value of the mixture audio signal, and the processing circuitry is configured to: obtain a feature value by integrating the feature value of the mixture audio signal and the extraction target information; and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using the feature value obtained by the integrating and the feature value of the mixture audio signal. 7. The signal processing device according to claim 1 , wherein the processing circuitry is configured to perform processing of embedding the extraction target information by using a neural network. 8. The signal processing device according to claim 1 , wherein the extraction target information is a target class vector indicating, by a vector, which audio class of the audio signal is to be extracted from the mixture audio signal, the processing circuitry is further configured to perform processing of embedding the target class vector by using a neural network. 9. The signal processing device according to claim 1 , wherein the processing circuitry is configured to output the result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using the result of the integration and an intermediate feature value of the mixture audio signal. 10. The signal processing device according to claim 1 , wherein the processing circuitry is configured to output the result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using an intermediate feature value derived based on the result of the integration and the intermediate feature value of the mixture audio signal.

Assignees

Inventors

Classifications

  • using neural networks · CPC title

  • for comparison or discrimination · CPC title

  • G10L21/028Primary

    using properties of sound source · CPC title

  • Voice signal separating · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12354620B2 cover?
A signal processing device includes processing circuitry configured to receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes, and output a result of extracting the audio signal of the audio class indicated by the extraction target i…
Who is the assignee on this patent?
Nippon Telegraph & Telephone
What technology area does this patent fall under?
Primary CPC classification G10L21/028. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 08 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).