Information processing device and method, and program
US-2021281739-A1 · Sep 9, 2021 · US
US12354620B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12354620-B2 |
| Application number | US-202018020084-A |
| Country | US |
| Kind code | B2 |
| Filing date | Aug 13, 2020 |
| Priority date | Aug 13, 2020 |
| Publication date | Jul 8, 2025 |
| Grant date | Jul 8, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A signal processing device includes processing circuitry configured to receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes, and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal, with a neural network by using a feature value of the mixture audio signal and the extraction target information.
Opening claim text (preview).
The invention claimed is: 1. A signal processing device comprising: processing circuitry configured to: receive an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrate information of the mixture audio signal and the extraction target information; and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information. 2. The signal processing device according to claim 1 , wherein the extraction target information is a target class vector indicating, by a vector, which audio class of the audio signal is to be extracted from the mixture audio signal, the processing circuitry is further configured to perform processing of embedding the target class vector by using a neural network, and output a result of extracting the audio signal of the audio class indicated by the target class vector from the mixture audio signal with the neural network by using a feature value obtained by integrating a feature value of the mixture audio signal and the target class vector after the embedding processing. 3. The signal processing device according to claim 1 , wherein the processing circuitry is further configured to receive an input of a target class vector indicating, by a vector, which audio class of the audio signal is to be removed from the mixture audio signal, and output a result of removing the audio signal of the audio class indicated by the target class vector from the mixture audio signal with the neural network by using a feature value obtained by integrating the target class vector after an embedding processing to a feature value of the mixture audio signal. 4. A signal processing method executed by a signal processing device, the signal processing method comprising: receiving an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrate information of the mixture audio signal and the extraction target information; and outputting a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information. 5. A non-transitory computer-readable recording medium storing therein a signal processing program that causes a computer to execute a process comprising: receiving an input of extraction target information indicating which audio class of an audio signal is to be extracted from a mixture audio signal constituted by a mixture of audio signals of a plurality of audio classes; integrating information of the mixture audio signal and the extraction target information; and outputting a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with a neural network by using a result of the integration of the information of the mixture audio signal and the extraction target information. 6. The signal processing device according to claim 1 , wherein the information of the mixture audio signal includes a feature value of the mixture audio signal, and the processing circuitry is configured to: obtain a feature value by integrating the feature value of the mixture audio signal and the extraction target information; and output a result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using the feature value obtained by the integrating and the feature value of the mixture audio signal. 7. The signal processing device according to claim 1 , wherein the processing circuitry is configured to perform processing of embedding the extraction target information by using a neural network. 8. The signal processing device according to claim 1 , wherein the extraction target information is a target class vector indicating, by a vector, which audio class of the audio signal is to be extracted from the mixture audio signal, the processing circuitry is further configured to perform processing of embedding the target class vector by using a neural network. 9. The signal processing device according to claim 1 , wherein the processing circuitry is configured to output the result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using the result of the integration and an intermediate feature value of the mixture audio signal. 10. The signal processing device according to claim 1 , wherein the processing circuitry is configured to output the result of extracting the audio signal of the audio class indicated by the extraction target information from the mixture audio signal with the neural network by using an intermediate feature value derived based on the result of the integration and the intermediate feature value of the mixture audio signal.
using neural networks · CPC title
for comparison or discrimination · CPC title
using properties of sound source · CPC title
Voice signal separating · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.