Information processing device and method, and program

US2024340605A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024340605-A1
Application numberUS-202218575889-A
CountryUS
Kind codeA1
Filing dateFeb 25, 2022
Priority dateJul 12, 2021
Publication dateOct 10, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present technology relates to an information processing device and method, and a program that allow the voice of a speaker to be easily recognized. The information processing device includes an information processing unit configured to generate, on the basis of orientation information indicating an orientation of a listener, virtual location information indicating a location of the listener in a virtual space, the location being set by the listener, and virtual location information of a speaker, a voice of the speaker localized at a location corresponding to the orientation and location of the listener and the location of the speaker. The present technology can be applied to a tele-communication system.

First claim

Opening claim text (preview).

1 . An information processing device comprising an information processing unit configured to generate, on a basis of orientation information indicating an orientation of a listener, virtual location information indicating a location of the listener in a virtual space, the location being set by the listener, and virtual location information of a speaker, a voice of the speaker localized at a location corresponding to the orientation and location of the listener and the location of the speaker. 2 . The information processing device according to claim 1 , wherein a location of the speaker in the virtual space indicated by the virtual location information of the speaker is set by the listener. 3 . The information processing device according to claim 1 , further comprising a communication unit configured to receive the orientation information and the virtual location information of the listener from a client of the listener and transmit the voice of the speaker to the client of the listener. 4 . The information processing device according to claim 1 , wherein the information processing unit generates the voice of the speaker by performing acoustic processing including binaural processing. 5 . The information processing device according to claim 1 , wherein the information processing unit generates the voice of the speaker such that the voice of the speaker is clearly heard as a direction of the speaker as viewed from the listener becomes closer to a front direction of the listener. 6 . The information processing device according to claim 5 , wherein the information processing unit generates the voice of the speaker on a basis of directivity designated by the listener. 7 . The information processing device according to claim 1 , wherein the information processing unit generates the voice of the speaker such that the voice of the speaker is clearly heard as a front direction of the speaker becomes closer to a direction of the listener as viewed from the speaker. 8 . The information processing device according to claim 7 , wherein the information processing unit generates the voice of the speaker on a basis of directivity designated by the speaker. 9 . The information processing device according to claim 1 , wherein the information processing unit adjusts a location of one or a plurality of the speakers in the virtual space so as to make an inter-speaker angle formed by the direction of the speaker viewed from the listener and a direction of another speaker viewed from the listener greater than or equal to a predetermined minimum angle. 10 . The information processing device according to claim 9 , wherein in a case where the information processing unit fails to arrange all the speakers in the virtual space so as to make the inter-speaker angle between all the speakers greater than or equal to the minimum angle, the information processing unit calculates a degree of priority of the speaker on a basis of the voice of the speaker, and adjusts the location of one or a plurality of the speakers in the virtual space so as to make the inter-speaker angle between the speakers with a high degree of priority equal to the minimum angle. 11 . The information processing device according to claim 10 , wherein the information processing unit adjusts the location of one or a plurality of the speakers in the virtual space so as to make the inter-speaker angle between the speakers with a low degree of priority equal to an angle smaller than the minimum angle. 12 . The information processing device according to claim 10 , wherein the information processing unit adjusts the location of one or a plurality of the speakers in the virtual space such that a plurality of the speakers with the low degree of priority is arranged at a same location in the virtual space. 13 . The information processing device according to claim 10 , wherein the information processing unit calculates the degree of priority for each group including one or a plurality of the speakers. 14 . The information processing device according to claim 10 , wherein the information processing unit calculates the degree of priority based on a speaking frequency of the speaker. 15 . The information processing device according to claim 1 , wherein the information processing unit generates the voice of the speaker for each of a plurality of orientations including the orientation of the listener indicated by the orientation information. 16 . The information processing device according to claim 1 , wherein the information processing unit causes a display unit to display a virtual space image indicating a positional relation between the listener and the speaker in the virtual space. 17 . An information processing method comprising generating, on a basis of orientation information indicating an orientation of a listener, virtual location information indicating a location of the listener in a virtual space, the location being set by the listener, and virtual location information of a speaker, a voice of the speaker localized at a location corresponding to the orientation and location of the listener and the location of the speaker by an information processing device. 18 . A program causing a computer to execute processing, the processing comprising generating, on a basis of orientation information indicating an orientation of a listener, virtual location information indicating a location of the listener in a virtual space, the location being set by the listener, and virtual location information of a speaker, a voice of the speaker localized at a location corresponding to the orientation and location of the listener and the location of the speaker.

Assignees

Inventors

Classifications

  • Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution (control circuits for electronic adaptation of the sound field H04S7/30) · CPC title

  • Aspects of volume control, not necessarily automatic, in stereophonic sound systems · CPC title

  • For headphones · CPC title

  • Electronic adaptation of stereophonic sound system to listener position or orientation (H04S7/301 takes precedence) · CPC title

  • Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024340605A1 cover?
The present technology relates to an information processing device and method, and a program that allow the voice of a speaker to be easily recognized. The information processing device includes an information processing unit configured to generate, on the basis of orientation information indicating an orientation of a listener, virtual location information indicating a location of the listener…
Who is the assignee on this patent?
Sony Group Corp
What technology area does this patent fall under?
Primary CPC classification H04S7/303. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Oct 10 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).