Device and method for transmitting voice data of user in virtual space

US12411655B2 · US · B2

Patent metadata
Publication number: US-12411655-B2
Application number: US-202418403083-A
Country: US
Kind code: B2
Filing date: Jan 3, 2024
Priority date: Sep 8, 2022
Publication date: Sep 9, 2025
Grant date: Sep 9, 2025

Abstract

Official abstract text for this publication.

An example server for constructing a virtual space includes a memory configured to store computer-executable instructions and a processor configured to execute the instructions by accessing the memory. The instructions, when executed, cause the processor to extract first partial voice data corresponding to a target utterance from voice data of a first user received from a terminal of the first user among users in the virtual space; instruct a target terminal of the target user to reproduce the first partial voice data; and, based on transmission of second partial voice data of a second user to the target user being requested while the target terminal reproduces the first partial voice data, instruct the target terminal to display visual information generated based on the second partial voice data.
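The overlap handling described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the names `PartialVoice`, `Instruction`, and `TargetSession` are hypothetical, the `transcribe` callable stands in for whatever speech-to-text step produces the visual information, and tracking "currently reproducing" by playback duration is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PartialVoice:
    speaker: str       # user whose utterance this is
    audio: bytes       # extracted partial voice data
    duration_s: float  # playback length in seconds (assumed to be known)


@dataclass
class Instruction:
    kind: str          # "reproduce" (play audio) or "display" (show visual info)
    speaker: str
    payload: object    # audio bytes for "reproduce", text for "display"


class TargetSession:
    """Hypothetical per-target state kept by the server."""

    def __init__(self, transcribe: Callable[[bytes], str]):
        self._transcribe = transcribe  # assumed speech-to-text helper
        self._busy_until = 0.0         # time at which current playback ends

    def request(self, voice: PartialVoice, now: float) -> Instruction:
        # If the target terminal is still reproducing earlier partial voice
        # data, the new utterance is delivered as visual information instead.
        if now < self._busy_until:
            text = self._transcribe(voice.audio)
            return Instruction("display", voice.speaker, text)
        # Otherwise the terminal is instructed to reproduce the audio.
        self._busy_until = now + voice.duration_s
        return Instruction("reproduce", voice.speaker, voice.audio)
```

In this sketch, a second request arriving at `now=1.0` while a 2-second utterance that started at `now=0.0` is still playing yields a `"display"` instruction; a request arriving after playback has ended yields `"reproduce"`.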

First claim

Opening claim text (preview).

What is claimed is:

1. A server for constructing a virtual space, the server comprising: memory configured to store computer-executable instructions; and at least one processor, wherein the instructions, when executed, configure the at least one processor individually or collectively to control the server to at least: extract first partial voice data corresponding to a target utterance from voice data of a first user received from a terminal of the first user among users in the virtual space; determine a target user to receive the first partial voice data of the first user; instruct a target terminal of the target user to reproduce the first partial voice data; based on transmission to the target user of second partial voice data corresponding to an utterance from voice data of a second user being requested while the target terminal reproduces the first partial voice data, select partial voice data to instruct the target terminal to reproduce from among the first partial voice data and the second partial voice data; instruct the target terminal to reproduce the selected partial voice data; and instruct the target terminal to display visual information generated based on the non-selected one from among the first partial voice data and the second partial voice data.

2. A method performed by a server for constructing a virtual space, the method comprising: extracting first partial voice data corresponding to a target utterance from voice data of a first user received from a terminal of the first user among users in the virtual space; determining a target user to receive the first partial voice data of the first user; instructing a target terminal of the target user to reproduce the first partial voice data; and based on transmission to the target user of second partial voice data corresponding to an utterance from voice data of a second user being requested while the target terminal reproduces the first partial voice data, instructing the target terminal to reproduce the first partial voice data, but not the second partial voice data, and to display visual information including text converted from the utterance of the second user.

3. The method of claim 2, wherein the extracting of the first partial voice data comprises: detecting a start event and an end event from the voice data of the first user based on at least one of a gesture input of the first user or a portion of the voice data of the first user; and extracting from the voice data of the first user, as the first partial voice data, a portion corresponding to a time period between the start event and the end event.

4. The method of claim 2, further comprising: based on receiving the voice data of the first user from the terminal of the first user, starting transmission of the voice data of the first user to the users in the virtual space; based on detecting a start event from the voice data of the first user, stopping the transmission of the voice data of the first user to the users in the virtual space; and based on detecting an end event from the voice data of the first user, restarting the transmission of the voice data of the first user to the users in the virtual space.

5. The method of claim 2, wherein the instructing of the target terminal of the target user to reproduce the first partial voice data comprises: restricting transmission of the first partial voice data to a user among the users in the virtual space other than the determined target user.

6. The method of claim 2, wherein the instructing of the target terminal to display the visual information generated based on the second partial voice data comprises: instructing the target terminal to restrict reproduction of the second partial voice data.

7. The method of claim 2, further comprising: selecting partial voice data to instruct the target terminal to reproduce from among the first partial voice data and the second partial voice data; instructing the target terminal to reproduce the selected partial voice data; and instructing the target terminal to display visual information generated based on partial voice data other than the selected partial voice data among the first partial voice data and the second partial voice data.

8. The method of claim 2, further comprising: determining an artificial intelligence (AI) server other than the server as a receiver of the first partial voice data based on at least one of a gesture input of the first user or the first partial voice data; based on the determining of the AI server as the receiver of the first partial voice data, transmitting the first partial voice data to the AI server; and restricting transmission of the first partial voice data to a user other than the first user among the users in the virtual space.

9. The method of claim 8, further comprising: transmitting feedback voice data received from the AI server to the first user; and restricting the transmission of the feedback voice data to a user other than the first user.

10. The method of claim 2, wherein the determining of the target user comprises: based on not determining a user among the users in the virtual space to receive the first partial voice data, determining all users in the virtual space as target users.

11. A server for constructing a virtual space, the server comprising: memory configured to store computer-executable instructions; and at least one processor, wherein the instructions, when executed, configure the at least one processor individually or collectively to control the server to at least: extract first partial voice data corresponding to a target utterance from voice data of a first user received from a terminal of the first user among users in the virtual space; determine a target user to receive the first partial voice data of the first user; instruct a target terminal of the target user to reproduce the first partial voice data; and based on transmission to the target user of second partial voice data corresponding to an utterance from voice data of a second user being requested while the target terminal reproduces the first partial voice data, instruct the target terminal to reproduce the first partial voice data, but not the second partial voice data, and to display visual information including text converted from the utterance of the second user.

12. The server of claim 11, wherein the instructions, when executed, configure the at least one processor to individually or collectively control the server to: detect a start event and an end event from the voice data of the first user based on at least one of a gesture input of the first user or a portion of the voice data of the first user; and extract from the voice data of the first user, as the first partial voice data, a portion corresponding to a time period between the start event and the end event.

13. The server of claim 11, wherein the instructions, when executed, configure the at least one processor to individually or collectively control the server to: based on receiving of the voice data of the first user from the terminal of the first user, start transmission of the voice data of the first user to the users in the virtual space; based on detecting of a start event from the voice data of the first user, stop the transmission of the voice data of the first user to the users in the virtual space; and based on detecting of an end event from the voice data of the first user, restart the transmission of the voice data of the first user to the users in the virtual space.

14. The server of claim 11, wherein the instructions, when executed, configure the at l
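The start/end-event behavior recited in the dependent claims (extracting the portion between a start event and an end event, pausing the general transmission during that portion, and falling back to all users when no target is determined) might be sketched as below. This is a hedged illustration only: `split_stream` and `resolve_targets` are hypothetical names, the frames are assumed to arrive already tagged with detected events (the claims leave detection to a gesture input or a portion of the voice data), and treating the frames that carry the start and end events as part of the extracted portion is an assumption.

```python
from typing import Iterable, List, Tuple

START = "start"  # assumed tag for a detected start event
END = "end"      # assumed tag for a detected end event


def split_stream(
    frames: Iterable[Tuple[str, bytes]],
) -> Tuple[List[bytes], List[bytes]]:
    """Split tagged frames into (broadcast frames, extracted partial frames).

    Each frame is (event, chunk), where event is "start", "end", or "".
    """
    broadcast: List[bytes] = []
    partial: List[bytes] = []
    capturing = False
    for event, chunk in frames:
        if event == START:
            capturing = True           # stop general transmission
        if capturing:
            partial.append(chunk)      # belongs to the partial voice data
        else:
            broadcast.append(chunk)    # transmitted to users in the space
        if event == END:
            capturing = False          # restart general transmission
    return broadcast, partial


def resolve_targets(named: List[str], all_users: List[str]) -> List[str]:
    """Use the named targets, or every user when none was determined."""
    return list(named) if named else list(all_users)
```

For example, with frames `a, b(start), c, d(end), e`, the sketch broadcasts `a` and `e` and extracts `b, c, d` as the partial voice data, which would then be delivered only to the users returned by `resolve_targets`.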

Assignees

Samsung Electronics Co Ltd

Inventors

Classifications

  • Gesture based interaction, e.g. based on a set of recognized hand gestures (interaction based on gestures traced on a digitiser G06F3/04883) · CPC title

  • G06F3/167 (Primary)

    Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

  • Indicating arrangements; Control arrangements, e.g. balance control · CPC title

  • Segmentation; Word boundary detection · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12411655B2 cover?
An example server for constructing a virtual space includes a memory configured to store computer-executable instructions and a processor configured to execute the instructions by accessing the memory. The instructions, when executed, cause the processor to extract first partial voice data corresponding to a target utterance from voice data of a first user received from a terminal of the first …
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F3/167. Mapped technology areas include Physics.
When was this patent published?
Published Sep 9, 2025 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).