System and method of three-dimensional immersive applications in multi-user communication sessions
US-2023274504-A1 · Aug 31, 2023 · US
US11893669B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11893669-B2 |
| Application number | US-202217571099-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 7, 2022 |
| Priority date | Jan 8, 2021 |
| Publication date | Feb 6, 2024 |
| Grant date | Feb 6, 2024 |
Official abstract text for this publication.
A digital human development platform can enable a user to generate a digital human. The digital human development platform can receive user input specifying a dialogue for the digital human and one or more behaviors for the digital human, the one or more specified behaviors corresponding with one or more portions of the dialog on a common timeline. Scene data can be generated with the digital human development platform by merging the one or more behaviors with one or more portions of the dialogue based on times of the one or more behaviors and the one or more portions of the dialog on the common timeline.
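The merge described in the abstract can be illustrated with a minimal sketch. It assumes dialog portions and behaviors are each given as timed ranges on the common timeline and that "merging" means attaching every behavior whose range overlaps a dialog portion. The names `DialogPortion`, `Behavior`, and `merge_scene`, and the dictionary layout of the scene data, are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DialogPortion:
    start: float  # seconds on the common timeline (assumed unit)
    end: float
    text: str


@dataclass(frozen=True)
class Behavior:
    start: float
    end: float
    name: str  # e.g. a facial expression or gesture label


def merge_scene(dialog, behaviors):
    """Return one scene entry per dialog portion, in time order.

    Each entry lists the behaviors whose time range overlaps the portion.
    This overlap rule is an illustrative assumption, not the patented method.
    """
    ordered_behaviors = sorted(behaviors, key=lambda b: b.start)
    scene = []
    for portion in sorted(dialog, key=lambda p: p.start):
        overlapping = [
            b.name
            for b in ordered_behaviors
            if b.start < portion.end and b.end > portion.start
        ]
        scene.append(
            {
                "start": portion.start,
                "end": portion.end,
                "text": portion.text,
                "behaviors": overlapping,
            }
        )
    return scene
```

A behavior that spans two dialog portions appears in both entries, and a behavior that falls entirely outside any dialog portion is left out of this simplified scene data.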
Opening claim text (preview).
What is claimed is:
1. A method, comprising: selecting, with a development platform, a digital human; receiving, with the development platform, user input specifying a dialog for the digital human and one or more behaviors for the digital human corresponding with one or more portions of the dialog on a common timeline; wherein the dialog includes words to be spoken by the digital human in response to one or more predetermined cues received during an interactive dialog with an individual; and generating scene data, with the development platform, by merging the one or more behaviors with the one or more portions of the dialog based on times of the one or more behaviors and the one or more portions of the dialog on the common timeline; wherein the scene data is executable by a device to render the digital human and engage in the interactive dialog with the individual based on the one or more predetermined cues from the individual as received by the device during the interactive dialog.
2. The method of claim 1, wherein the selecting comprises selecting a visual representation of the digital human from a plurality of visual representations corresponding to different digital humans electronically stored in a database communicatively coupled with the development platform.
3. The method of claim 1, wherein the selecting comprises selecting a voice type corresponding to the digital human for audibly rendering the dialog.
4. The method of claim 1, wherein the selecting comprises selecting a language corresponding to the digital human for audibly rendering the dialog.
5. The method of claim 1, wherein the merging comprises determining individual time segments of the common timeline during which distinct portions of the dialog are rendered and editing at least one of the one or more behaviors within each of the individual time segments.
6. The method of claim 1, wherein the one or more behaviors comprise one or more facial expressions rendered by the digital human, each facial expression of the one or more facial expressions is timed based on the common timeline to correspond to a time interval including a select portion of the dialog or an absence of dialog between two consecutive portions of the dialog.
7. The method of claim 1, wherein the one or more behaviors comprise one or more gestures made by the digital human, each gesture of the one or more gestures corresponding to a select portion of the dialog or an absence of dialog between two consecutive portions of the dialog.
8. The method of claim 1, further comprising: editing the dialog, wherein the editing provides at least one of an inflection, a stress, a speech rate, or a tone of one or more words of the dialog.
9. The method of claim 1, further comprising: generating a background against which the digital human is visually rendered.
10. The method of claim 9, further comprising: positioning the digital human against the background.
11. A system, comprising: a processor configured to initiate operations including: selecting a digital human; receiving user input specifying a dialog for the digital human and one or more behaviors for the digital human corresponding with one or more portions of the dialog on a common timeline; wherein the dialog includes words to be spoken by the digital human in response to one or more predetermined cues received during an interactive dialog with an individual; and generating scene data by merging the one or more behaviors with the one or more portions of the dialog based on times of the one or more behaviors and the one or more portions of the dialog on the common timeline; wherein the scene data is executable by a device to render the digital human and engage in the interactive dialog with the individual based on the one or more predetermined cues from the individual as received by the device during the interactive dialog.
12. The system of claim 11, wherein the selecting comprises selecting a visual representation of the digital human from a plurality of visual representations corresponding to different digital humans electronically stored in a database communicatively coupled with the system.
13. The system of claim 11, wherein the selecting comprises selecting a voice type corresponding to the digital human for audibly rendering the dialog.
14. The system of claim 11, wherein the selecting comprises selecting a language corresponding to the digital human for audibly rendering the dialog.
15. The system of claim 11, wherein the merging comprises determining individual time segments of the common timeline during which distinct portions of the dialog are rendered and editing at least one of the one or more behaviors within each of the individual time segments.
16. The system of claim 11, wherein the one or more behaviors comprise one or more facial expressions rendered by the digital human, each facial expression of the one or more facial expressions is timed based on the common timeline to correspond to a time interval including a select portion of the dialog or an absence of dialog between two consecutive portions of the dialog.
17. The system of claim 11, wherein the one or more behaviors comprise one or more gestures made by the digital human, each gesture of the one or more gestures corresponding to a select portion of the dialog or an absence of dialog between two consecutive portions of the dialog.
18. The system of claim 11, wherein the processor is configured to initiate operations further including: editing the dialog, wherein the editing provides at least one of an inflection, a stress, a speech rate, or a tone of one or more words of the dialog.
19. The system of claim 11, wherein the processor is configured to initiate operations further including: generating a background against which the digital human is visually rendered.
20. A computer program product, the computer program product comprising: one or more computer-readable storage media and program instructions collectively stored on the one or more computer-readable storage media, the program instructions executable by a processor to cause the processor to initiate operations including: selecting a digital human; receiving user input specifying a dialog for the digital human and one or more behaviors for the digital human corresponding with one or more portions of the dialog on a common timeline; wherein the dialog includes words to be spoken by the digital human in response to one or more predetermined cues received during an interactive dialog with an individual; and generating scene data by merging the one or more behaviors with one or more portions of the dialog based on times of the one or more behaviors and the one or more portions of the dialog on the common timeline; wherein the scene data is executable by a device to render the digital human and engage in the interactive dialog with the individual based on the one or more predetermined cues from the individual as received by the device during the interactive dialog.
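The dependent claims on time segments (claims 5 to 7 and 15 to 17) refer both to intervals in which a portion of the dialog is rendered and to an absence of dialog between two consecutive portions. One way to picture this is the following sketch, which splits the common timeline into spoken segments and silent gaps. The function name `timeline_segments` and the tuple layout `(start, end, text)` are hypothetical, and using `None` to mark a gap is an assumption made for illustration only.

```python
def timeline_segments(portions):
    """Split a common timeline into dialog segments and silent gaps.

    `portions` is a list of (start, end, text) tuples in any order.
    Returns a time-ordered list of (start, end, text_or_None), where
    None marks an interval with no dialog between two consecutive portions.
    """
    segments = []
    previous_end = None
    for start, end, text in sorted(portions):
        # Emit a gap segment when the next portion starts after the last one ended.
        if previous_end is not None and start > previous_end:
            segments.append((previous_end, start, None))
        segments.append((start, end, text))
        previous_end = end if previous_end is None else max(previous_end, end)
    return segments
```

With segments in this form, a facial expression or gesture could be assigned either to a spoken segment or to a gap, which is the distinction the claims draw.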
Animation · CPC title
Arrangements for interaction with the human body, e.g. for user immersion in virtual reality (blind teaching G09B21/00) · CPC title
using display panels · CPC title
Sound input; Sound output (speech processing G10L) · CPC title
of characters, e.g. humans, animals or virtual beings · CPC title