Visually tracked spatial audio
US-2022191638-A1 · Jun 16, 2022 · US
US11770500B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11770500-B2 |
| Application number | US-202217587211-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 28, 2022 |
| Priority date | Jul 15, 2021 |
| Publication date | Sep 26, 2023 |
| Grant date | Sep 26, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A system for managing a virtual meeting (e.g., video conference) includes memory storing a video conference application and at least one processor to execute the video conference application to generate a virtual meeting view for a first attendee including multiple attendee video streams arranged according to a virtual attendee arrangement specifying positions of the attendee video streams relative to each other in the virtual meeting view, receive second attendee audio data associated with a second attendee video stream, identify a particular video stream position specified by the virtual attendee arrangement, determine differential stereo effect data corresponding with the particular video stream position, and apply the differential stereo effect data to the second attendee audio data to provide differential audio signals on different audio channels output to the first attendee to create a stereo sound effect corresponding with the particular video stream position.
Opening claim text (preview).
The invention claimed is: 1. A system for managing a virtual meeting including a group of attendees, the system comprising: non-transitory memory storing a video conference application comprising computer-readable instructions; and at least one processor communicatively coupled to the non-transitory memory to execute the video conference application to: generate and display, to a first attendee in the group of attendees, a virtual meeting view including multiple attendee video streams arranged according to a virtual attendee arrangement, each attendee video stream comprising a video stream of a respective attendee in the group of attendees; wherein the virtual attendee arrangement specifies a video stream position of each respective attendee video stream relative to each other attendee video stream in the virtual meeting view, including a particular video stream position of a second attendee video stream of a second attendee in the group of attendees; wherein different ones of the attendee video streams are located at different respective offset distances relative to a defined reference position in the virtual meeting view; receive second attendee audio data associated with the second attendee video stream; determine a respective offset distance of the particular video stream position of the second attendee video stream relative to the defined reference position in the virtual meeting view; determine differential stereo effect data corresponding with the particular video stream position of the second attendee video stream specified by the virtual attendee arrangement as a function of the determined respective offset distance of the particular video stream position; and apply the differential stereo effect data to the second attendee audio data to provide differential audio signals on different audio channels output to the first attendee, wherein the differential audio signals on the different audio channels create a stereo sound effect corresponding with the determined respective offset distance of the particular video stream position. 2. The system of claim 1 , wherein the differential stereo effect data comprises differential delay data defining an audio delay differential between the different audio channels. 3. The system of claim 1 , wherein the differential stereo effect data comprises differential amplitude data defining an amplitude differential between the different audio channels. 4. The system of claim 1 , wherein the differential stereo effect data comprises (a) differential delay data defining a different audio delay on the different audio channels and (b) differential amplitude data defining a different audio amplitude on the different audio channels. 5. The system of claim 1 , wherein: the particular video stream position defined by the virtual attendee arrangement defines a lateral offset of the second attendee video stream relative to the defined reference position in the virtual meeting view; and a magnitude of the differential stereo effect data corresponding with the particular video stream position depends on the defined lateral offset of the second attendee video stream relative to the defined reference position. 6. The system of claim 5 , wherein the defined reference position in the virtual meeting view corresponds with a position of a key stream of the multiple attendee video streams. 7. The system of claim 6 , wherein: the virtual attendee arrangement defines a number of attendees arranged between the second attendee and the defined reference position in the virtual meeting view; and the lateral offset of the second attendee video stream relative to the defined refence position is defined by a number of attendee video streams arranged between the second attendee video stream. 8. The system of claim 6 , wherein the key stream is selected by the first attendee. 9. The system of claim 6 , wherein the key stream is selected based on focus input received from the first attendee. 10. The system of claim 9 , wherein the focus input received from the first attendee comprises focal sensor data received from a focal sensor associated with the first attendee. 11. The system of claim 1 , wherein the video conference application is executable to: define a focus group comprising a subset of one or more attendee video streams of the multiple attendee video streams; determine whether the second attendee video stream is included in the focus group; and apply a focus-related audio effect to the second attendee audio data based on whether the second attendee video stream is included in the focus group. 12. The system of claim 11 , wherein applying the focus-related audio effect to the second attendee audio data based on whether the second attendee video stream is included in the focus group comprises attenuating an amplitude of the second attendee audio data in response to determining the second attendee video stream is not included in the focus group. 13. The system of claim 11 , wherein the video conference application is executable to: receive focus input from the first attendee; and adjust the subset of attendee video streams in the focus group based on the focus input received from the first attendee. 14. The system of claim 1 , wherein the video conference application is executable to: determine a distance-related audio effect corresponding with a virtual distance between the second attendee and the first attendee assigned to the particular video stream position; and apply the distance-related audio effect to the second attendee audio data, the distance-related audio effect adjusting an amplitude of the second attendee audio data. 15. A system for managing a virtual meeting including a group of attendees, the system comprising: non-transitory memory storing computer-readable audio management instructions; and at least one processor communicatively coupled to the non-transitory memory to execute the video conference application to: generate and display, to a first attendee in the group of attendees, a virtual meeting view including multiple attendee video streams arranged according to a virtual attendee arrangement, each attendee video stream comprising a video stream of a respective attendee in the group of attendees; wherein the virtual attendee arrangement specifies a position of each respective attendee video stream relative to each other attendee video stream in the virtual meeting view; receive focal sensor data from a focal sensor, the focal sensor data indicating a spatial focus of the first attendee; based at least on the received focal sensor data, define a focus group of attendee video streams comprising a subset of attendee video streams of the multiple attendee video streams, the subset of the attendee video streams including (a) a key stream corresponding with the spatial focus indicated by the focal sensor data and (b) at least one adjacent attendee video stream adjacent to the key stream; receive second attendee audio data associated with a second attendee video stream of the multiple attendee video streams, the second attendee video stream comprising a video stream of a second attendee in the group of attendees, determine whether the second attendee video stream is in the focus group; apply a focus-related audio effect to the second attendee audio data based on whether the second attendee video stream is in the focus group; and output the second attendee audio data with the applied audio effect to the first attendee via at least one audio channel. 16. The system of claim 15 , wherein applying the focus-related audio effect to the
Conference systems · CPC title
Conducting the conference, e.g. admission, detection, selection or grouping of participants, correlating users to one or more conference sessions, prioritising transmission · CPC title
Processing of audio elementary streams · CPC title
involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations · CPC title
involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream (arrangements characterised by components specially adapted for monitoring, identification or recognition of video in broadcast systems H04H60/59) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.