US9485464B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9485464-B2 |
| Application number | US-201514748982-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 24, 2015 |
| Priority date | Aug 28, 2014 |
| Publication date | Nov 1, 2016 |
| Grant date | Nov 1, 2016 |
Official abstract text for this publication.
In a processing method for a video conference, start speaking events and end speaking events of endpoints joined in the video conference are detected. Video streaming of a speaking endpoint is displayed in a speaking area of a foreground window of a display device. A current time “t”, a duration time “T” of the video conference, a start speaking time “s_i” and an end speaking time “e_i” of the endpoints are recorded. Real-time activity scores of each of the endpoints are calculated and updated according to video recording contents. Video streaming of non-speaking endpoints are displayed in corresponding areas of the display device according to the calculated activity scores.
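The recording step in the abstract can be illustrated with a minimal sketch. It assumes that start and end speaking events arrive as callbacks carrying an endpoint name and a timestamp in seconds; the class and method names (`SpeakingLog`, `ConferenceRecorder`, `on_start_speaking`, `on_end_speaking`) are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class SpeakingLog:
    """Speaking history of one endpoint (hypothetical helper, not from the patent)."""
    # Completed speaking turns as (s_i, e_i) pairs.
    intervals: List[Tuple[float, float]] = field(default_factory=list)
    # Start time of a turn that has begun but not yet ended.
    open_start: Optional[float] = None


class ConferenceRecorder:
    """Collects start/end speaking events per endpoint (illustrative only)."""

    def __init__(self) -> None:
        self.logs: Dict[str, SpeakingLog] = {}

    def on_start_speaking(self, endpoint: str, now: float) -> None:
        log = self.logs.setdefault(endpoint, SpeakingLog())
        if log.open_start is None:  # ignore duplicate start events
            log.open_start = now

    def on_end_speaking(self, endpoint: str, now: float) -> None:
        log = self.logs.get(endpoint)
        if log is None or log.open_start is None:
            return  # end event without a matching start: nothing to record
        log.intervals.append((log.open_start, now))
        log.open_start = None

    def speaking_endpoints(self) -> List[str]:
        """Endpoints with a turn in progress, i.e. candidates for the speaking area."""
        return [name for name, log in self.logs.items() if log.open_start is not None]
```

Keeping completed turns as plain `(s_i, e_i)` pairs is a design choice of this sketch: it makes the per-turn quantities used by the activity score directly available without re-reading the event stream.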
Opening claim text (preview).
What is claimed is:

1. A computer-implemented method, executable by a processor of a server, the method comprising: detecting start speaking events and end speaking events of endpoints joined in a video conference; displaying video streaming of a speaking endpoint in a speaking area of a foreground window of a display device; recording a current time “t”, a duration time “T” of the video conference, a start speaking time “s_i” and an end speaking time “e_i” of each of the endpoints; calculating and updating real-time activity scores of each of the endpoints according to video recording contents; and displaying video streaming of non-speaking endpoints in corresponding areas of the display device according to the calculated activity scores.

2. The method as claimed in claim 1, further comprising: displaying video streaming of a non-speaking endpoint in a non-speaking area of the foreground window, in event an activity score of the non-speaking endpoint is not less than a first preset value; displaying the video streaming of the non-speaking endpoint in a background window of the display device, in event the activity score of the non-speaking endpoint is less than the first preset value and greater than a second preset value; or stopping displaying the video streaming of the non-speaking endpoint, in event the activity score of the non-speaking endpoint is not greater than the second preset value.

3. The method as claimed in claim 1, wherein the activity score S is calculated according to a formula represented as $S = \sum_{i=1}^{n} \left(1 - \Delta t_i / T\right)^{3} t_i$, where $t_i = e_i - s_i$, and $\Delta t_i = t - e_i$.

4. The method as claimed in claim 1, wherein the speaking area is located at the center of the foreground window while the non-speaking area is located around the speaking area.

5. The method as claimed in claim 1, wherein the speaking area is located at the top of the foreground window, the non-speaking area is located at the bottom of the foreground window, and the dimensions of the speaking area are greater than the dimensions of the non-speaking area.

6. A non-transitory storage medium, storing a set of instructions, the set of instructions being executed by a processor of a server, to perform a method comprising: detecting start speaking events and end speaking events of endpoints joined in a video conference; displaying video streaming of a speaking endpoint in a speaking area of a foreground window of a display device; recording a current time “t”, a duration time “T” of the video conference, a start speaking time “s_i” and an end speaking time “e_i” of the endpoints; calculating and updating real-time activity scores of each of the endpoints according to video recording contents; and displaying video streaming of non-speaking endpoints in corresponding areas of the display device according to the calculated activity scores.

7. The non-transitory storage medium as claimed in claim 6, wherein the method further comprises: displaying video streaming of a non-speaking endpoint in a non-speaking area of the foreground window, in event an activity score of the non-speaking endpoint is not less than a first preset value; displaying the video streaming of the non-speaking endpoint in a background window of the display device, in event the activity score of the non-speaking endpoint is less than the first preset value and greater than a second preset value; or stopping displaying the video streaming of the non-speaking endpoint, in event the activity score of the non-speaking endpoint is not greater than the second preset value.

8. The non-transitory storage medium as claimed in claim 6, wherein the activity score S is calculated according to a formula represented as $S = \sum_{i=1}^{n} \left(1 - \Delta t_i / T\right)^{3} t_i$, where $t_i = e_i - s_i$, and $\Delta t_i = t - e_i$.

9. The non-transitory storage medium as claimed in claim 6, wherein the speaking area is located at the center of the foreground window while the non-speaking area is located around the speaking area.

10. The non-transitory storage medium as claimed in claim 6, wherein the speaking area is located at the top of the foreground window, the non-speaking area is located at the bottom of the foreground window, and the dimensions of the speaking area are greater than the dimensions of the non-speaking area.

11. A server, the server comprising: at least one processor; a storage unit; and one or more programs that are stored in the storage unit and executed by the at least one processor, the one or more programs comprising instructions for: detecting start speaking events and end speaking events of endpoints joined in a video conference; displaying video streaming of a speaking endpoint in a speaking area of a foreground window of a display device; recording a current time “t”, a duration time “T” of the video conference, a start speaking time “s_i” and an end speaking time “e_i” of each of the endpoints; calculating and updating real-time activity scores of each o
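
The scoring and placement rules in claims 2, 3, 7 and 8 can be sketched as follows. This is a hedged illustration, not the patented implementation: it reads the flattened formula in the claims as S = Σ (1 − Δt_i / T)³ · t_i with t_i = e_i − s_i and Δt_i = t − e_i, and it assumes that all times share one unit and that T is positive. The function names, the returned labels, and the threshold values used in the usage note are hypothetical.

```python
from typing import Iterable, Tuple


def activity_score(intervals: Iterable[Tuple[float, float]], t: float, T: float) -> float:
    """Sum (1 - dt_i / T)**3 * t_i over completed speaking turns (s_i, e_i).

    t is the current time and T the conference duration, as named in the claims.
    """
    if T <= 0:
        raise ValueError("T must be positive")  # assumption: the claims do not cover T <= 0
    score = 0.0
    for s_i, e_i in intervals:
        t_i = e_i - s_i   # length of the i-th speaking turn
        dt_i = t - e_i    # time elapsed since that turn ended
        score += (1.0 - dt_i / T) ** 3 * t_i
    return score


def placement(score: float, first_preset: float, second_preset: float) -> str:
    """Map a non-speaking endpoint's score to a display tier, following claim 2."""
    if score >= first_preset:      # "not less than a first preset value"
        return "foreground"
    if score > second_preset:      # below the first preset, above the second
        return "background"
    return "hidden"                # "not greater than the second preset value"
```

For example, with turns `[(0, 10), (20, 30)]`, `t = 40` and `T = 100`, the older turn contributes 0.7³ · 10 = 3.43 and the more recent one 0.9³ · 10 = 7.29, so the score is about 10.72. With hypothetical presets of 4 and 2, that endpoint would stay in the foreground window. The cubic factor makes recent turns weigh more than older ones of the same length.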