Transmission terminal, transmission system, display method and program
US-2016105642-A1 · Apr 14, 2016 · US
US10110831B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10110831-B2 |
| Application number | US-201715703147-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 13, 2017 |
| Priority date | Sep 27, 2016 |
| Publication date | Oct 23, 2018 |
| Grant date | Oct 23, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A videoconference device displays video data from a speech site such that a viewer can easily understand even in a case where the number of sites is large. A communication controller receives each piece of video data and voice data from conference terminal devices of a plurality of other sites. A video and voice synthesizer determines a screen layout depending on the number of sites participating in a videoconference, and generates synthesized video data obtained by synthesizing video data of each site according to the screen layout. At this time, the video and voice synthesizer generates the synthesized video data such that display of the video data of each site where a level of voice data is higher than or equal to a threshold is highlighted more than display of video data of the other sites. A video and voice output controller displays the synthesized video data on a screen of a display device.
Opening claim text (preview).
What is claimed is: 1. A videoconference device which is provided at a host site and is simultaneously connectable to videoconference devices of a plurality of other sites, the videoconference device comprising: a video input unit that acquires video data by capturing video of the host site; a voice input unit that acquires voice data by picking up voices of the host site; a communication controller that receives video data and voice data from the videoconference devices, respectively, of the plurality of other sites; and a display controller that determines a screen layout depending on the number of sites participating in a videoconference, generates synthesized video data by synthesizing the video data of the sites in accordance with the screen layout, displays the synthesized video data on a screen, starts a main timer, wherein the videoconference device further includes a level detector that detects a level of the voice data, the display controller determines, for each site, whether a level of the voice data is higher than or equal to a threshold, determines any site having voice data higher than or equal to the threshold as a speech site, and determines any site having voice data not higher than or equal to the threshold as an audience site, wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of a first group of the audience sites in such a manner that displayed speech sites are highlighted more than the first group of audience sites, wherein the display controller further generates, after expiration of the main timer, the synthesized video data so as to display video data of any site determined to be a speech site, and video data of a second group of the audience sites in such a manner that displayed speech sites are highlighted more than the second group of audience sites, and wherein the display controller determines for each speech site, whether a level of the voice data has been lower than the threshold for a predetermined amount of time, and determines any speech site having voice data lower than the threshold for the predetermined amount of time as an audience site that is no longer to be highlighted. 2. The videoconference device of claim 1 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display area of displayed speech sites is larger than a display area of the first group of audience sites. 3. The videoconference device of claim 1 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display method of displayed speech sites differs from a display method of the first group of audience sites. 4. The videoconference device of claim 2 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display position of displayed speech sites is changed. 5. A videoconference device which is provided at a host site and is simultaneously connectable to videoconference devices of a plurality of other sites, the videoconference device comprising: a video input unit that acquires video data by capturing video of the host site; a voice input unit that acquires voice data by picking up voices of the host site; a communication controller that receives each piece of the video data and the voice data from conference terminal devices of the plurality of other sites; and a display controller that determines a screen layout depending on the number of sites participating in a videoconference, generates synthesized video data by synthesizing the video data of the sites in accordance with the screen layout, displays the synthesized video data on a screen, starts a main timer, wherein the videoconference device further includes a level detector that detects a level of the voice data, wherein the display controller generates the synthesized video data so as to, in a case where a speech site, having a participant who mainly speaks, is displayed and where an audience site, having a participant who only listens without basically speaking, is displayed are determined in advance, display video data of the speech site and the audience site in such a manner that a display area of the speech site is larger than a display area of the audience site, wherein the audience site has voice data higher than or equal to a threshold, wherein the display controller generates the synthesized video data so as to display video data of the audience site in such a manner that the display area of the audience site is larger than display areas of a first group of other audience sites, of the plurality of other sites having voice data lower than the threshold, and is smaller than the display area of the speech site, wherein the display controller generates the synthesized video data so as to, after expiration of the main timer, display video data of the audience site in such a manner that the display area of the audience site is larger than display areas of a second group of other audience sites, of the plurality of other sites having voice data lower than the threshold, and is smaller than the display area of the speech site, and wherein the display controller determines for the speech site, whether the voice data has been lower than the threshold for a predetermined amount of time, and if the speech site has been lower than the threshold for the predetermined amount of time, determines the speech site as a new audience site that is no longer to have a display area that is larger than the display area of the audience site. 6. The videoconference device of claim 1 , wherein the display controller still further generates, prior to expiration of the main timer and if one of the first group of audience sites voice data is higher than the threshold, the synthesized video data so as to display video data of the speech sites in such a manner that the displayed speech sites are modified to include the one of the first group of audience sites having the level of the voice data higher than the threshold. 7. The videoconference device of claim 6 , wherein the display controller still further generates, the synthesized video data such that display of video data of the audience sites is modified to include a third group of audience sites not including the one of the first group of audience sites having voice data higher than the threshold. 8. The videoconference device of claim 7 , wherein the display controller additionally starts an individual timer based on the one of the first group of audience sites having voice data higher than the threshold, and wherein the display controller further generates, after expiration of the individual timer and if the one of the first group of audience sites has had a level of the voice data lower than the threshold for the duration of the individual timer, the synthesized video data such that display of video data of the speech sites is further modified to not include the one of the first group of audience sites having the voice data higher than the threshold. 9. A method of providing a videoconference at a host site that is simultaneously connectable to videoconference devices of a plurality of other sites, the method comprising: acquiring, via a video input unit, video data by capturing video of the host site; acquiring, via a voice input unit,
Conference systems · CPC title
Alteration of picture size, shape, position or orientation, e.g. zooming, rotation, rolling, perspective, translation · CPC title
Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals (selecting H04Q) · CPC title
for obtaining an image which is composed of whole input images, e.g. splitscreen · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.