Videoconference device

US10110831B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10110831-B2
Application numberUS-201715703147-A
CountryUS
Kind codeB2
Filing dateSep 13, 2017
Priority dateSep 27, 2016
Publication dateOct 23, 2018
Grant dateOct 23, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A videoconference device displays video data from a speech site such that a viewer can easily understand even in a case where the number of sites is large. A communication controller receives each piece of video data and voice data from conference terminal devices of a plurality of other sites. A video and voice synthesizer determines a screen layout depending on the number of sites participating in a videoconference, and generates synthesized video data obtained by synthesizing video data of each site according to the screen layout. At this time, the video and voice synthesizer generates the synthesized video data such that display of the video data of each site where a level of voice data is higher than or equal to a threshold is highlighted more than display of video data of the other sites. A video and voice output controller displays the synthesized video data on a screen of a display device.

First claim

Opening claim text (preview).

What is claimed is: 1. A videoconference device which is provided at a host site and is simultaneously connectable to videoconference devices of a plurality of other sites, the videoconference device comprising: a video input unit that acquires video data by capturing video of the host site; a voice input unit that acquires voice data by picking up voices of the host site; a communication controller that receives video data and voice data from the videoconference devices, respectively, of the plurality of other sites; and a display controller that determines a screen layout depending on the number of sites participating in a videoconference, generates synthesized video data by synthesizing the video data of the sites in accordance with the screen layout, displays the synthesized video data on a screen, starts a main timer, wherein the videoconference device further includes a level detector that detects a level of the voice data, the display controller determines, for each site, whether a level of the voice data is higher than or equal to a threshold, determines any site having voice data higher than or equal to the threshold as a speech site, and determines any site having voice data not higher than or equal to the threshold as an audience site, wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of a first group of the audience sites in such a manner that displayed speech sites are highlighted more than the first group of audience sites, wherein the display controller further generates, after expiration of the main timer, the synthesized video data so as to display video data of any site determined to be a speech site, and video data of a second group of the audience sites in such a manner that displayed speech sites are highlighted more than the second group of audience sites, and wherein the display controller determines for each speech site, whether a level of the voice data has been lower than the threshold for a predetermined amount of time, and determines any speech site having voice data lower than the threshold for the predetermined amount of time as an audience site that is no longer to be highlighted. 2. The videoconference device of claim 1 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display area of displayed speech sites is larger than a display area of the first group of audience sites. 3. The videoconference device of claim 1 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display method of displayed speech sites differs from a display method of the first group of audience sites. 4. The videoconference device of claim 2 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display position of displayed speech sites is changed. 5. A videoconference device which is provided at a host site and is simultaneously connectable to videoconference devices of a plurality of other sites, the videoconference device comprising: a video input unit that acquires video data by capturing video of the host site; a voice input unit that acquires voice data by picking up voices of the host site; a communication controller that receives each piece of the video data and the voice data from conference terminal devices of the plurality of other sites; and a display controller that determines a screen layout depending on the number of sites participating in a videoconference, generates synthesized video data by synthesizing the video data of the sites in accordance with the screen layout, displays the synthesized video data on a screen, starts a main timer, wherein the videoconference device further includes a level detector that detects a level of the voice data, wherein the display controller generates the synthesized video data so as to, in a case where a speech site, having a participant who mainly speaks, is displayed and where an audience site, having a participant who only listens without basically speaking, is displayed are determined in advance, display video data of the speech site and the audience site in such a manner that a display area of the speech site is larger than a display area of the audience site, wherein the audience site has voice data higher than or equal to a threshold, wherein the display controller generates the synthesized video data so as to display video data of the audience site in such a manner that the display area of the audience site is larger than display areas of a first group of other audience sites, of the plurality of other sites having voice data lower than the threshold, and is smaller than the display area of the speech site, wherein the display controller generates the synthesized video data so as to, after expiration of the main timer, display video data of the audience site in such a manner that the display area of the audience site is larger than display areas of a second group of other audience sites, of the plurality of other sites having voice data lower than the threshold, and is smaller than the display area of the speech site, and wherein the display controller determines for the speech site, whether the voice data has been lower than the threshold for a predetermined amount of time, and if the speech site has been lower than the threshold for the predetermined amount of time, determines the speech site as a new audience site that is no longer to have a display area that is larger than the display area of the audience site. 6. The videoconference device of claim 1 , wherein the display controller still further generates, prior to expiration of the main timer and if one of the first group of audience sites voice data is higher than the threshold, the synthesized video data so as to display video data of the speech sites in such a manner that the displayed speech sites are modified to include the one of the first group of audience sites having the level of the voice data higher than the threshold. 7. The videoconference device of claim 6 , wherein the display controller still further generates, the synthesized video data such that display of video data of the audience sites is modified to include a third group of audience sites not including the one of the first group of audience sites having voice data higher than the threshold. 8. The videoconference device of claim 7 , wherein the display controller additionally starts an individual timer based on the one of the first group of audience sites having voice data higher than the threshold, and wherein the display controller further generates, after expiration of the individual timer and if the one of the first group of audience sites has had a level of the voice data lower than the threshold for the duration of the individual timer, the synthesized video data such that display of video data of the speech sites is further modified to not include the one of the first group of audience sites having the voice data higher than the threshold. 9. A method of providing a videoconference at a host site that is simultaneously connectable to videoconference devices of a plurality of other sites, the method comprising: acquiring, via a video input unit, video data by capturing video of the host site; acquiring, via a voice input unit,

Assignees

Inventors

Classifications

  • H04N7/15Primary

    Conference systems · CPC title

  • Alteration of picture size, shape, position or orientation, e.g. zooming, rotation, rolling, perspective, translation · CPC title

  • Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals (selecting H04Q) · CPC title

  • H04N5/2624Primary

    for obtaining an image which is composed of whole input images, e.g. splitscreen · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10110831B2 cover?
A videoconference device displays video data from a speech site such that a viewer can easily understand even in a case where the number of sites is large. A communication controller receives each piece of video data and voice data from conference terminal devices of a plurality of other sites. A video and voice synthesizer determines a screen layout depending on the number of sites participati…
Who is the assignee on this patent?
Panasonic Ip Man Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04N7/15. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Oct 23 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).