What technology area does this patent fall under?

Primary CPC classification H04N7/15. Mapped technology areas include Electricity.

When was this patent published?

Publication date Tue Oct 23 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Videoconference device

US10110831B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10110831-B2
Application number	US-201715703147-A
Country	US
Kind code	B2
Filing date	Sep 13, 2017
Priority date	Sep 27, 2016
Publication date	Oct 23, 2018
Grant date	Oct 23, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A videoconference device displays video data from a speech site such that a viewer can easily understand even in a case where the number of sites is large. A communication controller receives each piece of video data and voice data from conference terminal devices of a plurality of other sites. A video and voice synthesizer determines a screen layout depending on the number of sites participating in a videoconference, and generates synthesized video data obtained by synthesizing video data of each site according to the screen layout. At this time, the video and voice synthesizer generates the synthesized video data such that display of the video data of each site where a level of voice data is higher than or equal to a threshold is highlighted more than display of video data of the other sites. A video and voice output controller displays the synthesized video data on a screen of a display device.

First claim

Opening claim text (preview).

What is claimed is: 1. A videoconference device which is provided at a host site and is simultaneously connectable to videoconference devices of a plurality of other sites, the videoconference device comprising: a video input unit that acquires video data by capturing video of the host site; a voice input unit that acquires voice data by picking up voices of the host site; a communication controller that receives video data and voice data from the videoconference devices, respectively, of the plurality of other sites; and a display controller that determines a screen layout depending on the number of sites participating in a videoconference, generates synthesized video data by synthesizing the video data of the sites in accordance with the screen layout, displays the synthesized video data on a screen, starts a main timer, wherein the videoconference device further includes a level detector that detects a level of the voice data, the display controller determines, for each site, whether a level of the voice data is higher than or equal to a threshold, determines any site having voice data higher than or equal to the threshold as a speech site, and determines any site having voice data not higher than or equal to the threshold as an audience site, wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of a first group of the audience sites in such a manner that displayed speech sites are highlighted more than the first group of audience sites, wherein the display controller further generates, after expiration of the main timer, the synthesized video data so as to display video data of any site determined to be a speech site, and video data of a second group of the audience sites in such a manner that displayed speech sites are highlighted more than the second group of audience sites, and wherein the display controller determines for each speech site, whether a level of the voice data has been lower than the threshold for a predetermined amount of time, and determines any speech site having voice data lower than the threshold for the predetermined amount of time as an audience site that is no longer to be highlighted. 2. The videoconference device of claim 1 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display area of displayed speech sites is larger than a display area of the first group of audience sites. 3. The videoconference device of claim 1 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display method of displayed speech sites differs from a display method of the first group of audience sites. 4. The videoconference device of claim 2 , wherein the display controller generates the synthesized video data so as to display video data of any site determined to be a speech site, and video data of the first group of the audience sites in such a manner that a display position of displayed speech sites is changed. 5. A videoconference device which is provided at a host site and is simultaneously connectable to videoconference devices of a plurality of other sites, the videoconference device comprising: a video input unit that acquires video data by capturing video of the host site; a voice input unit that acquires voice data by picking up voices of the host site; a communication controller that receives each piece of the video data and the voice data from conference terminal devices of the plurality of other sites; and a display controller that determines a screen layout depending on the number of sites participating in a videoconference, generates synthesized video data by synthesizing the video data of the sites in accordance with the screen layout, displays the synthesized video data on a screen, starts a main timer, wherein the videoconference device further includes a level detector that detects a level of the voice data, wherein the display controller generates the synthesized video data so as to, in a case where a speech site, having a participant who mainly speaks, is displayed and where an audience site, having a participant who only listens without basically speaking, is displayed are determined in advance, display video data of the speech site and the audience site in such a manner that a display area of the speech site is larger than a display area of the audience site, wherein the audience site has voice data higher than or equal to a threshold, wherein the display controller generates the synthesized video data so as to display video data of the audience site in such a manner that the display area of the audience site is larger than display areas of a first group of other audience sites, of the plurality of other sites having voice data lower than the threshold, and is smaller than the display area of the speech site, wherein the display controller generates the synthesized video data so as to, after expiration of the main timer, display video data of the audience site in such a manner that the display area of the audience site is larger than display areas of a second group of other audience sites, of the plurality of other sites having voice data lower than the threshold, and is smaller than the display area of the speech site, and wherein the display controller determines for the speech site, whether the voice data has been lower than the threshold for a predetermined amount of time, and if the speech site has been lower than the threshold for the predetermined amount of time, determines the speech site as a new audience site that is no longer to have a display area that is larger than the display area of the audience site. 6. The videoconference device of claim 1 , wherein the display controller still further generates, prior to expiration of the main timer and if one of the first group of audience sites voice data is higher than the threshold, the synthesized video data so as to display video data of the speech sites in such a manner that the displayed speech sites are modified to include the one of the first group of audience sites having the level of the voice data higher than the threshold. 7. The videoconference device of claim 6 , wherein the display controller still further generates, the synthesized video data such that display of video data of the audience sites is modified to include a third group of audience sites not including the one of the first group of audience sites having voice data higher than the threshold. 8. The videoconference device of claim 7 , wherein the display controller additionally starts an individual timer based on the one of the first group of audience sites having voice data higher than the threshold, and wherein the display controller further generates, after expiration of the individual timer and if the one of the first group of audience sites has had a level of the voice data lower than the threshold for the duration of the individual timer, the synthesized video data such that display of video data of the speech sites is further modified to not include the one of the first group of audience sites having the voice data higher than the threshold. 9. A method of providing a videoconference at a host site that is simultaneously connectable to videoconference devices of a plurality of other sites, the method comprising: acquiring, via a video input unit, video data by capturing video of the host site; acquiring, via a voice input unit,

Assignees

Panasonic Ip Man Co Ltd

Inventors

Classifications

H04N7/15Primary
Conference systems · CPC title
H04N5/2628
Alteration of picture size, shape, position or orientation, e.g. zooming, rotation, rolling, perspective, translation · CPC title
H04N7/147
Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals (selecting H04Q) · CPC title
H04N5/2624Primary
for obtaining an image which is composed of whole input images, e.g. splitscreen · CPC title

Patent family

Related publications grouped by family.

View patent family 61686909

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10110831B2 cover?: A videoconference device displays video data from a speech site such that a viewer can easily understand even in a case where the number of sites is large. A communication controller receives each piece of video data and voice data from conference terminal devices of a plurality of other sites. A video and voice synthesizer determines a screen layout depending on the number of sites participati…
Who is the assignee on this patent?: Panasonic Ip Man Co Ltd
What technology area does this patent fall under?: Primary CPC classification H04N7/15. Mapped technology areas include Electricity.
When was this patent published?: Publication date Tue Oct 23 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Transmission terminal, transmission system, display method and program

Method and system for new layout experience in video communication

Video Conference Apparatus, Method, and Storage Medium

Computer system employing speech recognition for detection of non-speech audio

Frequently asked questions