Techniques for enhancing user experience in video conferencing

US2017280098A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2017280098-A1
Application numberUS-201415504967-A
CountryUS
Kind codeA1
Filing dateSep 26, 2014
Priority dateSep 26, 2014
Publication dateSep 28, 2017
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are disclosed for enhancing user experience in video conferencing. In accordance with some embodiments, the graphical user interface (GUI) displayed on a device involved in a video conferencing session may undergo dynamic adjustment of its video composition, for example, to render video content in either a prominent or a thumbnail region of the GUI. Reorganization of the GUI's video composition may be performed, for example: (1) automatically based on detected audio activity levels of the video conferencing participants; and/or (2) upon user instruction. In accordance with some embodiments, individualized volume control over video conferencing participants may be provided. In accordance with some embodiments, the resolution and/or frame rate of video data captured at a source device involved in a video conferencing session may be adaptively varied, for example, during capture and/or processing before encoding based on the detected audio activity level of the user of that source device.

First claim

Opening claim text (preview).

1 . A non-transitory computer program product encoded with instructions that, when executed by one or more processors, causes a process to be carried out, the process comprising: receiving audio data in a video conferencing session; analyzing the audio data to determine an audio activity level of at least one participant of the video conferencing session; and adjusting a video composition of a graphical user interface (GUI) based on the audio activity level of the at least one participant. 2 . The non-transitory computer program product of claim 1 , wherein analyzing the audio data to determine the audio activity level of the at least one participant comprises: sampling the audio data received in the video conferencing session and computing therefrom an audio signature to identify which participant is associated with the audio data; and comparing the audio data against an audio threshold. 3 . The non-transitory computer program product of claim 1 , wherein upon comparing the audio data against the audio threshold, if the audio data exceeds the audio threshold, then adjusting the video composition of the GUI comprises: automatically transitioning presentation of a video stream representative of the participant from a thumbnail region of the GUI to a prominent region of the GUI; automatically transitioning presentation of a video stream representative of the participant from a thumbnail region of the GUI to a prominent region of the GUI and automatically transitioning presentation of a video stream representative of another participant from the prominent region of the GUI to the thumbnail region of the GUI; or maintaining presentation of a video stream representative of the participant within a prominent region of the GUI. 4 . The non-transitory computer program product of claim 1 , wherein upon comparing the audio data against the audio threshold, if the audio data does not exceed the audio threshold, then adjusting the video composition of the GUI comprises: automatically transitioning presentation of a video stream representative of the participant from a prominent region of the GUI to a thumbnail region of the GUI; or maintaining presentation of a video stream representative of the participant within a thumbnail region of the GUI. 5 . The non-transitory computer program product of claim 1 , wherein adjusting the video composition of the GUI comprises at least one of: transitioning presentation of a video stream representative of at least one of a remote participant and an object/scene of interest between a prominent region of the GUI and a thumbnail region of the GUI; adjusting a resolution of a video stream representative of at least one remote participant; and adjusting a frame rate of a video stream representative of at least one remote participant. 6 . The non-transitory computer program product of claim 1 , wherein adjusting the video composition of the GUI is performed automatically based on the audio activity level of a local or remote participant causing the adjusting. 7 . The non-transitory computer program product of claim 1 , wherein adjusting the video composition of the GUI is further based on input received via a touch-sensitive display on which the GUI is presented. 8 . The non-transitory computer program product of claim 1 , wherein at least a portion of the process is carried out via at least one of an IR.94-based implementation and a WebRTC-based implementation. 9 . A non-transitory computer program product encoded with instructions that, when executed by one or more processors, causes a process to be carried out, the process comprising: receiving audio data in a video conferencing session, the audio data including at least one audio stream associated with an individual remote video conferencing participant; and adjusting a volume level of the at least one audio stream associated with the individual remote video conferencing participant. 10 . The non-transitory computer program product of claim 9 , wherein at least a portion of the process is carried out via a WebRTC-based implementation. 11 . The non-transitory computer program product of claim 9 , wherein: prior to adjusting the volume level of the at least one audio stream associated with the individual remote video conferencing participant, the process further comprises splitting the audio data into a plurality of audio streams, the plurality including the at least one audio stream associated with the individual remote video conferencing participant; and after adjusting the volume level of the at least one audio stream associated with the individual remote video conferencing participant, the process further comprises re-synthesizing the plurality of audio streams into a single audio stream. 12 . The non-transitory computer program product of claim 11 , wherein at least a portion of the process is carried out via an IR.94-based implementation. 13 . A non-transitory computer program product encoded with instructions that, when executed by one or more processors, causes a process to be carried out, the process comprising: receiving audio data in a video conferencing session; analyzing the audio data to determine therefrom an audio activity level of a local participant of the video conferencing session; and adjusting at least one of a resolution and a frame rate of video data transmitted in the video conferencing session based on the audio activity level of the local participant. 14 . The non-transitory computer program product of claim 13 , wherein adjusting at least one of the resolution and the frame rate of the video data transmitted in the video conferencing session comprises: adjusting at least one of a capture resolution and a capture frame rate of an image capture device configured to capture the video data before encoding thereof. 15 . The non-transitory computer program product of claim 13 , wherein adjusting at least one of the resolution and the frame rate of the video data transmitted in the video conferencing session comprises: scaling at least one of the resolution and the frame rate of captured video data before encoding thereof. 16 . The non-transitory computer program product of claim 13 , wherein analyzing the audio data to determine therefrom the audio activity level of the local participant comprises: sampling the audio data received in the video conferencing session and computing therefrom an audio signature to identify which participant is associated with the audio data; and comparing the audio data against an audio threshold. 17 . The non-transitory computer program product of claim 16 , wherein upon comparing the audio data against the audio threshold, if the audio data exceeds the audio threshold, then adjusting at least one of the resolution and the frame rate of the video data comprises at least one of: automatically increasing at least one of a capture resolution and a capture frame rate of an image capture device configured to capture the video data before encoding thereof; and automatically upscaling at least one of the resolution and the frame rate of the video data before encoding thereof. 18 . The non-transitory computer program product of claim 16 , wherein upon comparing the audio data against the audio threshold, if the audio data does not exceed the audio threshold, then adjusting at least one of the resolution and the frame rate of the video data comprises at least one of: automatically decreasing at least one of a capture resolution and a capture frame rate of an image capture device configured

Assignees

Inventors

Classifications

  • H04N7/15Primary

    Conference systems · CPC title

  • Network arrangements for conference optimisation or adaptation · CPC title

  • Measuring mean amplitude; Measuring mean power; Measuring time integral of power · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017280098A1 cover?
Techniques are disclosed for enhancing user experience in video conferencing. In accordance with some embodiments, the graphical user interface (GUI) displayed on a device involved in a video conferencing session may undergo dynamic adjustment of its video composition, for example, to render video content in either a prominent or a thumbnail region of the GUI. Reorganization of the GUI's video …
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification H04N7/15. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Sep 28 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).