What technology area does this patent fall under?

Primary CPC classification H04M3/568. Mapped technology areas include Electricity.

When was this patent published?

Publication date Tue Jun 03 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Technologies for video conferencing

US12323558B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12323558-B2
Application number	US-202117213680-A
Country	US
Kind code	B2
Filing date	Mar 26, 2021
Priority date	Mar 26, 2021
Publication date	Jun 3, 2025
Grant date	Jun 3, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for video conferencing are disclosed. In one embodiment, a mirror assembly can be used to position a mirror in front of a camera, reflecting a view of the camera downward towards a whiteboard or other drawing surface, allowing a user to show a drawing during a video conference. In another embodiment, video data and audio data of a user in a video conference can be analyzed to determine what portions of the video data and audio data should be transmitted to a remote participant in the video conference. Unintentional input supplied by the user in the video data and audio data may be used to determine which portions of the video data and audio data to transmit.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computing device comprising: input controller circuitry to: receive video data of a user in a video conference; and receive audio data of the user in the video conference; audiovisual analyzer circuitry to perform an analysis of the video data and the audio data, wherein to perform an analysis of the video data and the audio data comprises to perform an analysis of the video data to determine that the user is talking to a person in a background of the video data; and audiovisual framer circuitry to determine which portion of the video data or the audio data to transmit based on the analysis of the video data and the audio data, wherein to determine which portion of the video data or the audio data to transmit comprises to determine not to transmit a portion of the audio data corresponding to the user talking to the person in the background of the video data. 2. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to monitor mannerisms of the user in the video conference, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on the mannerisms of the user in the video conference. 3. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine that the user is eating, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on the determination that the user is eating. 4. The computing device of claim 1 , wherein to perform the analysis of the video data and the audio data comprises to: determine that a first user in the video data is speaking and that a second user in the video data is not speaking; and perform sound localization on the audio data to generate a localized sound corresponding to the first user that is speaking, wherein to determine which portion of the video data or the audio data to transmit comprises to determine that the localized sound corresponding to the first user that is speaking is to be transmitted. 5. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine that the user has stood up, wherein to determine which portion of the video data or the audio data to transmit comprises to determine that no video is to be transmitted based on the determination that the user has stood up. 6. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine that a person has entered an environment behind the user, wherein to determine which portion of the video data or the audio data to transmit comprises to determine that video showing the person that entered the environment behind the user should not be transmitted. 7. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine a gaze of the user, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on a gaze of the user. 8. A method comprising: receiving, by a computing device, video data of a user in a video conference; receiving, by the computing device, audio data of the user in the video conference; performing, by the computing device, an analysis of the video data and the audio data, wherein performing the analysis of the video data and the audio data comprises performing an analysis of the video data to determine that the user is not facing a camera used to capture the video data; and determining, by the computing device, which portion of the video data or the audio data to transmit based on the analysis of the video data and the audio data, wherein determining which portion of the video data or the audio data to transmit comprises determining not to transmit a portion of the audio data corresponding to the user not facing the camera. 9. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises monitoring mannerisms of the user in the video conference, wherein determining which portion of the video data or the audio data to transmit comprises determining which portion of the video data or the audio data to transmit based on the mannerisms of the user in the video conference. 10. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises determining that the user is eating, wherein determining which portion of the video data or the audio data to transmit comprises determining which portion of the video data or the audio data to transmit based on the determination that the user is eating. 11. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises determining that a first user in the video data is speaking and that a second user in the video data is not speaking, wherein determining which portion of the video data or the audio data to transmit comprises determining that a portion of the video data with the first user that is speaking is to be transmitted. 12. The method of claim 11 , wherein performing the analysis of the video data and the audio data comprises performing sound localization on the audio data to generate a localized sound corresponding to the first user that is speaking, wherein determining which portion of the video data or the audio data to transmit comprises determining that the localized sound corresponding to the first user that is speaking is to be transmitted. 13. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises determining that the user has stood up, wherein determining which portion of the video data or the audio data to transmit comprises determining that no video is to be transmitted based on the determination that the user has stood up. 14. One or more non-transitory computer-readable media comprising a plurality of instructions stored thereon that, when executed, causes a computing device to: receive video data of a user in a video conference; receive audio data of the user in the video conference; perform an analysis of the video data and the audio data, wherein to perform the analysis of the video data and the audio data comprises to perform an analysis of the video data to determine that the user is not facing a camera used to capture the video data; and determine which portion of the video data or the audio data to transmit based on the analysis of the video data and the audio data, wherein to determine which portion of the video data or the audio data to transmit comprises to determine not to transmit a portion of the audio data corresponding to the user not facing the camera. 15. The one or more non-transitory computer-readable media of claim 14 , wherein to perform an analysis of the video data and the audio data comprises to monitor mannerisms of the user in the video conference, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on the mannerisms of the user in the video conference. 16. The one or more non-transitory computer-readable media of claim 14 , wherein to perform an analysis of the video data and the audio data comprises to determine that the user is

Assignees

Intel Corp

Inventors

Classifications

G06V40/103
Static body considered as a whole, e.g. static pedestrian or occupant recognition · CPC title
H04N7/155
involving storage of or access to video conference sessions (tracking arrangements for later retrieval of a computer conference content or participants activities H04L12/1831) · CPC title
G06F3/013
Eye tracking input arrangements (G06F3/015 takes precedence) · CPC title
H04N7/142
Constructional details of the terminal equipment, e.g. arrangements of the camera and the display · CPC title
G06V20/49
Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes · CPC title

Patent family

Related publications grouped by family.

View patent family 76764292

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12323558B2 cover?: Techniques for video conferencing are disclosed. In one embodiment, a mirror assembly can be used to position a mirror in front of a camera, reflecting a view of the camera downward towards a whiteboard or other drawing surface, allowing a user to show a drawing during a video conference. In another embodiment, video data and audio data of a user in a video conference can be analyzed to determi…
Who is the assignee on this patent?: Intel Corp
What technology area does this patent fall under?: Primary CPC classification H04M3/568. Mapped technology areas include Electricity.
When was this patent published?: Publication date Tue Jun 03 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).