Methods and apparatus to improve user experience on computing devices
US-2021109585-A1 · Apr 15, 2021 · US
US12323558B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12323558-B2 |
| Application number | US-202117213680-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 26, 2021 |
| Priority date | Mar 26, 2021 |
| Publication date | Jun 3, 2025 |
| Grant date | Jun 3, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Techniques for video conferencing are disclosed. In one embodiment, a mirror assembly can be used to position a mirror in front of a camera, reflecting a view of the camera downward towards a whiteboard or other drawing surface, allowing a user to show a drawing during a video conference. In another embodiment, video data and audio data of a user in a video conference can be analyzed to determine what portions of the video data and audio data should be transmitted to a remote participant in the video conference. Unintentional input supplied by the user in the video data and audio data may be used to determine which portions of the video data and audio data to transmit.
Opening claim text (preview).
The invention claimed is: 1. A computing device comprising: input controller circuitry to: receive video data of a user in a video conference; and receive audio data of the user in the video conference; audiovisual analyzer circuitry to perform an analysis of the video data and the audio data, wherein to perform an analysis of the video data and the audio data comprises to perform an analysis of the video data to determine that the user is talking to a person in a background of the video data; and audiovisual framer circuitry to determine which portion of the video data or the audio data to transmit based on the analysis of the video data and the audio data, wherein to determine which portion of the video data or the audio data to transmit comprises to determine not to transmit a portion of the audio data corresponding to the user talking to the person in the background of the video data. 2. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to monitor mannerisms of the user in the video conference, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on the mannerisms of the user in the video conference. 3. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine that the user is eating, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on the determination that the user is eating. 4. The computing device of claim 1 , wherein to perform the analysis of the video data and the audio data comprises to: determine that a first user in the video data is speaking and that a second user in the video data is not speaking; and perform sound localization on the audio data to generate a localized sound corresponding to the first user that is speaking, wherein to determine which portion of the video data or the audio data to transmit comprises to determine that the localized sound corresponding to the first user that is speaking is to be transmitted. 5. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine that the user has stood up, wherein to determine which portion of the video data or the audio data to transmit comprises to determine that no video is to be transmitted based on the determination that the user has stood up. 6. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine that a person has entered an environment behind the user, wherein to determine which portion of the video data or the audio data to transmit comprises to determine that video showing the person that entered the environment behind the user should not be transmitted. 7. The computing device of claim 1 , wherein to perform an analysis of the video data and the audio data comprises to determine a gaze of the user, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on a gaze of the user. 8. A method comprising: receiving, by a computing device, video data of a user in a video conference; receiving, by the computing device, audio data of the user in the video conference; performing, by the computing device, an analysis of the video data and the audio data, wherein performing the analysis of the video data and the audio data comprises performing an analysis of the video data to determine that the user is not facing a camera used to capture the video data; and determining, by the computing device, which portion of the video data or the audio data to transmit based on the analysis of the video data and the audio data, wherein determining which portion of the video data or the audio data to transmit comprises determining not to transmit a portion of the audio data corresponding to the user not facing the camera. 9. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises monitoring mannerisms of the user in the video conference, wherein determining which portion of the video data or the audio data to transmit comprises determining which portion of the video data or the audio data to transmit based on the mannerisms of the user in the video conference. 10. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises determining that the user is eating, wherein determining which portion of the video data or the audio data to transmit comprises determining which portion of the video data or the audio data to transmit based on the determination that the user is eating. 11. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises determining that a first user in the video data is speaking and that a second user in the video data is not speaking, wherein determining which portion of the video data or the audio data to transmit comprises determining that a portion of the video data with the first user that is speaking is to be transmitted. 12. The method of claim 11 , wherein performing the analysis of the video data and the audio data comprises performing sound localization on the audio data to generate a localized sound corresponding to the first user that is speaking, wherein determining which portion of the video data or the audio data to transmit comprises determining that the localized sound corresponding to the first user that is speaking is to be transmitted. 13. The method of claim 8 , wherein performing an analysis of the video data and the audio data comprises determining that the user has stood up, wherein determining which portion of the video data or the audio data to transmit comprises determining that no video is to be transmitted based on the determination that the user has stood up. 14. One or more non-transitory computer-readable media comprising a plurality of instructions stored thereon that, when executed, causes a computing device to: receive video data of a user in a video conference; receive audio data of the user in the video conference; perform an analysis of the video data and the audio data, wherein to perform the analysis of the video data and the audio data comprises to perform an analysis of the video data to determine that the user is not facing a camera used to capture the video data; and determine which portion of the video data or the audio data to transmit based on the analysis of the video data and the audio data, wherein to determine which portion of the video data or the audio data to transmit comprises to determine not to transmit a portion of the audio data corresponding to the user not facing the camera. 15. The one or more non-transitory computer-readable media of claim 14 , wherein to perform an analysis of the video data and the audio data comprises to monitor mannerisms of the user in the video conference, wherein to determine which portion of the video data or the audio data to transmit comprises to determine which portion of the video data or the audio data to transmit based on the mannerisms of the user in the video conference. 16. The one or more non-transitory computer-readable media of claim 14 , wherein to perform an analysis of the video data and the audio data comprises to determine that the user is
Static body considered as a whole, e.g. static pedestrian or occupant recognition · CPC title
involving storage of or access to video conference sessions (tracking arrangements for later retrieval of a computer conference content or participants activities H04L12/1831) · CPC title
Eye tracking input arrangements (G06F3/015 takes precedence) · CPC title
Constructional details of the terminal equipment, e.g. arrangements of the camera and the display · CPC title
Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.