Assistance during audio and video calls

US11765113B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11765113-B2
Application number  US-202217991300-A
Country             US
Kind code           B2
Filing date         Nov 21, 2022
Priority date       Jul 30, 2017
Publication date    Sep 19, 2023
Grant date          Sep 19, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Implementations relate to providing information items for display during a communication session. In some implementations, a computer-implemented method includes receiving, during a communication session between a first computing device and a second computing device, first media content from the communication session. The method further includes determining a first information item for display in the communication session based at least in part on the first media content. The method further includes sending a first command to at least one of the first computing device and the second computing device to display the first information item.
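
The three steps in the abstract (receive media content from the session, determine an information item, send a display command) can be illustrated with a minimal sketch. Everything named here is a hypothetical stand-in for illustration: `ITEM_CATALOG`, `determine_information_item`, `handle_session_content`, and the keyword lookup are assumptions, not the method the patent discloses, and a transcript string stands in for the media content.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DisplayCommand:
    """Hypothetical command telling devices to display an information item."""
    target_devices: List[str]
    information_item: str


# Hypothetical keyword-to-item catalog; a real system would use richer analysis.
ITEM_CATALOG = {
    "weather": "Weather forecast card",
    "restaurant": "Nearby restaurants card",
}


def determine_information_item(media_transcript: str) -> Optional[str]:
    """Pick an information item based on the received media content."""
    text = media_transcript.lower()
    for keyword, item in ITEM_CATALOG.items():
        if keyword in text:
            return item
    return None


def handle_session_content(
    media_transcript: str, first_device: str, second_device: str
) -> Optional[DisplayCommand]:
    """Receive content, determine an item, and build the display command."""
    item = determine_information_item(media_transcript)
    if item is None:
        return None  # nothing relevant to display
    return DisplayCommand([first_device, second_device], item)


command = handle_session_content(
    "What's the weather like tomorrow?", "device-a", "device-b"
)
```

In this sketch the command targets both devices; the abstract only requires that it go to at least one of them.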

First claim

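Claim text. Claim 1 is the first independent claim; claims 8 and 15 are the corresponding computing device and computer-readable medium claims.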

The invention claimed is:

1. A computer-implemented method comprising: determining a user preference associated with a user for a particular virtual assistant from a set of assistants; receiving, during a video communication session between a first computing device associated with the user and a second computing device, first session content from the video communication session; detecting that the first session content includes a request for media; selecting the particular virtual assistant based on the user preference; and sending, by the particular virtual assistant, a first command to at least one of the first computing device or the second computing device to display the media.

2. The method of claim 1, further comprising: requesting, by the particular virtual assistant, output from a second virtual assistant of the set of assistants, wherein the second virtual assistant provides a different service than the particular virtual assistant is operable to provide.

3. The method of claim 2, wherein the different service is translation of text-to-speech to provide speech output in a target language that is understood by the user.

4. The method of claim 1, wherein determining the user preference associated with the user is based on the user explicitly providing the user preference for the particular virtual assistant.

5. The method of claim 1, wherein determining the user preference associated with the user is based on at least one selected from the group of user feedback, the user performing an action based on the particular virtual assistant sending the first command, the user choosing an option from the particular virtual assistant that is not offered by other virtual assistants in the set of assistants, the user providing an indication of user satisfaction, and combinations thereof.

6. The method of claim 1, wherein the video communication session includes video that includes a face, and wherein the first command causes display of the media such that the face is not obscured.

7. The method of claim 1, wherein detecting that the first session content includes the request for media is based on determining from conversation context of the video communication session that an implicit invocation of the particular virtual assistant is associated with a confidence score that exceeds a score threshold.

8. A computing device comprising: a processor; and a memory coupled to the processor, with instructions stored thereon that, when executed by the processor, cause the processor to perform operations comprising: determining a user preference associated with a user for a particular virtual assistant from a set of assistants; receiving, during a video communication session between a first computing device associated with the user and a second computing device, first session content from the video communication session; detecting that the first session content includes a request for media; selecting the particular virtual assistant based on the user preference; and sending, by the particular virtual assistant, a first command to at least one of the first computing device or the second computing device to display the media.

9. The computing device of claim 8, wherein the operations further comprise: requesting, by the particular virtual assistant, output from a second virtual assistant of the set of assistants, wherein the second virtual assistant provides a different service than the particular virtual assistant is operable to provide.

10. The computing device of claim 9, wherein the different service is translation of text-to-speech to provide speech output in a target language that is understood by the user.

11. The computing device of claim 8, wherein determining the user preference associated with the user is based on the user explicitly providing the user preference for the particular virtual assistant.

12. The computing device of claim 8, wherein determining the user preference associated with the user is based on at least one selected from the group of user feedback, the user performing an action based on the particular virtual assistant sending the first command, the user choosing an option from the particular virtual assistant that is not offered by other virtual assistants in the set of assistants, the user providing an indication of user satisfaction, and combinations thereof.

13. The computing device of claim 8, wherein the video communication session includes video that includes a face, and wherein the first command causes display of the media such that the face is not obscured.

14. The computing device of claim 8, wherein detecting that the first session content includes the request for media is based on determining from conversation context of the video communication session that an implicit invocation of the particular virtual assistant is associated with a confidence score that exceeds a score threshold.

15. A non-transitory computer-readable medium with instructions stored thereon that, when executed by one or more computers, cause the one or more computers to perform operations, the operations comprising: determining a user preference associated with a user for a particular virtual assistant from a set of assistants; receiving, during a video communication session between a first computing device associated with the user and a second computing device, first session content from the video communication session; detecting that the first session content includes a request for media; selecting the particular virtual assistant based on the user preference; and sending, by the particular virtual assistant, a first command to at least one of the first computing device or the second computing device to display the media.

16. The non-transitory computer-readable medium of claim 15, wherein the operations further comprise: requesting, by the particular virtual assistant, output from a second virtual assistant of the set of assistants, wherein the second virtual assistant provides a different service than the particular virtual assistant is operable to provide.

17. The non-transitory computer-readable medium of claim 16, wherein the different service is translation of text-to-speech to provide speech output in a target language that is understood by the user.

18. The non-transitory computer-readable medium of claim 15, wherein determining the user preference associated with the user is based on the user explicitly providing the user preference for the particular virtual assistant.

19. The non-transitory computer-readable medium of claim 15, wherein determining the user preference associated with the user is based on at least one selected from the group of user feedback, the user performing an action based on the particular virtual assistant sending the first command, the user choosing an option from the particular virtual assistant that is not offered by other virtual assistants in the set of assistants, the user providing an indication of user satisfaction, and combinations thereof.

20. The non-transitory computer-readable medium of claim 15, wherein the video communication session includes video that includes a face, and wherein the first command causes display of the media such that the face is not obscured.
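
As a rough illustration of how the steps of claim 1 fit together with the confidence threshold of claim 7 and the face-avoiding display of claim 6, here is a minimal sketch. All names (`Rect`, `select_assistant`, `media_request_confidence`, `place_media`, `handle_session_content`), the cue-word scorer, the fallback to the first assistant, and the corner-placement strategy are assumptions made for illustration; the claims do not specify how the score is computed or how the display region is chosen.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Rect:
    x: int
    y: int
    w: int
    h: int

    def overlaps(self, other: "Rect") -> bool:
        """True if the two axis-aligned rectangles share any area."""
        return (self.x < other.x + other.w and other.x < self.x + self.w
                and self.y < other.y + other.h and other.y < self.y + self.h)


def select_assistant(preference: str, assistants: List[str]) -> str:
    """Select the preferred assistant; fall back to the first (assumption)."""
    return preference if preference in assistants else assistants[0]


def media_request_confidence(transcript: str) -> float:
    """Toy scorer: counts cue words. A real system would use conversation context."""
    cues = ("show", "play", "photo", "video", "picture")
    hits = sum(1 for cue in cues if cue in transcript.lower())
    return min(1.0, hits / 2)


def place_media(frame: Rect, face: Rect, w: int, h: int) -> Optional[Rect]:
    """Try each frame corner and return the first region that leaves the face visible."""
    right, bottom = frame.x + frame.w - w, frame.y + frame.h - h
    corners = [Rect(frame.x, frame.y, w, h), Rect(right, frame.y, w, h),
               Rect(frame.x, bottom, w, h), Rect(right, bottom, w, h)]
    for region in corners:
        if not region.overlaps(face):
            return region
    return None


def handle_session_content(transcript: str, preference: str, assistants: List[str],
                           frame: Rect, face: Rect,
                           threshold: float = 0.5) -> Optional[dict]:
    """Detect a media request, select the assistant, and build a display command."""
    if media_request_confidence(transcript) <= threshold:
        return None  # score must exceed the threshold (claim 7)
    region = place_media(frame, face, w=160, h=120)
    if region is None:
        return None  # no corner leaves the face unobscured
    return {"assistant": select_assistant(preference, assistants),
            "action": "display_media", "region": region}


command = handle_session_content(
    "Can you show me the photo from our trip?", "assistant-b",
    ["assistant-a", "assistant-b"],
    frame=Rect(0, 0, 640, 480), face=Rect(0, 0, 200, 200),
)
```

Here the face occupies the top-left of the frame, so the sketch places the media in the top-right corner. The comparison is strict (`<=` rejects) because claim 7 requires a score that exceeds the threshold.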

Assignees

Google LLC

Inventors

Classifications

  • G10L15/22 (Primary)

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • H04L51/10 (Primary)

    Multimedia information · CPC title

  • Language recognition · CPC title

  • for estimating an emotional state · CPC title

  • using artificial neural networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11765113B2 cover?
Implementations relate to providing information items for display during a communication session. In some implementations, a computer-implemented method includes receiving, during a communication session between a first computing device and a second computing device, first media content from the communication session. The method further includes determining a first information item for display …
Who is the assignee on this patent?
Google LLC
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Sep 19, 2023 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).