Computerized intelligent assistant for conferences

US11688399B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11688399-B2
Application numberUS-202017115293-A
CountryUS
Kind codeB2
Filing dateDec 8, 2020
Priority dateMay 4, 2018
Publication dateJun 27, 2023
Grant dateJun 27, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the first conference participant. A second computer-readable audio signal is processed similarly, to obtain a second text attributed to a second conference participant. A transcription machine automatically creates a transcript including the first text attributed to the first conference participant and the second text attributed to the second conference participant.

First claim

Opening claim text (preview).

The invention claimed is: 1. At least one computer-storage device embodying computer-usable instructions which, when executed by at least one processor, implement a method for providing computer assistance to participants of a conference, the method comprising: receiving audio of the participants; identifying the participants; translating the audio into text; generating a transcript of the conference; tracking at least one of an arrival or a departure of a first participant during the conference; generating a tailored transcript for the first participant based at least on the transcript of the conference and further selectively based on either the arrival or the departure of the first participant; and providing the tailored transcript to the first participant via at least one of a display, an audio output, an email, a link, a text message, a chat message, an electronic message, or a notification message. 2. The at least one computer-storage device of claim 1 , the method further comprising attributing portions of the text to one or more participants. 3. The at least one computer-storage device of claim 2 , the transcript generated based at least in part on the attribution of portions of the text to the one or more participants. 4. The at least one computer-storage device of claim 1 , the participants comprising local participants and/or remote participants. 5. The at least one computer-storage device of claim 1 , wherein the participants are identified based at least in part on the audio. 6. The at least one computer-storage device of claim 1 , comprising receiving video of the participants, wherein the participants are identified based at least in part on the audio and/or the video. 7. The at least one computer-storage device of claim 1 , wherein generating the tailored transcript comprises determining a time period during the conference when the first participant was absent, based at least in part on the arrival or the departure of the first participant, the tailored transcript comprising: a filtered portion of the transcript, a summarized portion of the transcript, or a contextualized portion of the transcript; or the transcript comprising an indication of a portion of the transcript that corresponds to the time period during the conference when the first participant was absent. 8. The at least one computer-storage device of claim 6 , wherein tracking at least one of the arrival or the departure of the first participant comprises determining an availability of the first participant to participate in the conference, the availability determined based on at least one of determining whether the first participant is logged in to the conference, determining whether the first participant is detectable in the audio, determining whether the first participant is detectable in the video, determining a physical location of the first participant, or detecting an audiovisual cue indicating that the first participant left the conference. 9. A computerized conference assistant for providing assistance to participants of a conference, the computerized conference assistant comprising: at least one processor; at least one storage device storing computer-usable instructions which, when executed by the at least one processor, implement operations comprising: receive audio of the participants; identify the participants; translate the audio into text; generate a transcript of the conference; track at least one of an arrival or a departure of a first participant during the conference; generate a tailored transcript for the first participant based at least on the transcript of the conference and further selectively based on either the arrival or the departure of the first participant; and providing the tailored transcript to the first participant via at least one of a display, an audio output, an email, a link, a text message, a chat message, an electronic message, or a notification message. 10. The computerized conference assistant of claim 9 , the operations further comprising attributing portions of the text to one or more participants, the transcript generated based at least in part on the attribution of portions of the text to the one or more participants. 11. The computerized conference assistant of claim 9 , wherein the participants are identified based at least in part on the audio. 12. The computerized conference assistant of claim 9 , the operations further comprising: receive video of the participants; and identify the participants based at least in part on the audio and/or the video. 13. The computerized conference assistant of claim 9 , wherein the tailored transcript is modified at least in part based on a time period during the conference when the first participant was absent, the time period based at least in part on the arrival or the departure of the first participant. 14. The computerized conference assistant of claim 13 , the operations further comprising: determining one or more times in the transcript comprising content that is of interest to the first participant; wherein the tailored transcript comprises: a filtered portion of the transcript, a summarized portion of the transcript, or a contextualized portion of the transcript, associated with the determined one or more times in the transcript; or the transcript and an indication of a portion of the transcript that corresponds to the time period during the conference when the first participant was absent. 15. The computerized conference assistant of claim 12 , wherein track at least one of the arrival or the departure of the first participant comprises determine whether the first participant is logged in to the conference, determine whether the first participant is detectable in the audio, determine whether the first participant is detectable in the video, determine a physical location of the first participant, or detect an audiovisual cue indicating that the first participant left the conference. 16. A method for providing computer assistance to participants of a conference, comprising: receiving audio of the participants; identifying the participants; generating a record of the conference; attributing portions of the record to one or more participants; tracking at least one of an arrival or a departure of a first participant during the conference; generating a tailored record of the conference for the first participant based at least on the record of the conference and further selectively based on either the arrival or the departure of the first participant; and providing the tailored record of the conference to the first participant via at least one of a display, an audio output, an email, a link, a text message, a chat message, an electronic message, or a notification message. 17. The method of claim 16 , the tailored record of the conference comprising audio, video, and/or text translated from the audio. 18. The method of claim 16 , wherein the participants are identified based at least in part on the audio and/or a received video of the conference. 19. The method of claim 16 , wherein the tailored record of the conference focuses on a time period during the conference when the first participant was absent, based at least in part on the arrival or the departure of the first participant. 20. The method of claim 19 , wherein the tailored record of the conference comprises a portion of the record of the conference, a summarized portion of the record of the conference, or a contextualized portion of the record of the confere

Assignees

Inventors

Classifications

  • G10L15/26Primary

    Speech to text systems (G10L15/08 takes precedence) · CPC title

  • Speaker identification or verification techniques · CPC title

  • Classification, e.g. identification · CPC title

  • Conference systems · CPC title

  • H04N7/147Primary

    Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals (selecting H04Q) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11688399B2 cover?
A method for facilitating a remote conference includes receiving a digital video and a computer-readable audio signal. A face recognition machine is operated to recognize a face of a first conference participant in the digital video, and a speech recognition machine is operated to translate the computer-readable audio signal into a first text. An attribution machine attributes the text to the f…
Who is the assignee on this patent?
Microsoft Technology Licensing Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 27 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).