Generating summary text compositions
US-2022084524-A1 · Mar 17, 2022 · US
US11870835B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11870835-B2 |
| Application number | US-202117182512-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 23, 2021 |
| Priority date | Feb 23, 2021 |
| Publication date | Jan 9, 2024 |
| Grant date | Jan 9, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The technology disclosed herein enables user notification of word inconsistencies to indicate session quality. In a particular embodiment, a method includes, during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant, determining a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session. The method also includes determining a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session. Upon determining that an inconsistency exists between the first number and the second number, the method includes notifying at least one of the first participant and the second participant about the inconsistency.
Opening claim text (preview).
What is claimed is: 1. A method comprising: during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant: determining a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session; determining a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session; and upon determining that an inconsistency exists between the first number and the second number, notifying at least one of the first participant and the second participant about the inconsistency. 2. The method of claim 1 , comprising: presenting information about the first number and the second number to the first participant. 3. The method of claim 1 , comprising: receiving a message indicating the second number from the second endpoint; and after receiving the message, comparing the second number to the first number to determine the inconsistency. 4. The method of claim 1 , wherein the inconsistency comprises a difference between the first number and the second number being greater than a threshold number of words. 5. The method of claim 1 , comprising: generating a text transcript of the words spoken by the first participant; and transferring the text transcript to the second endpoint for presentation to the second participant. 6. The method of claim 5 , comprising: receiving a second text transcript of the words spoken by the first participant based on the audio received at the second endpoint; and presenting the first participant with an indication of words missing from the second text transcript relative to the text transcript. 7. The method of claim 1 , wherein notifying at least one of the first participant and the second participant about the inconsistency comprises: presenting, via the second endpoint, an alert to the second participant indicating that fewer than all of the words spoken by the first participant are being reproduced for the second participant. 8. The method of claim 1 , comprising: presenting, via the second endpoint, a metric relating the second number to the first number. 9. The method of claim 1 , comprising: determining a third number of the words spoken by the first participant during the period of time based on audio received, via the communication session, at a server facilitating the communication session; and in response to determining that a second inconsistency exists between the first number and the third number, notifying the first participant that an issue exists between the first endpoint and the server. 10. The method of claim 1 , comprising: monitoring for packet loss in the audio at the second endpoint; and determining that the packet loss satisfies a packet loss threshold, wherein determining the first number and determining the second number occurs in response to determining that the packet loss satisfies the packet loss threshold. 11. An apparatus comprising: one or more computer readable storage media; a processing system operatively coupled with the one or more computer readable storage media; and program instructions stored on the one or more computer readable storage media that, when read and executed by the processing system, direct the processing system to: during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant: determine a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session; determine a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session; and upon determining that an inconsistency exists between the first number and the second number, notify at least one of the first participant and the second participant about the inconsistency. 12. The apparatus of claim 11 , wherein the program instructions direct the processing system to: present information about the first number and the second number to the first participant. 13. The apparatus of claim 11 , wherein the program instructions direct the processing system to: receive a message indicating the second number from the second endpoint; and after receiving the message, compare the second number to the first number to determine the inconsistency. 14. The apparatus of claim 11 , wherein the inconsistency comprises a difference between the first number and the second number being greater than a threshold number of words. 15. The apparatus of claim 11 , wherein the program instructions direct the processing system to: generate a text transcript of the words spoken by the first participant; and transfer the text transcript to the second endpoint for presentation to the second participant. 16. The apparatus of claim 15 , wherein the program instructions direct the processing system to: receive a second text transcript of the words spoken by the first participant based on the audio received at the second endpoint; and present the first participant with an indication of words missing from the second text transcript relative to the text transcript. 17. The apparatus of claim 11 , wherein to notify at least one of the first participant and the second participant about the inconsistency, the program instructions direct the processing system to: present, via the second endpoint, an alert to the second participant indicating that fewer than all of the words spoken by the first participant are being reproduced for the second participant. 18. The apparatus of claim 11 , wherein the program instructions direct the processing system to: determine a third number of the words spoken by the first participant during the period of time based on audio received, via the communication session, at a server facilitating the communication session; and in response to determining that a second inconsistency exists between the first number and the third number, notify the first participant that an issue exists between the first endpoint and the server. 19. The apparatus of claim 11 , wherein the program instructions direct the processing system to: monitor for packet loss in the audio at the second endpoint; and determine that the packet loss satisfies a packet loss threshold, wherein the first number and the second number are determined in response to determining that the packet loss satisfies the packet loss threshold. 20. One or more non-transitory computer readable storage media having program instructions stored thereon that, when read and executed by a processing system, direct the processing system to: during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant: determine a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session; determine a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session; and upon determining that an inconsistency exists between the first number and the second number, notify at least one of the first particip
Responding to QoS · CPC title
Speech classification or search · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title
Packet loss · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.