Word-based representation of communication session quality

US11870835B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11870835-B2
Application number  US-202117182512-A
Country             US
Kind code           B2
Filing date         Feb 23, 2021
Priority date       Feb 23, 2021
Publication date    Jan 9, 2024
Grant date          Jan 9, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The technology disclosed herein enables user notification of word inconsistencies to indicate session quality. In a particular embodiment, a method includes, during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant, determining a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session. The method also includes determining a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session. Upon determining that an inconsistency exists between the first number and the second number, the method includes notifying at least one of the first participant and the second participant about the inconsistency.
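The comparison described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: it assumes speech-to-text has already run separately on the sound captured at the first endpoint and on the audio received at the second endpoint, so that each side has a transcript for the same time window. The names `count_words`, `check_window`, `Inconsistency`, and the `threshold` parameter are hypothetical; the threshold mirrors the dependent claims that treat a difference greater than a threshold number of words as an inconsistency.

```python
from dataclasses import dataclass
from typing import Optional


def count_words(transcript: str) -> int:
    """Count whitespace-separated words in the transcript of one time window."""
    return len(transcript.split())


@dataclass
class Inconsistency:
    """Hypothetical record handed to whatever notifies the participants."""
    captured: int   # first number: words in the sound captured at the first endpoint
    received: int   # second number: words in the audio received at the second endpoint

    @property
    def missing(self) -> int:
        return self.captured - self.received


def check_window(captured_text: str, received_text: str,
                 threshold: int = 0) -> Optional[Inconsistency]:
    """Return an Inconsistency when the two counts differ by more than `threshold`.

    Both transcripts are assumed to cover the same period of time.
    """
    captured = count_words(captured_text)
    received = count_words(received_text)
    if abs(captured - received) > threshold:
        return Inconsistency(captured, received)
    return None


if __name__ == "__main__":
    result = check_window("please send the report by friday",
                          "please send report friday")
    if result is not None:
        print(f"{result.received} of {result.captured} words received")
```

Comparing counts rather than raw audio keeps the exchanged data small: under these assumptions the second endpoint only needs to report one integer per window.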

First claim

Opening claim text (preview).

What is claimed is:

1. A method comprising: during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant: determining a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session; determining a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session; and upon determining that an inconsistency exists between the first number and the second number, notifying at least one of the first participant and the second participant about the inconsistency.

2. The method of claim 1, comprising: presenting information about the first number and the second number to the first participant.

3. The method of claim 1, comprising: receiving a message indicating the second number from the second endpoint; and after receiving the message, comparing the second number to the first number to determine the inconsistency.

4. The method of claim 1, wherein the inconsistency comprises a difference between the first number and the second number being greater than a threshold number of words.

5. The method of claim 1, comprising: generating a text transcript of the words spoken by the first participant; and transferring the text transcript to the second endpoint for presentation to the second participant.

6. The method of claim 5, comprising: receiving a second text transcript of the words spoken by the first participant based on the audio received at the second endpoint; and presenting the first participant with an indication of words missing from the second text transcript relative to the text transcript.

7. The method of claim 1, wherein notifying at least one of the first participant and the second participant about the inconsistency comprises: presenting, via the second endpoint, an alert to the second participant indicating that fewer than all of the words spoken by the first participant are being reproduced for the second participant.

8. The method of claim 1, comprising: presenting, via the second endpoint, a metric relating the second number to the first number.

9. The method of claim 1, comprising: determining a third number of the words spoken by the first participant during the period of time based on audio received, via the communication session, at a server facilitating the communication session; and in response to determining that a second inconsistency exists between the first number and the third number, notifying the first participant that an issue exists between the first endpoint and the server.

10. The method of claim 1, comprising: monitoring for packet loss in the audio at the second endpoint; and determining that the packet loss satisfies a packet loss threshold, wherein determining the first number and determining the second number occurs in response to determining that the packet loss satisfies the packet loss threshold.

11. An apparatus comprising: one or more computer readable storage media; a processing system operatively coupled with the one or more computer readable storage media; and program instructions stored on the one or more computer readable storage media that, when read and executed by the processing system, direct the processing system to: during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant: determine a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session; determine a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session; and upon determining that an inconsistency exists between the first number and the second number, notify at least one of the first participant and the second participant about the inconsistency.

12. The apparatus of claim 11, wherein the program instructions direct the processing system to: present information about the first number and the second number to the first participant.

13. The apparatus of claim 11, wherein the program instructions direct the processing system to: receive a message indicating the second number from the second endpoint; and after receiving the message, compare the second number to the first number to determine the inconsistency.

14. The apparatus of claim 11, wherein the inconsistency comprises a difference between the first number and the second number being greater than a threshold number of words.

15. The apparatus of claim 11, wherein the program instructions direct the processing system to: generate a text transcript of the words spoken by the first participant; and transfer the text transcript to the second endpoint for presentation to the second participant.

16. The apparatus of claim 15, wherein the program instructions direct the processing system to: receive a second text transcript of the words spoken by the first participant based on the audio received at the second endpoint; and present the first participant with an indication of words missing from the second text transcript relative to the text transcript.

17. The apparatus of claim 11, wherein to notify at least one of the first participant and the second participant about the inconsistency, the program instructions direct the processing system to: present, via the second endpoint, an alert to the second participant indicating that fewer than all of the words spoken by the first participant are being reproduced for the second participant.

18. The apparatus of claim 11, wherein the program instructions direct the processing system to: determine a third number of the words spoken by the first participant during the period of time based on audio received, via the communication session, at a server facilitating the communication session; and in response to determining that a second inconsistency exists between the first number and the third number, notify the first participant that an issue exists between the first endpoint and the server.

19. The apparatus of claim 11, wherein the program instructions direct the processing system to: monitor for packet loss in the audio at the second endpoint; and determine that the packet loss satisfies a packet loss threshold, wherein the first number and the second number are determined in response to determining that the packet loss satisfies the packet loss threshold.

20. One or more non-transitory computer readable storage media having program instructions stored thereon that, when read and executed by a processing system, direct the processing system to: during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant: determine a first number of words spoken by the first participant during a period of time based on sound captured by the first endpoint for inclusion on the communication session; determine a second number of the words spoken by the first participant during the period of time based on audio received at the second endpoint via the communication session; and upon determining that an inconsistency exists between the first number and the second number, notify at least one of the first particip
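Several of the dependent claims add refinements that are easier to follow with a small example: listing the words missing from the received transcript, using a count taken at the server to narrow down where words are lost, and counting words only once packet loss crosses a threshold. The sketch below is illustrative only and makes several assumptions not found in the claims: transcripts are compared with Python's `difflib.SequenceMatcher`, "satisfies the threshold" is read as "loss ratio at or above the threshold", the 5% default is arbitrary, and the "server to second endpoint" result is an inference rather than claim language. All function names are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Optional


def missing_words(sent_transcript: str, received_transcript: str) -> List[str]:
    """List words present in the sender-side transcript but absent or altered
    in the transcript produced from the audio received at the far end."""
    sent = sent_transcript.lower().split()
    received = received_transcript.lower().split()
    matcher = SequenceMatcher(a=sent, b=received, autojunk=False)
    missing: List[str] = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        # "delete": words dropped entirely; "replace": words that came through
        # as something else. Both are treated as missing (an assumption).
        if tag in ("delete", "replace"):
            missing.extend(sent[i1:i2])
    return missing


def locate_issue(first_count: int, server_count: int, second_count: int,
                 threshold: int = 0) -> Optional[str]:
    """Guess which leg of the session is losing words.

    first_count: words counted from sound captured at the first endpoint.
    server_count: words counted from audio received at the server.
    second_count: words counted from audio received at the second endpoint.
    """
    if first_count - server_count > threshold:
        return "first endpoint to server"
    if first_count - second_count > threshold:
        # Inferred: words reached the server but not the second endpoint.
        return "server to second endpoint"
    return None


def should_count_words(packets_expected: int, packets_received: int,
                       loss_threshold: float = 0.05) -> bool:
    """Gate the word counting on observed packet loss at the second endpoint."""
    if packets_expected <= 0:
        return False
    loss = 1 - packets_received / packets_expected
    return loss >= loss_threshold


if __name__ == "__main__":
    print(missing_words("please send the report by friday",
                        "please send report friday"))  # ['the', 'by']
    print(locate_issue(first_count=6, server_count=6, second_count=4))
```

Gating on packet loss, as in the last function, would avoid running speech-to-text continuously on both endpoints when the network shows no sign of trouble.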

Assignees

Inventors

Classifications

  • H04L65/80 · Primary

    Responding to QoS · CPC title

  • Speech classification or search · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • Packet loss · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11870835B2 cover?
The technology disclosed herein enables user notification of word inconsistencies to indicate session quality. In a particular embodiment, a method includes, during a communication session between a first endpoint operated by a first participant and a second endpoint operated by a second participant, determining a first number of words spoken by the first participant during a period of time bas…
Who is the assignee on this patent?
Avaya Man Lp
What technology area does this patent fall under?
Primary CPC classification H04L65/80. Mapped technology areas include Electricity.
When was this patent published?
Publication date: Jan 9, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).