Methods for the encoding of participants in a conference

US9936077B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9936077-B2
Application numberUS-201414451453-A
CountryUS
Kind codeB2
Filing dateAug 5, 2014
Priority dateAug 5, 2013
Publication dateApr 3, 2018
Grant dateApr 3, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method are presented for the encoding of participants in a conference setting. In an embodiment, audio from conference participants in a voice-over-IP setting may be received and processed by the system. In an embodiment, audio may be received in a compressed form and de-compressed for processing. For each participant, return audio is generated, compressed (if applicable) and transmitted to the participant. The system may recognize when participants are using the same audio encoding format and are thus receiving audio that may be similar or identical. The audio may only be encoded once instead of for each participant. Thus, redundant encodings are recognized and eliminated resulting in less CPU usage.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for reducing computational complexity of participants in a voice-over-IP conference communication setting through recognition and elimination of redundant operations, wherein said method comprises steps of: a. receiving audio for participants to the conference communication setting by a means for signal processing; b. determining a state for each participant, by the means for signal processing wherein each participant is assigned an active state or a non-active state; c. processing, by a conference system, said audio to generate return audio for each participant, wherein said processing comprises: i. recognizing the non-active state participants using similar audio encoding formats and which are receiving similar audio; and ii. producing a single encoding of the similar audio, whereby the redundant similar audio has been removed, for the non-active participants having similar attributes; and d. transmitting, by the conference system, said return audio to said participants, wherein said participants having similar attributes receive the return audio with eliminated redundant operations. 2. The method of claim 1 , wherein the attributes comprise one or more of: compression format, packetization size, gain setting, IVR audio, and Auxiliary audio. 3. The method of claim 1 , wherein the encoding of said similar audio for participants having similar attributes comprises: a. encoding a bit pattern identifying attributes of a participant; and b. combining said bit patterns for all attributes to form a signature. 4. The method of claim 3 , wherein the signature is formed as a 64-bit unsigned integer with a bit range reserved for an attribute. 5. The method of claim 4 , wherein the bit ranges comprise: 0-15, 16-23, 24-27, 28-35, 36-51, and 52-64. 6. The method of claim 1 , wherein step (c.i.) is performed by a means for conference mixing, further comprising steps of: a. reading a segment of audio equal to a time interval for each participant; b. determining which participants contribute to the audio for said time interval and assigning to at least two sets the participants; c. assigning a unique encoder instance to each set of participants; d. preparing the audio and writing the audio to a unique encoder instance, wherein i. the audio is analyzed to determine which inputs contribute to the audio; ii. an output participant list is formed from the analysis; iii. an encoder is assigned to each participant output; and iv. the required number of unique mixes are produced by a means for mixing; and e. writing said encoded audio to each participant associated with an encoder instance. 7. The method of claim 6 , wherein the time interval is 10milliseconds. 8. The method of claim 6 , wherein the time interval is 20milliseconds. 9. The method of claim 6 , wherein the determining of step (b) is made based on at least one of: the audio segment, the current state of the participant, the prior state of the participant, the prior state of the participant as to the speaking state of an other participant, and conference settings. 10. The method of claim 1 , wherein the active participants are speaking. 11. The method of claim 6 , wherein the encoder instance is assigned by designated criteria. 12. The method of claim 11 , wherein the designated criteria comprise: a. determining if a participant is contributing for the time interval and, i. if the participant is contributing for said time interval and has an assigned encoder instance shared with at least one other participant, the participant is assigned a new unique encoder instance with the state copied from the previous encoder instance to maintain audio continuity; and ii. if the participant is not contributing for said time interval, grouping the participant with any other participants based on attribute signature, and assigning the participants in the group a matching encoder. 13. The method of claim 12 , wherein the number of encoder instances is equal to the number of participants. 14. The method of claim 1 , wherein a participant is determined to be contributing by an alert sent by a signal processor when a participant begins producing desired sounds. 15. The method of claim 14 , wherein said desired sounds comprise speech. 16. The method of claim 1 , wherein there is a configured maximum number of conference participants who contribute and said configuration is capable of being exceeded temporarily. 17. The method of claim 1 , wherein said similar audio comprises identical encoding formats. 18. The method of claim 1 , wherein said conference communication setting comprises a video conference. 19. The method of claim 1 , wherein said operations comprise encodings. 20. A method for reducing computational complexity of participants in a voice-over-IP conference call setting through recognition and elimination of redundant encodings, wherein said method comprises steps of: a. receiving an alert from a signal processor when a participant begins producing sound; b. adding said participant, by the signal processor, to a listing of active participants, wherein said active participants are producing sound; c. determining, by the signal processor, if the sound from said participant is speech, wherein the participant is removed from the listing if said sound is determined not to be speech; d. recognizing, by a conference system, if participants who are determined to not be active are using a similar audio encoding format and are receiving similar audio; e. producing, by the conference system, a single encoding of the similar audio, whereby the redundant similar audio has been removed, for the participants having similar attributes who are determined to not be active; and f. preparing and writing, by the conference system, said encoded audio to each participant associated with an encoder instance. 21. The method of claim 20 , wherein step (d) comprises steps of: a. identifying attributes of a participant and encoding a bit pattern identifying a value assigned to said attributes; b. combining said bit patterns for attributes to form a signature with bit ranges reserved for each attribute, wherein participants are using a similar audio encoding format and receiving similar audio if they have the same signature and they are not in the listing of active participants. 22. The method of claim 20 , wherein step (f) further comprises steps of: a. preparing an audio buffer containing a sum of the audio input by active participants; and b. preparing a buffer containing audio from the buffer prepared in step (a) less the audio from the not active participants. 23. The method of claim 20 , wherein encoding said similar audio includes assigning an encoder instance using designated criteria. 24. The method of claim 23 , wherein said designated criteria comprise: a. assigning a unique encoder instance with a state copied from a previous encoder instance to maintain audio continuity to an active participant; and b. assigning a shared encoder instance to a grouping of non-active participants by attribute signature. 25. The method of claim 21 , wherein the attributes comprise one or more of: compression format, packetization size, gain setting, IVR audio, and Auxiliary audio. 26. The method of claim 21 , wherein the signature is formed as a 64-bit unsigned integer with a bit range reserved for an attribute.

Assignees

Inventors

Classifications

  • Arrangements for multi-party communication, e.g. for conferences (data switching systems for conference H04L12/18; arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities H04M3/56; television conferencing systems H04N7/15) · CPC title

  • Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP) , including next generation networks with a packet-switched transport layer (H04L65/00 takes precedence; aspects not specific to the type of network H04M3/00; special services in those networks H04M3/42) · CPC title

  • Speech codec negotiation (H04L65/1069 takes precedence) · CPC title

  • Services specially adapted for wireless communication networks; Facilities therefor · CPC title

  • H04M3/568Primary

    audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants (echo suppression in two-way loud-speaking telephone systems H04M9/02; sound field processing per se H04S7/30) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9936077B2 cover?
A system and method are presented for the encoding of participants in a conference setting. In an embodiment, audio from conference participants in a voice-over-IP setting may be received and processed by the system. In an embodiment, audio may be received in a compressed form and de-compressed for processing. For each participant, return audio is generated, compressed (if applicable) and trans…
Who is the assignee on this patent?
Interactive Intelligence Inc, Genesys Telecommunications Laboratories Inc
What technology area does this patent fall under?
Primary CPC classification H04M3/568. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Apr 03 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).