Stream server that modifies a stream according to detected characteristics

US10652396B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10652396-B2
Application numberUS-201816144217-A
CountryUS
Kind codeB2
Filing dateSep 27, 2018
Priority dateSep 27, 2018
Publication dateMay 12, 2020
Grant dateMay 12, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A stream server manages connections by multiple clients. The stream server receives, sends and manages streams for each client. The stream server analyzes incoming streams and maintains a database of device signatures and a database of user profiles based on the analyzed data. The stream server receives a stream from a client, analyzes the stream to identify an applicable device signature, user profile, or both, and modifies the stream. Modification of the stream may include filtering noise, placing the user in receive-only mode, ending a session for a user, adjusting metadata, etc. In addition, modification of the stream may vary based on the role or authorization of the user, or based on commands from one or more of the users.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus comprising: at least one processor; a memory coupled to the at least one processor; a caller profile database residing in the memory that includes a plurality of entries, each of the plurality of entries comprising voice characterization data corresponding to a voice portion of a previously-received audio stream for an identified caller; a device signature database residing in the memory that includes a plurality of entries, each of the plurality of entries comprising a device identifier and corresponding device signature data corresponding to non-voice portions of a previously-received audio stream for an identified device corresponding to the device identifier, wherein the device signature data comprises repetitive sounds, transient sounds, and volume issues; and a voice stream server residing in the memory and executed by the at least one processor, the voice stream server receiving a plurality of audio streams from a plurality of callers, analyzing the plurality of audio streams for voice data and for non-voice data, identifying a first entry in the caller database that corresponds to a first audio stream of the plurality of audio streams, identifying a first entry in the device signature database that corresponds to the first audio stream, and modifying the first audio stream based on the analyzed voice data and the voice characterization data in the first entry in the caller database and based on the analyzed non-voice data and the device signature data in the first entry in the device signature database. 2. The apparatus of claim 1 further wherein the voice stream server comprises a voice to text transcription tool that transcribes the voice data in the first audio stream to text and compares the text to a key word list, and when specified criteria are satisfied with respect to the occurrence of words in the key word list in the text, the voice stream server modifies voice data in the first audio stream. 3. The apparatus of claim 1 wherein the voice stream server modifies the first audio stream by placing a first caller corresponding to the first audio stream in listen-only mode by not forwarding the voice data in the first audio stream received by the voice stream server. 4. The apparatus of claim 3 wherein the voice stream server provides a voice prompt to the first caller that states the first caller is in listen-only mode. 5. The apparatus of claim 1 wherein the voice stream server modifies the first audio stream by ending a session for a first caller corresponding to the first audio stream. 6. The apparatus of claim 5 wherein the voice stream server provides a voice prompt to the first caller before terminating the session for the first caller that states the session for the first caller is being terminated. 7. The apparatus of claim 1 wherein the voice stream server enables and disables modifying the first audio stream based on: caller role; caller authorization; and verbal commands from at least one caller. 8. The apparatus of claim 1 wherein when the identified caller corresponding to one of the plurality of entries in the caller profile database uses a plurality of devices for a plurality of past calls, the device signature database comprises a plurality of entries corresponding to the plurality of devices used by the identified caller. 9. An apparatus comprising: at least one processor; a memory coupled to the at least one processor; a caller profile database residing in the memory that includes a plurality of entries, each of the plurality of entries comprising voice characterization data corresponding to a voice portion of a previously-received audio stream for an identified caller; a device signature database residing in the memory that includes a plurality of entries, each of the plurality of entries comprising a device identifier and corresponding device signature data corresponding to non-voice portions of a previously-received audio stream for an identified device corresponding to the device identifier, wherein the device signature data comprises repetitive sounds, transient sounds, and volume issues; and a voice stream server residing in the memory and executed by the at least one processor, the voice stream server receiving a plurality of audio streams from a plurality of callers, analyzing the plurality of audio streams for voice data and for non-voice data, identifying a first entry in the caller database that corresponds to voice data in a first audio stream of the plurality of audio streams, modifying the first audio stream based on voice characterization data in the first entry in the caller database, identifying a first entry in the device signature database that corresponds to the first audio stream, and modifying the first audio stream based on the device signature data in the first entry in the device signature database, wherein the voice stream server comprises a voice to text transcription tool that transcribes the voice data in the first audio stream to text and compares the text to a key word list, and when specified criteria are satisfied with respect to the occurrence of words in the key word list in the text, the voice stream server modifies the voice data in the first audio stream by placing a first caller corresponding to the first audio stream in listen-only mode by not forwarding the voice data in the first audio stream received by the voice stream server, wherein the voice stream server provides a voice prompt to the first caller that states the first caller is in listen-only mode, wherein the voice stream server enables and disables modifying the first audio stream based on: caller role; caller authorization; and verbal commands from at least one caller. 10. The apparatus of claim 9 wherein when the identified caller corresponding to one of the plurality of entries in the caller profile database uses a plurality of devices for a plurality of past calls, the device signature database comprises a plurality of entries corresponding to the plurality of devices used by the identified caller. 11. A computer-implemented method executed by at least one processor for processing a plurality of audio streams from a plurality of callers, the method comprising: providing a caller profile database that includes a plurality of entries, each of the plurality of entries comprising voice characterization data corresponding to a voice portion of a previously-received audio stream for an identified caller; providing a device signature database residing in the memory that includes a plurality of entries, each of the plurality of entries comprising a device identifier and corresponding device signature data corresponding to non-voice portions of a previously-received audio stream for an identified device corresponding to the device identifier, wherein the device signature data comprises repetitive sounds, transient sounds, and volume issues; receiving the plurality of audio streams from the plurality of callers; analyzing the plurality of audio streams for voice data and for non-voice data; identifying a first entry in the caller database that corresponds to a first audio stream of the plurality of audio streams; identifying a first entry in the device signature database that corresponds to the first audio stream; and modifying the first audio stream based on the analyzed voice data and the voice characterization data in the first entry in the caller database and based on the analyzed non-voice data and the device signature data in the first entry in the device signature database. 12. The method of claim 11 further comprising: transcribing the voice data in the first audio stream to t

Assignees

Inventors

Classifications

  • using speech recognition · CPC title

  • with voice recognition means · CPC title

  • to the speaker · CPC title

  • H04L65/403Primary

    Arrangements for multi-party communication, e.g. for conferences (data switching systems for conference H04L12/18; arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities H04M3/56; television conferencing systems H04N7/15) · CPC title

  • H04M3/568Primary

    audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants (echo suppression in two-way loud-speaking telephone systems H04M9/02; sound field processing per se H04S7/30) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10652396B2 cover?
A stream server manages connections by multiple clients. The stream server receives, sends and manages streams for each client. The stream server analyzes incoming streams and maintains a database of device signatures and a database of user profiles based on the analyzed data. The stream server receives a stream from a client, analyzes the stream to identify an applicable device signature, user…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification H04L65/403. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue May 12 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).