System and method for intelligent configuration of an audio channel with background analysis

US9930085B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9930085-B2
Application numberUS-201514867495-A
CountryUS
Kind codeB2
Filing dateSep 28, 2015
Priority dateSep 28, 2015
Publication dateMar 27, 2018
Grant dateMar 27, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems and computer program products for configuring an audio channel are provided. Aspects include generating a confidence metric indicative of at least one control cue in a telecommunication audio feed input. Generating the confidence metric can include analyzing the control cue to determine a cue type, assigning a confidence metric value for the control cue based on the cue type, and comparing the confidence metric value to a predetermined threshold value associated with the cue type. Aspects also include updating a context history with the cue type and configuring an audio channel output based on the confidence metric and context history.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for configuring an audio channel with a processor, the method comprising: generating a confidence metric indicative of at least one control cue in a telecommunication audio feed, wherein generating the confidence metric comprises: analyzing the at least one control cue to determine a cue type; assigning a confidence metric value for the at least one control cue based on the cue type, wherein the cue type comprises both explicit speech to perform an action and muffled voice having a lower amplitude than the average amplitude of other portions of the audio feed; comparing the confidence metric value to a predetermined threshold value associated with the cue type; updating a context history with the cue type and the confidence metric value; and configuring an input of the audio channel based on the confidence metric and the context history. 2. The method of claim 1 , wherein the processor configures the input of the audio channel by one of muting the input and unmuting the input. 3. The method of claim 1 , wherein analyzing the at least one control cue comprises: identifying at least one word verbalized on the telecommunication audio feed; and determining a relative context of the conversation based on the at least one word. 4. The method of claim 1 , wherein the at least one control cue comprises one or more of an input volume, an output volume, a spoken word, a series of spoken words having a conversational context, accelerometer data, photoelectric data, and a global positioning system (GPS) data. 5. The method of claim 1 , wherein the cue type is a predetermined time period of high or low volume, wherein the volume is an amplitude measured with respect to an average amplitude. 6. The method of claim 1 , wherein the at least one control cue is a verbalized name of a telecommunication participant in the telecommunication audio feed. 7. The method of claim 1 , wherein the at least one control cue is silence for a predetermined period of time. 8. The method of claim 1 , wherein the at least one control cue is one or more of a low volume word and a combination of words having a predetermined relevance value. 9. The method of claim 8 , wherein the predetermined relevance value is based on a context of a conversation on the telecommunication audio feed. 10. The method of claim 1 , wherein the at least one control cue is a combination of an accelerometer reading and an input volume. 11. The method of claim 1 , wherein creating the context history is indicative of one or more control cues associated with a particular user. 12. The method of claim 1 , wherein the confidence metric comprises a plurality of control cues, and the confidence metric value increases with respect to a greater number of control cues. 13. The method of claim 1 , further comprising outputting an audio cue indicative that the processor has configured the input to the audio channel. 14. The method of claim 1 , wherein the processor prompts for user input regarding a cue sensitivity setting, and the confidence metric value is based in part on the cue sensitivity setting. 15. The method of claim 1 , wherein the processor prompts for user input indicative of a situational context and a preferred response to the situational context. 16. The method of claim 1 , wherein the context history is indicative of a situational context and a user's response to the situational context. 17. The method of claim 16 , wherein the context history is based on a plurality of telecommunication audio feeds. 18. A system for configuring an audio channel, the system comprising an audio input device configured to receive a telecommunication audio feed input; and a processor operatively connected to the audio input device and configured to: monitor the audio feed input; generate a confidence metric indicative of at least one control cue in the audio feed; determine a cue type, wherein the cue type comprises both explicit speech to perform an action and muffled voice having a lower amplitude than the average amplitude of other portions of the audio feed; assign a confidence metric value for the at least one control cue based on the cue type; compare the confidence metric value to a predetermined threshold value associated with the cue type; update a context history with the cue type and the confidence metric value; and configure an input of the audio channel based on the confidence metric and the context history. 19. A non-transitory computer-readable storage medium storing a computer program product executable to perform a method, the method including: generating a confidence metric indicative of at least one control cue in a telecommunication audio feed, wherein generating the confidence metric comprises: analyzing the at least one control cue to determine a cue type; assigning a confidence metric value for the at least one control cue based on the cue type; comparing the confidence metric value to a predetermined threshold value associated with the cue type, wherein the cue type comprises both explicit speech to perform an action and muffled voice having a lower amplitude than the average amplitude of other portions of the audio feed; updating a context history with the cue type and the confidence metric value; and configuring an input of the audio channel based on the confidence metric and the context history.

Assignees

Inventors

Classifications

  • Electricity · mapped topic

  • H04L65/601Primary

    Electricity · mapped topic

  • for computer conferences, e.g. chat rooms (instant messaging H04L51/04; protocols for multimedia communication H04L65/1101; arrangements for multi-party communication H04L65/403; telephonic conference arrangements H04M3/56; television conference systems H04N7/15) · CPC title

  • audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants (echo suppression in two-way loud-speaking telephone systems H04M9/02; sound field processing per se H04S7/30) · CPC title

  • Media network packet handling · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9930085B2 cover?
Methods, systems and computer program products for configuring an audio channel are provided. Aspects include generating a confidence metric indicative of at least one control cue in a telecommunication audio feed input. Generating the confidence metric can include analyzing the control cue to determine a cue type, assigning a confidence metric value for the control cue based on the cue type, a…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification H04L65/601. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Mar 27 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).