Audio stream mixing system and method

US10747497B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10747497-B2
Application numberUS-201916663426-A
CountryUS
Kind codeB2
Filing dateOct 25, 2019
Priority dateAug 8, 2018
Publication dateAug 18, 2020
Grant dateAug 18, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided are a system and method of mixing a second audio stream with a first audio stream in an audio output device. The system is configured to execute the method, comprising buffering and outputting the first audio stream via the audio output device as unmodified output, determining at least one insertion spot within the first audio stream, modifying the first audio stream at an insertion spot to avoid content loss, outputting the second audio stream at the insertion spot, and resuming unmodified output of the first audio stream at or near a completion of the second audio stream. Modifying the first audio stream can include pausing and/or warping the first audio stream at the insertion spot. The audio output device can be a vehicle head unit or a wireless device, such as a mobile phone.

First claim

Opening claim text (preview).

What is claimed is: 1. An audio stream mixing system, comprising: one or more processors coupled to one or more computer storage devices, one or more first audio stream sources, one or more second audio stream sources, and one or more audio output devices, wherein the one or more processors are configured to: buffer the first audio stream from a first audio stream source in the one or more computer storage devices; determine an insertion spot within the first audio stream; output the buffered first audio stream via the one or more audio output devices, wherein the one or more processors are configured to modify output of at least a portion of the first audio stream to accommodate output of a second audio stream at the insertion spot to minimize or avoid content degradation or loss of the first audio stream, the second audio stream received from a second audio stream source; output the second audio stream via the one or more audio output devices at the insertion spot; and continue output of the buffered first audio stream via the one or more audio output devices after completion of the second audio stream. 2. The system of claim 1 , wherein the second audio stream is received during output of the first audio stream. 3. The system of claim 1 , wherein the one or more processors are configured to buffer the first audio stream in response to receipt of the second audio stream. 4. The system of claim 1 , wherein the first audio stream is a radio stream. 5. The system of claim 1 , wherein the first audio stream is playback of content from a tangible storage medium local to the audio output device. 6. The system of claim 5 , wherein the tangible storage medium is a compact disc, unified serial bus medium, hard drive, or a computer memory. 7. The system of claim 1 , wherein the second audio stream is received by the audio output device with an urgency or maximum delay indicator. 8. The system of claim 7 , wherein the one or more processors are configured to: identify a maximum delay for output of the second audio stream based on the urgency or maximum delay indicator. 9. The system of claim 7 , wherein the one or more processors are configured to: determine the insertion spot based, at least in part, on the urgency or maximum delay indicator. 10. The system of claim 1 , wherein the one or more processors are configured to find a gap or pause within the first audio stream to determine the insertion spot. 11. The system of claim 1 , wherein the one or more processors are configured to analyze the first audio stream using one or more speech analysis techniques to find ends of sentences, phrases, words, or other natural points of interruption to determine the insertion spot. 12. The system of claim 11 , wherein the one or more speech analysis techniques includes at least one of Voice Activity Detection (VAD), Automatic Speech Recognition (ASR), and Natural Language Understanding (NLU). 13. The system of claim 1 , wherein the one or more processors are configured to find a low volume level within the first audio stream to determine the insertion spot. 14. The system of claim 1 , wherein the one or more processors are configured to pause the first audio stream at the insertion spot as a modification of the first audio stream. 15. The system of claim 1 , wherein the one or more processors are configured to modify a time-frequency structure of the first audio stream as a modification of the first audio stream. 16. The system of claim 1 , wherein the one or more processors are configured to warp the first audio stream at or near the insertion spot as a modification to the first audio stream. 17. The system of claim 1 , wherein the one or more processors are configured to modify the second audio stream and output the second audio stream as a modified second audio stream at the insertion spot. 18. The system of claim 17 , wherein the modified second audio stream includes a modified time-frequency structure. 19. The system of claim 17 , wherein the one or more processors are configured to modify a voice style of the second audio stream to improve intelligibility relative to the first audio stream. 20. The system of claim 1 , wherein the audio output device is or forms part of a vehicle head unit. 21. The system of claim 20 , wherein the second audio stream is an announcement or an alert from a vehicle navigation system, vehicle monitoring system, or a text to speech system. 22. The system of claim 1 , wherein the second audio stream is an announcement or an alert from an advertising system. 23. The system of claim 1 , wherein the audio output device is a wireless portable device comprising a mobile phone, tablet, or phablet.

Assignees

Inventors

Classifications

  • Cross-faders therefor · CPC title

  • Detection of presence or absence of voice signals (switching of direction of transmission by voice frequency in two-way loud-speaking telephone systems H04M9/10) · CPC title

  • specially adapted for particular use · CPC title

  • Time compression or expansion · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10747497B2 cover?
Provided are a system and method of mixing a second audio stream with a first audio stream in an audio output device. The system is configured to execute the method, comprising buffering and outputting the first audio stream via the audio output device as unmodified output, determining at least one insertion spot within the first audio stream, modifying the first audio stream at an insertion sp…
Who is the assignee on this patent?
Cerence Operating Co
What technology area does this patent fall under?
Primary CPC classification G06F3/165. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 18 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).