Low-complexity packet loss concealment for transcoded audio signals

US11107481B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11107481-B2
Application number: US-201916379479-A
Country: US
Kind code: B2
Filing date: Apr 9, 2019
Priority date: Apr 9, 2018
Publication date: Aug 31, 2021
Grant date: Aug 31, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are described for concealing packet loss in a received audio stream. Packets of the audio stream may be received in a non-lapped transform domain format, where at least one packet is missing in the stream. The received packets are decoded, and each missing packet in the decoded stream is replaced by a reduced-energy signal block. Each reduced-energy signal block may also be modified at a beginning or ending boundary, and shifted such that a start or end of each missing packet does not coincide with a peak of a transform window of a lapped transform domain format. The raw audio signal may then be encoded into transform windows having the lapped transform domain format. Packet loss concealment may then be performed for selected transform windows that include modified reduced-energy blocks, either prior to transmission or after transmission by the receiving endpoint.
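
The first steps in the abstract (substitute a reduced-energy block for each missing packet, then shift the signal so that gap boundaries avoid window peaks) can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the patent: the function names, the use of all-zero samples as the reduced-energy block, the list-of-floats signal representation, and the assumption that window peaks of the lapped transform fall at sample positions that are multiples of the hop size.

```python
from typing import List, Optional


def fill_missing(blocks: List[Optional[List[float]]], block_len: int) -> List[float]:
    """Concatenate decoded blocks; a missing packet (None) becomes a zero block.

    Assumption: an all-zero block stands in for the "reduced-energy signal block".
    """
    out: List[float] = []
    for block in blocks:
        if block is None:
            out.extend([0.0] * block_len)
        else:
            if len(block) != block_len:
                raise ValueError("every received block must have block_len samples")
            out.extend(block)
    return out


def delay_signal(signal: List[float], delay: int) -> List[float]:
    """Shift the raw signal later in time by prepending `delay` zero samples."""
    return [0.0] * delay + list(signal)


def boundaries_avoid_peaks(block_len: int, hop: int, delay: int, n_blocks: int) -> bool:
    """True if no delayed block boundary lands on an assumed peak position.

    Assumption: peaks of the lapped-transform windows sit at multiples of `hop`.
    """
    return all((k * block_len + delay) % hop != 0 for k in range(n_blocks + 1))


# Three packets of 4 samples each; the middle packet was lost in transit.
received = [[1.0] * 4, None, [2.0] * 4]
raw = fill_missing(received, block_len=4)
shifted = delay_signal(raw, delay=2)  # half a hop, so boundaries fall between peaks
```

With a block length equal to the hop size, an unshifted signal would put every gap boundary exactly on an assumed peak; a half-hop delay moves the boundaries midway between peaks, which is one plausible reading of "does not coincide with a peak".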

First claim

Opening claim text (preview).

What is claimed is:

1. A method for concealing packet loss in a received audio stream, the method comprising: receiving, by an audio communication system comprising one or more servers, packets of an audio stream, the received audio stream being in a non-lapped transform domain format, the packets having a sequential order, where a block of packets is missing in the sequence, the block of packets including at least one packet, each packet of the audio stream having a received window size; decoding, by the audio communication system, the received packets to a sequence of blocks of a raw audio signal; replacing, by the audio communication system, each missing packet in the sequence with a reduced-energy signal block, whereby the sequence of blocks of the raw audio signal comprises at least one reduced-energy signal block, wherein the reduced-energy signal block has a lower energy in comparison to remaining blocks of the raw audio signal, such that the reduced-energy signal block is distinguishable from remaining blocks of the received raw audio signal; modifying, by the audio communication system, each reduced-energy signal block of the raw audio signal at one of a beginning boundary and an ending boundary; shifting, by the audio communication system, the raw audio signal by a predetermined delay such that a start and end of each reduced energy block of the raw audio signal does not coincide with a peak of a transform window of a lapped transform domain format; encoding, by the server, the raw audio signal into transform windows having the lapped transform domain format, wherein packet loss concealment is performed for a selected transform window of the encoded audio signal, the selected transform window having a modified reduced energy signal block located at a center portion; and transmitting, by the audio communication system, the encoded audio signal to an endpoint.

2. The method of claim 1, wherein the transmitted audio signal, when received by a listener endpoint, is decoded to output raw audio, the decoding comprising overlapping the transform window containing the packet loss concealment with adjacent transform windows containing received audio packets.

3. The method of claim 1, the packet loss concealment comprising replacing the selected transform window that includes the modified reduced-energy signal block with a copy of a previous transform window of the encoded audio signal.

4. The method of claim 3, the at least one missing packet including multiple consecutive packets in the sequence, where the copy of the previous transform window is used to replace each transform window that includes the missing multiple consecutive packets.

5. The method of claim 3, the at least one missing packet including multiple consecutive packets in the sequence, where a copy of two transform windows preceding the multiple consecutive missing packets is used to replace one or more pairs of transform windows that include the multiple consecutive packets.

6. The method of claim 3, the at least one missing packet including multiple consecutive packets in the sequence, where the selected transform window has a decreased gain compared to a previous transform window.

7. The method of claim 1, the modifying each reduced-energy signal block comprising replacing a portion of each reduced-energy signal block near the ending boundary with a time-reversed copy of the raw audio signal packet immediately following each reduced-energy signal block.

8. The method of claim 7, wherein the portion of each reduced-energy signal block has a duration of 1/4 a length of a transform window size.

9. The method of claim 1, the modifying each reduced-energy signal block comprising replacing a portion of each reduced-energy signal block near the beginning boundary with a time-reversed copy of a raw audio signal packet immediately preceding each reduced-energy signal block.

10. The method of claim 1, the modifying each reduced-energy signal block comprising replacing a portion of each reduced-energy signal block near the beginning boundary with a signal represented as g*(x0 − x1_reversed), where g is a gain used in the packet loss concealment, x0 is a raw audio signal segment, having ¼ of a transform window length, occurring ½ the transform window length before the block of silence, and x1_reversed is a time-reversed copy of a raw audio signal segment, having ¼ the transform window length, immediately preceding the reduced-energy signal block.

11. The method of claim 1, wherein each transform window of the lapped transform domain format comprises a plurality of audio samples, audio samples near boundaries of each transform window being encoded with a lower gain than audio samples near a center of each transform window.

12. The method of claim 1 the performing packet loss concealment further comprising at least one of dithering, decorrelation, and stochastic processing to reduce correlation and potential artifacts due to the encoding into the lapped transform domain format.

13. The method of claim 1, where the encoding the raw audio signal further comprises generating metadata to identify presence of transform windows that include packet loss concealment-generated data, the method further comprising transmitting the generated metadata for downstream decisions about decoding or rendering the encoded audio signal.

14. A computer program product comprising a non-transitory computer-readable medium storing computer-readable program code that, when executed by one or more processors, causes the one or more processors to perform operations comprising: receiving packets of an audio stream, the received audio stream being in a non-lapped transform domain format, the packets having a sequential order, where at least one packet is missing in the sequence, each packet having a received window size; decoding the received packets to a sequence of blocks of a raw audio signal; replacing each missing packet in the sequence with a reduced-energy signal block, whereby the sequence of blocks of the raw audio signal comprises at least one reduced-energy signal block, wherein the reduced-energy signal block has a lower energy in comparison to remaining blocks of the raw audio signal, such that the reduced-energy signal block is distinguishable from remaining blocks of the received raw audio signal; modifying each reduced-energy signal block of the raw audio signal at one of a beginning boundary and an ending boundary of inserted blocks; shifting the raw audio signal by a predetermined delay such that a start and end of each reduced energy block of the raw audio signal does not coincide with a peak of a transform window of a lapped transform domain format; encoding the raw audio signal into transform windows having the lapped transform domain format, packet loss concealment being performed for a selected transform window, of the encoded audio signal, having a modified reduced energy signal block located at a center portion of the selected transform window; and transmitting the encoded audio signal.

15. A system for concealing packet loss in a received audio stream, the system comprising: a communications transceiver that receives packets of an audio stream, the received audio stream being in a non-lapped transform domain format, the packets having a sequential order, where at least one packet is missing in the sequence, each packet having a received window size; a decoder that decodes the received packets from the non-lapped format to a sequence of blocks of a raw audio signal and replaces each missing packet in the sequence with a reduced-energy signal b
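
Two of the dependent claims describe concrete signal operations: the boundary signal g*(x0 − x1_reversed) of claim 10, and the copy-previous-window concealment of claims 3 and 6. The Python sketch below illustrates one possible reading of each. The function names, the list-based signal representation, the `decay` parameter, and the interpretation that x0 starts exactly half a window length before the gap are all assumptions made for illustration; the claim text does not fix these details.

```python
from typing import List, Set


def modify_gap_start(signal: List[float], gap_start: int,
                     window_len: int, gain: float) -> List[float]:
    """Overwrite the first quarter-window of a gap with gain * (x0 - x1_reversed).

    Assumed reading of claim 10:
      x0 = quarter-window segment starting half a window before the gap
      x1 = quarter-window segment immediately preceding the gap
    """
    quarter = window_len // 4
    half = window_len // 2
    if gap_start < half or gap_start + quarter > len(signal):
        raise ValueError("not enough samples around the gap")
    x0 = signal[gap_start - half: gap_start - half + quarter]
    x1_reversed = signal[gap_start - quarter: gap_start][::-1]
    out = list(signal)
    out[gap_start: gap_start + quarter] = [
        gain * (a - b) for a, b in zip(x0, x1_reversed)
    ]
    return out


def conceal_windows(windows: List[List[float]], lost: Set[int],
                    decay: float = 0.5) -> List[List[float]]:
    """Replace each lost window with a scaled copy of the previous output window.

    decay = 1.0 gives a plain copy (claim 3); decay < 1.0 makes each
    consecutive concealed window quieter than the one before it (claim 6).
    """
    out: List[List[float]] = []
    for index, window in enumerate(windows):
        if index in lost and out:
            out.append([decay * coeff for coeff in out[-1]])
        else:
            # Received window, or a lost first window with nothing to copy from.
            out.append(list(window))
    return out


# Eight received samples followed by an eight-sample silent gap.
signal = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0] + [0.0] * 8
patched = modify_gap_start(signal, gap_start=8, window_len=8, gain=0.5)

# Four transform windows; windows 1 and 2 contain the lost packets.
coded = [[1.0, 2.0], [0.0, 0.0], [0.0, 0.0], [3.0, 4.0]]
concealed = conceal_windows(coded, lost={1, 2}, decay=0.5)
```

Scaling from the previous output window, not from the last received window, means the attenuation compounds over a run of losses, so a long gap fades out instead of repeating at constant level.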

Assignees

Dolby Laboratories Licensing Corp

Inventors

Classifications

  • Media network packetisation · CPC title

  • using orthogonal transformation · CPC title

  • G10L19/173 · Primary

    Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding · CPC title

  • Responding to QoS · CPC title

  • Arrangements for multi-party communication, e.g. for conferences (data switching systems for conference H04L12/18; arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities H04M3/56; television conferencing systems H04N7/15) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11107481B2 cover?
Systems and methods are described for concealing packet loss in a received audio stream. Packets of the audio stream may be received in a non-lapped transform domain format, where at least one packet is missing in the stream. The received packets are decoded, and each missing packet in the decoded stream is replaced by a reduced-energy signal block. Each reduced-energy signal block may also be …
Who is the assignee on this patent?
Dolby Laboratories Licensing Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/173. Mapped technology areas include Physics.
When was this patent published?
Publication date Aug 31, 2021 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).