Audio playback method, apparatus and system

US9313250B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9313250-B2
Application numberUS-201414261782-A
CountryUS
Kind codeB2
Filing dateApr 25, 2014
Priority dateJun 4, 2013
Publication dateApr 12, 2016
Grant dateApr 12, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An audio playback method is provided. The method includes identifying a captured audio data frame according to a type of the audio data frame and sending the identified audio data frame to an audio receiving end. The method also includes receiving the audio data frame that is identified according to the type of the audio data frame and determining the type of the audio data frame and evaluating network transmission status based on the identification. Further, the method includes adjusting a threshold value of a jitter buffer that is used to cache the audio data frame when the type of the audio data frame is a music frame and evaluation result of the network transmission status does not meet a preset transmission baseline condition.

First claim

Opening claim text (preview).

What is claimed is: 1. An audio playback method, comprising: identifying, by an audio sending end, a captured audio data frame according to a type of the audio data frame; sending, by the audio sending end, the identified audio data frame to an audio receiving end; receiving, by the audio receiving end, the audio data frame that is identified according to the type of the audio data frame; based on the identification, determining, by the audio receiving end, the type of the audio data frame and an evaluation result of network transmission status; and when the type of the audio data frame is a music frame and evaluation result of the network transmission status does not meet a preset transmission baseline condition, adjusting, by the audio receiving end, a threshold value of a jitter buffer that is used to cache the audio data frame; wherein identifying the captured audio data frame according to the type of the audio data frame further includes: calculating frame energy of the audio data frame; and comparing the frame energy of the audio data frame with a preset frame energy threshold value, wherein: when the frame energy is lower than the preset frame energy threshold value, the audio data frame is identified as a mute frame; when the frame energy is higher than the preset frame energy threshold value, the speech probability density Ps and the music probability density Pm of the audio data frame are further calculated; and when Ps/Pm is higher than a preset threshold value, the audio data frame is identified as a speech frame. 2. The method according to claim 1 , wherein: evaluating network transmission status further includes calculating a packet loss rate of audio data frames by a packet sequence number; and adjusting a threshold value of the jitter buffer further includes, when the type of the audio data frame is a music frame and the packet loss rate of the audio data frames is higher than a preset threshold of the packet loss rate, increasing the threshold value of the jitter buffer that is used to cache the audio data frame. 3. The method according to claim 1 , wherein: evaluating network transmission status further includes calculating a network jitter index by packet arrival time; and adjusting a threshold value of the jitter buffer further includes, when the type of the audio data frame is a music frame and the network jitter index is higher than a preset threshold of the network jitter index, increasing the threshold value of the jitter buffer that is used to cache the audio data frame. 4. The method according to claim 1 , wherein: evaluating network transmission status further includes calculating the packet loss rate of the audio data frame by the packet sequence number and calculating the network jitter index by the packet arrival time; and adjusting a threshold value of the jitter buffer that is used to cache the audio data frame further includes, when the type of the audio data frame is a music frame, the packet loss rate of the audio data frame is higher than the preset threshold of the packet loss rate, and the network jitter index is higher than the preset threshold of the network jitter index, increasing the threshold value of the jitter buffer that is used to cache the audio data frame. 5. The method according to claim 1 , further including: when Ps/Pm is not higher than the preset threshold value, jointly judging the audio data frame and identifying the audio data frame according to the type of the audio data frame. 6. An audio playback apparatus, comprising: one or more processors; memory; and one or more computer programs stored in the memory and to be executed by the one or more processors, the one or more programs comprising: an audio data frame receiving unit configured to receive an audio data frame, wherein the audio data frame is identified by an audio sending end according to a type of the audio data frame; a network transmission status evaluation unit configured to evaluate network transmission status of the audio data frame; a data frame type determination unit configured to, based on an identification of the audio data frame, determine a type of the audio data frame; and a jitter buffer adjustment unit configured to, when the type of the audio data frame is a music frame and evaluation result of the network transmission status does not meet a preset transmission baseline condition, adjust a threshold value of a jitter buffer that is used to cache the audio data frame; wherein identifying the audio data frame according to the type of the audio data frame further includes: calculating frame energy of the audio data frame; and comparing the frame energy of the audio data frame with a preset frame energy threshold value, wherein: when the frame energy is lower than the preset frame energy threshold value, the audio data frame is identified as a mute frame; when the frame energy is higher than the preset frame energy threshold value, the speech probability density Ps and the music probability density Pm of the audio data frame are further calculated; and when Ps/Pm is higher than a preset threshold value, the audio data frame is identified as a speech frame. 7. The apparatus according to claim 6 , wherein: the network transmission status evaluation unit is configured to calculate a packet loss rate of the audio data frame by a packet sequence number; and the jitter buffer adjustment unit is configured to, when the type of the audio data frame is a music frame and the packet loss rate of the audio data frame is higher than a preset threshold of the packet loss rate, increase the threshold value of the jitter buffer that is used to cache the audio data frame. 8. The apparatus according to claim 6 , wherein: the network transmission status evaluation unit is configured to calculate a network jitter index by packet arrival time; and the jitter buffer adjustment unit is configured to, when the type of the audio data frame is a music frame and the network jitter index is higher than a preset threshold of the network jitter index, increase the threshold value of the jitter buffer that is used to cache the audio data frame. 9. The apparatus according to claim 6 , wherein: the network transmission status evaluation unit is configured to calculate the packet loss rate of the audio data frame by the packet sequence number and calculate the network jitter index by the packet arrival time; and the jitter buffer adjustment unit is configured to, when the type of the audio data frame is a music frame, the packet loss rate of the audio data frame is higher than the preset threshold of the packet loss rate, and the network jitter index is higher than the preset threshold of the network jitter index, increase the threshold value of the jitter buffer that is used to cache the audio data frame. 10. An audio playback system, comprising: an audio sending apparatus; and an audio receiving apparatus, wherein: the audio sending apparatus is configured to: capture an audio data frame; identify the captured audio data frame according to a type of the audio data frame; and send the identified audio data frame to the audio receiving apparatus, and the audio receiving apparatus configured to: receive the identified audio data frame; determine the type of the audio data frame based on an identification; and evaluate network transmission status, wherein, when the type of the audio data frame is a music frame and an evaluation result of the network transmission status does not meet a preset transmission baseline condition, a threshold value of the jitter buffer that is used to cache the audio data frame is adjusted; wherein the audio sending apparatus is confi

Assignees

Inventors

Classifications

  • G10H1/0058Primary

    Transmission between separate instruments or between individual components of a musical system (G10H1/0083 takes precedence) · CPC title

  • H04L65/80Primary

    Responding to QoS · CPC title

  • Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments · CPC title

  • Jitter · CPC title

  • for semi-reliable protocols, e.g. for less sensitive applications such as streaming video (buffer level management for video bitstream receiver H04N21/44004) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9313250B2 cover?
An audio playback method is provided. The method includes identifying a captured audio data frame according to a type of the audio data frame and sending the identified audio data frame to an audio receiving end. The method also includes receiving the audio data frame that is identified according to the type of the audio data frame and determining the type of the audio data frame and evaluating…
Who is the assignee on this patent?
Tencent Tech Shenzhen Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10H1/0058. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 12 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).