Synchronizing recorded audio content and companion content

US9697871B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9697871-B2
Application numberUS-201213570179-A
CountryUS
Kind codeB2
Filing dateAug 8, 2012
Priority dateMar 23, 2011
Publication dateJul 4, 2017
Grant dateJul 4, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Aspects of the present disclosure relate to synchronously presenting companion content, such as text content of an electronic book, while recording or presenting narration audio content spoken by a narrator. For example, recorded audio content may be received that corresponds to words of the companion content as spoken by a narrator. The recorded audio content may be received at least substantially in real time as the words are spoken. Content synchronization information for the recorded audio content and the text content may be generated, where the content synchronization information maps portions of the recorded audio content to corresponding portions of the text content. The audio content and the text content may be synchronously presented to a user based at least in part on the content synchronization information.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: as implemented by one or more computing devices configured with specific executable instructions, receiving audio content that includes words read aloud from published digital content, wherein the audio content is received at least substantially in real time as the words are read aloud; generating content synchronization information for the received audio content and the published digital content, wherein the content synchronization information associates portions of the audio content with corresponding portions of the published digital content; and causing visual presentation of the published digital content in synchronization with the received audio content as the audio content is received, wherein causing the visual presentation includes changing portions of the published digital content presented for display to correspond to audio portions received in the audio content, wherein the published digital content is presented in synchronization with the received audio content based at least in part on the generated content synchronization information. 2. The computer-implemented method of claim 1 , wherein the audio content is received as a data stream. 3. The computer-implemented method of claim 1 , further comprising retrieving the published digital content from a data store. 4. The computer-implemented method of claim 1 , further comprising maintaining synchronization of a current position in the presented published digital content as additional audio content is received. 5. The computer-implemented method of claim 4 , wherein the current position is visually indicated in the presentation of the published digital content. 6. The computer-implemented method of claim 1 , wherein the audio content is received via a microphone or other audio recording device. 7. The computer-implemented method of claim 1 , wherein the audio content is received from another computing device. 8. The computer-implemented method of claim 7 , further comprising aurally presenting the received audio content in synchronization with the visual presentation of the published digital content. 9. The computer-implemented method of claim 1 , further comprising generating for display additional information regarding at least one word in the published digital content based at least in part on a determination that the at least one word was not read correctly in the audio content, wherein the additional information includes at least one of pronunciation information or a definition. 10. A computer readable, non-transitory storage medium storing computer-executable instructions that, when executed by a computer system, configure the computer system to perform operations comprising: retrieving published digital content from a data store, the published digital content including text content; receiving streaming audio content associated with the published digital content; receiving content synchronization information for the audio content and the published digital content, wherein the content synchronization information associates portions of the audio content with corresponding portions of the published digital content; and generating for display portions of the published digital content in synchronization with aural presentation of the streaming audio content as the streaming audio content is received, wherein the portions of the published digital content displayed are changed to correspond to portions of the streaming audio content aurally presented, wherein the published digital content is presented in synchronization with the streaming audio content based at least in part on the received content synchronization information. 11. The computer readable, non-transitory storage medium of claim 10 , wherein the instructions further configure the computer system to visually distinguish a portion of the published digital content to indicate that the portion corresponds to a current position in the streaming audio content. 12. The computer readable, non-transitory storage medium of claim 11 , wherein visually distinguishing a portion of the published digital content comprises highlighting, emboldening or underlining text content. 13. The computer readable, non-transitory storage medium of claim 11 , wherein the visually distinguished portion comprises at least one of a word, syllable, letter, sentence, line or paragraph.

Assignees

Inventors

Classifications

  • G11B27/11Primary

    by using information not detectable on the record carrier · CPC title

  • with both visual and audible presentation of the material to be studied · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • Indexing; Addressing; Timing or synchronising; Measuring tape travel · CPC title

  • for reading, e.g. e-books (constructional details of portable computers G06F1/1613) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9697871B2 cover?
Aspects of the present disclosure relate to synchronously presenting companion content, such as text content of an electronic book, while recording or presenting narration audio content spoken by a narrator. For example, recorded audio content may be received that corresponds to words of the companion content as spoken by a narrator. The recorded audio content may be received at least substanti…
Who is the assignee on this patent?
Hwang Douglas C, Story Jr Guy A, Audible Inc
What technology area does this patent fall under?
Primary CPC classification G11B27/11. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 04 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).