Context awareness in auditory browsing

US9483573B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9483573-B2
Application numberUS-201213708142-A
CountryUS
Kind codeB2
Filing dateDec 7, 2012
Priority dateDec 7, 2012
Publication dateNov 1, 2016
Grant dateNov 1, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In a method for generating an audio summary of a portion of an electronic document, a user input selecting a focus position within a rendered electronic document is received. The plurality of document elements included in the rendered electronic document is identified. A plurality of audio objects corresponding to the plurality of document elements is generated. An audio signal is generated. The audio signal includes a subset of the plurality of audio objects corresponding to a subset of the plurality of document elements contained within a predetermined range from the focus position. The audio signal indicates the spatial relation between the elements of the elements subset. The audio signal is rendered to the user.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for generating an audio summary of a portion of an electronic document, the method comprising: receiving, by a computer, an electronic document; identifying, by the computer, content and location of a plurality of document elements included in the electronic document; creating a Document Object Model (DOM) that hierarchically organizes the document elements; classifying one or more pluralities of the document elements into respective collections based on the hierarchical organization of each plurality of document elements in the DOM, each collection having an associated name; generating audio objects corresponding to text information within each of the document elements and each collection name; rendering the electronic document for display on a display screen, and displaying the rendered document on a display screen; receiving user input indicating a focus position within the displayed electronic document; generating an audio signal that includes one or more of the audio objects based on the proximity to the focus position of the corresponding document elements within a predetermined range from the focus position, wherein if a collection of document elements is located near the boundary of the predetermined range, the audio object corresponding to the collection name will be included in the audio signal, and if the collection of document elements is in the immediate vicinity of the focus position, the audio objects corresponding to the document elements will be included in the audio signal; and rendering the audio signal to the user. 2. The method of claim 1 , wherein the audio signal comprises a multi channel signal. 3. The method of claim 2 , wherein rendering the audio signal further comprises simultaneously rendering the audio subset using multiple channels. 4. The method of claim 1 , wherein generating the plurality of audio objects further comprises associating at least one rendering parameter with each of the plurality of audio objects and wherein generating the audio signal comprises adjusting the at least one rendering parameter for the audio subset based on the spatial relation between elements of the elements subset. 5. The method of claim 4 , wherein the at least one rendering parameter includes at least one of volume, tempo, treble, bass, or stereo-width. 6. The method of claim 1 , wherein the predetermined range from the focus position is defined by a circle having a center at the focus position. 7. The method of claim 1 , wherein the computing device comprises a mobile computing device. 8. The method of claim 1 , wherein receiving user input indicating a focus position further comprises: receiving user input from a contact-sensitive display in response to a finger touch to the display; and determining a focus position and a direction in which the finger touch points from the user input; wherein the predetermined range is conical-shaped emanating a distance from the focus position in the direction in which the finger touch points. 9. The method of claim 1 , wherein the predetermined range from the focus position extends beyond the visible portion of the rendered electronic document. 10. A computer program product for generating an audio summary of a portion of an electronic document, the computer program product comprising one or more computer-readable tangible storage devices and a plurality of program instructions stored on at least one of the one or more computer-readable tangible storage devices, the plurality of program instructions comprising: program instructions to receive an electronic document; program instructions to identify content and location of a plurality of document elements included in the electronic document; program instructions to create a Document Object Model (DOM) that hierarchically organizes the document elements; program instructions to classify one or more pluralities of the document elements into respective collections based on the hierarchical organization of each plurality of document elements in the DOM, each collection having an associated name; program instructions to generate of audio objects corresponding to the text information within each of the document elements and each collection name; program instructions to render the electronic document for display on a display screen, and displaying the rendered document on a display screen; program instructions to receive user input indicating a focus position within the displayed electronic document; program instructions to generate an audio signal that includes one or more of the based on the proximity to the focus position of the corresponding document elements within a predetermined range from the focus position, wherein if a collection of document elements is located near the boundary of the predetermined range, the audio object corresponding to the collection name will be included in the audio signal, and if the collection of document elements is in the immediate vicinity of the focus position, the audio objects corresponding to the document elements will be included in the audio signal; and program instructions to render the audio signal to the user. 11. The computer program product of claim 10 , wherein the audio signal comprises a multi channel signal. 12. The computer program product of claim 11 , wherein the program instructions to render the audio signal comprise program instructions to simultaneously render the audio subset using multiple channels. 13. The computer program product of claim 10 , wherein the program instructions to generate the plurality of audio objects further comprise program instructions to associate at least one rendering parameter with each of the plurality of audio objects and wherein the program instructions to generate the audio signal comprise program instructions to adjust the at least one rendering parameter for the audio subset based on the spatial relation between elements of the elements subset. 14. The computer program product of claim 13 , wherein the at least one rendering parameter includes at least one of volume, tempo, treble, bass, or stereo-width. 15. The computer program product of claim 10 , wherein the predetermined range from the focus position is defined by a circle having a center at the focus position. 16. The computer program product of claim 10 , wherein the program instructions to receiving user input indicating a focus position further comprises: program instructions to receive user input from a contact-sensitive display in response to a finger touch to the display; and program instructions to determine a focus position and a direction in which the finger touch points from the user input; wherein the predetermined range is conical-shaped emanating a distance from the focus position in the direction in which the finger touch points. 17. The computer program product of claim 10 , wherein the predetermined range from the focus position extends beyond the visible portion of the rendered electronic document. 18. A computer system for generating an audio summary of a portion of an electronic document, the computer system comprising one or more processors, one or more computer-readable tangible storage devices, and a plurality of program instructions stored on at least one of the one or more storage devices for execution by at least one of the one or more processors, the plurality of program instructions comprising: program instructions to receive an electronic document; program instructions to identify content and location of a plurality of document el

Assignees

Inventors

Classifications

  • G06F16/957Primary

    Browsing optimisation, e.g. caching or content distillation · CPC title

  • Speech synthesis; Text to speech systems · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9483573B2 cover?
In a method for generating an audio summary of a portion of an electronic document, a user input selecting a focus position within a rendered electronic document is received. The plurality of document elements included in the rendered electronic document is identified. A plurality of audio objects corresponding to the plurality of document elements is generated. An audio signal is generated. Th…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/957. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 01 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).