Systems and methods for providing audio to a user based on gaze input
US-2016132289-A1 · May 12, 2016 · US
US9740452B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9740452-B2 |
| Application number | US-201514859945-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 21, 2015 |
| Priority date | Aug 23, 2013 |
| Publication date | Aug 22, 2017 |
| Grant date | Aug 22, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
According to the invention, a method for providing audio to a user is disclosed. The method may include determining, with an eye tracking device, a gaze point of a user on a display. The method may also include causing, with a computer system, an audio device to produce audio to the user, where content of the audio may be based at least in part on the gaze point of the user on the display.
Opening claim text (preview).
What is claimed is: 1. A non-transitory machine readable medium with instructions stored thereon, the instructions executable by at least one processor for at least: determining a gaze point of a user on a display; determining a virtual distance and a virtual direction from each of a plurality of virtual sound sources to the gaze point of the user on the display; determining a number of sound channels produced by an audio device; determining an area of the display associated with each sound channel; and causing the audio device to produce audio, wherein the audio is based on at least one sound channel associated with an area of the display device in which the gaze point of the user is located, and each virtual sound source is produced in a manner based at least in part on the virtual distance and the virtual direction from each virtual sound source to the gaze point of the user. 2. The non-transitory machine readable medium of claim 1 , wherein each virtual sound source being produced in a manner based at least in part on the virtual distance and the virtual direction from each virtual sound source to the gaze point of the user comprises: varying a volume level of each virtual sound source based on the virtual distance of each virtual sound source to the gaze point of the user. 3. The non-transitory machine readable medium of claim 1 , wherein the virtual direction comprises at least one selection from a group consisting of: a X-direction; a Y-direction; and a Z-direction. 4. The non-transitory machine readable medium of claim 1 , wherein each virtual sound source being produced in a manner based at least in part on the virtual distance and the virtual direction from each virtual sound source to the gaze point of the user comprises: producing multiple virtual audio sources at multiple dynamic volumes. 5. The non-transitory machine readable medium of claim 1 , wherein the instructions are further executable by at least one processor for at least: determining head information associated with the user; and wherein content of the audio is further based at least in part on the head information associated with the user. 6. The non-transitory machine readable medium of claim 5 , wherein head information comprises at least one selection from a group consisting of: position of the user's head; angle of the user's head; orientation of the user's head; and size of the user's head. 7. The non-transitory machine readable medium of claim 5 , wherein determining head information associated with the user comprises: determining a position and/or orientation of the user's eyes. 8. A system, wherein the system comprises: an eye tracking device for at least determining a gaze point of a user on a display; and a processor for at least: determining a virtual distance and direction from each of a plurality of virtual sound sources to the gaze point of the user on the display; determining a number of sound channels produced by an audio device; determining an area of the display associated with each sound channel; and causing the audio device to produce audio, wherein the audio is based on at least one sound channel associated with an area of the display device in which the gaze point of the user is located, and each virtual sound source is produced in a manner based at least in part on the virtual distance and direction from each virtual sound source to the gaze point of the user. 9. The system of claim 8 , wherein each virtual sound source being produced in a manner based at least in part on the virtual distance and the virtual direction from each virtual sound source to the gaze point of the user comprises: varying a volume level of each virtual sound source based on the virtual distance of each virtual sound source to the gaze point of the user. 10. The system of claim 8 , wherein the virtual direction comprises at least one selection from a group consisting of: a X-direction; a Y-direction; and a Z-direction. 11. The system of claim 8 , wherein each virtual sound source being produced in a manner based at least in part on the virtual distance and the virtual direction from each virtual sound source to the gaze point of the user comprises: producing multiple virtual audio sources at multiple dynamic volumes. 12. The system of claim 8 , wherein the processor is further for at least: determining head information associated with the user; and wherein content of the audio is further based at least in part on the head information associated with the user. 13. The system of claim 12 , wherein head information comprises at least one selection from a group consisting of: position of the user's head; angle of the user's head; orientation of the user's head; and size of the user's head. 14. The system of claim 12 , wherein determining head information associated with the user comprises: determining a position and/or orientation of the user's eyes. 15. A method, comprising: determining a gaze point of a user on a display; determining a virtual distance and direction from each of a plurality of virtual sound sources to the gaze point of the user on the display; determining a number of sound channels produced by an audio device; determining an area of the display associated with each sound channel; and causing the audio device to produce audio, wherein the audio is based on at least one sound channel associated with an area of the display device in which the gaze point of the user is located, and each virtual sound source is produced in a manner based at least in part on the virtual distance and direction from each virtual sound source to the gaze point of the user. 16. The method of claim 15 , wherein each virtual sound source being produced in a manner based at least in part on the virtual distance and the virtual direction from each virtual sound source to the gaze point of the user comprises: varying a volume level of each virtual sound source based on the virtual distance of each virtual sound source to the gaze point of the user. 17. The method of claim 15 , wherein the virtual direction comprises at least one selection from a group consisting of: a X-direction; a Y-direction; and a Z-direction. 18. The method of claim 15 , wherein each virtual sound source being produced in a manner based at least in part on the virtual distance and the virtual direction from each virtual sound source to the gaze point of the user comprises: producing multiple virtual audio sources at multiple dynamic volumes. 19. The method of claim 15 , wherein the method further comprises: determining head information associated with the user; and wherein content of the audio is further based at least in part on the head information associated with the user. 20. The method of claim 19 , wherein determining head information associated with the user comprises: determining a position and/or orientation of the user's eyes.
Tracking of listener position or orientation · CPC title
Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved · CPC title
Eye tracking input arrangements (G06F3/015 takes precedence) · CPC title
Electronic adaptation dependent on speaker or headphone connection · CPC title
Head tracking input arrangements · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.