Information processing device, information processing method, and information processing program

US11595772B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11595772-B2
Application numberUS-201917282705-A
CountryUS
Kind codeB2
Filing dateOct 3, 2019
Priority dateOct 10, 2018
Publication dateFeb 28, 2023
Grant dateFeb 28, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An information processing device (100) according to the present disclosure includes: an acquisition unit (141) configured to acquire a first image including a content image of an ear of a user; and a calculation unit (142) configured to calculate, based on the first image acquired by the acquisition unit (141), a head-related transfer function corresponding to the user by using a learned model having learned to output a head-related transfer function corresponding to an ear when an image including a content image of the ear is input.

First claim

Opening claim text (preview).

The invention claimed is: 1. An information processing device comprising: circuitry configured to function as: an acquisition unit configured to acquire a first image including a content image of an ear of a user; a calculation unit configured to calculate, based on the first image acquired by the acquisition unit, a head-related transfer function corresponding to the user by using a learned model having learned to output a head-related transfer function corresponding to an ear when an image including a content image of the ear is input; and a first learning unit configured to generate an ear parameter estimation model by learning a relation between a plurality of ear images obtained by changing: texture of three-dimensional data of the ear or a head, or luminance in rendering, and an ear parameter common to the ear images, wherein the texture includes a skin color of the ear or of the head. 2. The information processing device according to claim 1 , wherein the acquisition unit acquires an ear parameter that is a variable representing a characteristic of the ear included in the first image, and the calculation unit calculates the head-related transfer function corresponding to the user by inputting the ear parameter to the learned model. 3. The information processing device according to claim 2 , wherein the acquisition unit acquires the ear parameter of the ear included in the first image by using an ear parameter estimation model having learned to output an ear parameter corresponding to an ear when an image including a content image of the ear is input. 4. The information processing device according to claim 3 , wherein the first learning unit is configured to generate the ear parameter estimation model by learning a relation between an image including a content image of an ear and an ear parameter of the ear. 5. The information processing device according to claim 4 , wherein the first learning unit generates the ear parameter estimation model by learning a relation between the ear parameter and an ear image obtained by rendering three-dimensional data of the ear generated based on the ear parameter. 6. The information processing device according to claim 5 , wherein the first learning unit generates the ear parameter estimation model by learning the relation between the plurality of ear images obtained by changing a camera angle in rendering. 7. The information processing device according to claim 4 , further comprising a second learning unit configured to generate the learned model by learning a relation between an image including a content image of an ear and a head-related transfer function corresponding to the ear. 8. The information processing device according to claim 7 , wherein the second learning unit performs acoustic simulation for three-dimensional data obtained by synthesizing three-dimensional data of the ear generated based on the ear parameter and three-dimensional data of a head, and generates the learned model by learning a relation between a head-related transfer function obtained through the acoustic simulation and the ear parameter. 9. The information processing device according to claim 8 , wherein the second learning unit compresses an information amount of the head-related transfer function obtained through the acoustic simulation, and generates the learned model by learning a relation between the compressed head-related transfer function and the ear parameter. 10. The information processing device according to claim 8 , wherein the second learning unit sets a hearing point of three-dimensional data of the ear generated based on the ear parameter, and performs the acoustic simulation by using the set hearing point. 11. The information processing device according to claim 1 , further comprising a preprocessing unit configured to specify a content image of an ear of the user in a second image including a content image of the entire head of the user, and detect a specified range as the first image, wherein the acquisition unit acquires the first image detected by the preprocessing unit. 12. The information processing device according to claim 11 , wherein the preprocessing unit specifies the range based on a relation between a feature point of the head of the user included in the second image and a posture of the user. 13. The information processing device according to claim 12 , wherein when the range cannot be specified based on the relation between the feature point of the head of the user included in the second image and the posture of the user, the preprocessing unit newly requests acquisition of an image different from the second image and including a content image of the entire head of the user. 14. The information processing device according to claim 11 , wherein the preprocessing unit specifies a content image of an ear of the user by correcting rotation of the second image based on correction information included in the second image, and detects a specified range as the first image. 15. An information processing method by which a computer performs: acquiring a first image including a content image of an ear of a user; calculating, based on the acquired first image, a head-related transfer function corresponding to the user by using a learned model having learned to output a head-related transfer function corresponding to an ear when an image including a content image of the ear is input; and generating an ear parameter estimation model by learning a relation between a plurality of ear images obtained by changing: texture of three-dimensional data of the ear or a head, or luminance in rendering, and an ear parameter common to the ear images, wherein the texture includes a skin color of the ear or of the head. 16. A non-transitory computer-readable storage medium encoded with executable instructions that, when executed by at least one processor, cause the at least one processor to perform: acquiring a first image including a content image of an ear of a user; calculating, based on the first image acquired, a head-related transfer function corresponding to the user by using a learned model having learned to output a head-related transfer function corresponding to an ear when an image including a content image of the ear is input; and generating an ear parameter estimation model by learning a relation between a plurality of ear images obtained by changing: texture of three-dimensional data of the ear or a head, or luminance in rendering, and an ear parameter common to the ear images, wherein the texture includes a skin color of the ear or of the head.

Assignees

Inventors

Classifications

  • by matching two-dimensional images to three-dimensional objects · CPC title

  • Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title

  • Headphones for stereophonic communication {(details thereof, e.g. relating to batteries, cables or control elements H04R1/10)} · CPC title

  • using classification, e.g. of video objects · CPC title

  • Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11595772B2 cover?
An information processing device (100) according to the present disclosure includes: an acquisition unit (141) configured to acquire a first image including a content image of an ear of a user; and a calculation unit (142) configured to calculate, based on the first image acquired by the acquisition unit (141), a head-related transfer function corresponding to the user by using a learned model …
Who is the assignee on this patent?
Sony Group Corp
What technology area does this patent fall under?
Primary CPC classification H04S3/004. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Feb 28 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).