Information processing apparatus, information processing method, and program

US9477304B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9477304-B2
Application numberUS-201113116848-A
CountryUS
Kind codeB2
Filing dateMay 26, 2011
Priority dateJun 2, 2010
Publication dateOct 25, 2016
Grant dateOct 25, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An information processing apparatus includes an image analysis unit that executes a process for analyzing an image captured by a camera, a speech analysis unit that executes a process for analyzing speech input from a microphone, and a data processing unit that receives a result of the analysis conducted by the image analysis unit and a result of the analysis conducted by the speech analysis unit and that executes output control of help information for a user. The data processing unit calculates a degree of difficulty of the user on the basis of at least either the result of the image analysis or the result of the speech analysis and, if the degree of difficulty that has been calculated is equal to or more than a predetermined threshold value, executes a process for outputting help information to the user.

First claim

Opening claim text (preview).

What is claimed is: 1. An information processing electronic device configured to: execute a first process for analyzing an image captured by a camera and for determining whether the image contains a face; execute a second process for analyzing speech input from a microphone; receive a result of the first process and a result of the second process and execute output control of help information for a user; judge a duration over which a face of the user faces a particular direction on the basis of the result of the first process, if the image is determined to contain a face; determine a number of times the information processing electronic device rejects execution of a process, corresponding to the speech input, in response to the speech input made by the user; determine a user level of the user based on the number of times the information processing electronic device rejects the execution of the process corresponding to the speech input, a feature quantity and a predetermined threshold value for the number of times of rejecting the execution of the process, wherein the feature quantity represents information regarding reasons for rejecting the execution of the process; and calculate a degree of difficulty of the user by using information regarding the judgment, the number of times and the determined user level, and if the degree of difficulty that has been calculated is equal to or more than a predetermined threshold value, execute a process for outputting the help information to the user. 2. The information processing electronic device according to claim 1 , further configured to: determine whether or not a face of the user faces the information processing electronic device on the basis of the result of the first process and calculate the degree of difficulty by using information regarding the determination. 3. The information processing electronic device according to claim 1 , further configured to: calculate the degree of difficulty by using information regarding a judgment as to whether or not a process corresponding to a request made by the user has been executed. 4. The information processing electronic device according to claim 1 , further configured to: calculate the degree of difficulty on the basis of information regarding a time elapsed since the information processing electronic device executed a process for responding to the user. 5. The information processing electronic device according to claim 1 , further configured to: obtain and store state transition of the information processing electronic device; and execute a process for outputting help information corresponding to a stored system state. 6. The information processing electronic device of claim 1 , wherein the first process comprises the information processing electronic device being configured to: responsive to determining the image contains a face: determine whether the face contained in the image is a face of the user; estimate an angle of the face with respect to the information processing electronic device; and responsive to determining that the face contained in the image is a face of the user, determine that the face of the user faces the particular direction based on the estimated angle. 7. The information processing electronic device according to claim 1 , wherein the number of times the information processing electronic device rejects the execution of the process corresponding to the speech input is determined for a session, and wherein the session refers to a period of time until the information processing electronic device executes the process corresponding to the speech input. 8. The information procession electronic device according to claim 1 , wherein the reason for rejecting the execution of the process includes at least one of: a failure in detection of voiced frames, a judgment that an utterance has been made outside a domain, and results of speech analysis judged to have low reliability. 9. The information processing electronic device according to claim 1 , further configured to determine a learning level of the user in a predetermined period of time until a present time. 10. The information processing electronic device according to claim 1 , further configured to match a direction of the face of the user at a first instance with the direction of the face of the user at a second instance, wherein the second instance is before the first instance. 11. The information processing electronic device according to claim 1 , further configured to compare the duration over which the face of the user faces the particular direction with a predetermined threshold time to determine the degree of difficulty of the user. 12. An information processing method that is used in an information processing apparatus, the information processing method comprising: executing, with an image analysis unit, a process for analyzing an image captured by a camera; executing, with a speech analysis unit, a process for analyzing speech input from a microphone; and receiving, with a data processing unit, a result of the analysis conducted by the image analysis unit and a result of the analysis conducted by the speech analysis unit and executing output control of help information for a user, wherein, in the receiving, a degree of difficulty of the user is calculated based on: a duration over which a face of the user faces a particular direction, if the image is determined to contain a face, the duration being determined based on the result of the image analysis, a number of times the information processing apparatus rejects execution of a process, corresponding to the speech input, in response to the speech input made by the user, and a user level of the user determined based on the number of times the information processing electronic device rejects the execution of the process corresponding to the speech input, a feature quantity and a predetermined threshold value for the number of times of rejecting the execution of the process, wherein the feature quantity represents information regarding reasons for rejecting the execution of the process, and if the degree of difficulty that has been calculated is equal to or more than a predetermined threshold value, a process for outputting the help information to the user is executed. 13. A non-transitory computer-readable medium comprising instructions that, when executed by an information processing apparatus, cause the information processing apparatus to: cause an image analysis unit to execute a process for analyzing an image captured by a camera; cause a speech analysis unit to execute a process for analyzing speech input from a microphone; and cause a data processing unit to receive a result of the analysis conducted by the image analysis unit and a result of the analysis conducted by the speech analysis unit and to execute output control of help information to a user, wherein, in the causing of the data processing unit to receive the result, a degree of difficulty of the user is calculated based on: a duration over which a face of the user faces a particular direction, if the image is determined to contain a face, the duration being determined based on the result of the image analysis, a number of times the information processing apparatus rejects execution of a process, corresponding to the speech input, in response to the speech input made by the user, and a user level of the user determined based on the number of times the information processing electronic device rejects the execution of the process corresponding to the speech input, a feature quantity and a predetermined threshold value for the number of times of

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • G06F3/012Primary

    Head tracking input arrangements · CPC title

  • Sound input; Sound output (speech processing G10L) · CPC title

  • Human faces, e.g. facial parts, sketches or expressions · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9477304B2 cover?
An information processing apparatus includes an image analysis unit that executes a process for analyzing an image captured by a camera, a speech analysis unit that executes a process for analyzing speech input from a microphone, and a data processing unit that receives a result of the analysis conducted by the image analysis unit and a result of the analysis conducted by the speech analysis un…
Who is the assignee on this patent?
Sano Akane, Sony Corp
What technology area does this patent fall under?
Primary CPC classification G06F3/012. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 25 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).