Classifying Segments of Speech Based on Acoustic Features and Context
US-2018068656-A1 · Mar 8, 2018 · US
US10482882B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10482882-B2 |
| Application number | US-201715825919-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 29, 2017 |
| Priority date | May 3, 2017 |
| Publication date | Nov 19, 2019 |
| Grant date | Nov 19, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods, apparatus, and computer readable media are described related to automated assistants that proactively incorporate, into human-to-computer dialog sessions, unsolicited content of potential interest to a user. In various implementations, based on content of an existing human-to-computer dialog session between a user and an automated assistant, an entity mentioned by the user or automated assistant may be identified. Fact(s)s related to the entity or to another entity that is related to the entity may be identified based on entity data contained in database(s). For each of the fact(s), a corresponding measure of potential interest to the user may be determined. Unsolicited natural language content may then be generated that includes one or more of the facts selected based on the corresponding measure(s) of potential interest. The automated assistant may then incorporate the unsolicited content into the existing human-to-computer dialog session or a subsequent human-to-computer dialog session.
Opening claim text (preview).
What is claimed is: 1. A method implemented by one or more processors, comprising: identifying an entity based on a state of a media or gaming software application being executed on a first client device operated by a user, wherein the entity is identified without using explicit input from the user; determining that an automated assistant operating on a second client device associated with the user has no outstanding obligations to the user, wherein the first and second client devices are part of a coordinated ecosystem of client devices; identifying one or more facts about the entity based on entity data contained in one or more databases; determining, for each of the one or more facts, a corresponding measure of potential interest to the user; generating, by one or more of the processors, unsolicited natural language content, wherein the unsolicited natural language content includes one or more of the facts selected based on the corresponding one or more measures of potential interest; and after the determination that the automated assistant has no outstanding obligations to the user, incorporating, by the automated assistant into a new or existing human-to-computer dialog session between the user and the automated assistant, the unsolicited natural language content, wherein the incorporating causes the unsolicited natural language content to be automatically output to the user as part of the new or existing human-to-computer dialog session. 2. The method of claim 1 , wherein determining the corresponding measure of potential interest is based on data associated with a user profile associated the user. 3. The method of claim 2 , wherein the data associated with the user profile includes a search history associated with the user. 4. The method of claim 1 , wherein determining the corresponding measure of potential interest is based on analysis of a textual corpus of one or more previous online conversations between multiple people different than the user. 5. The method of claim 4 , wherein the analysis includes detection in the textual corpus of one or more references to one or more entities, as well as detection of one or references to facts about the one or more entities in the textual corpus within a particular proximity of the one or more references to the one or more entities. 6. The method of claim 5 , wherein the one or more entities include the entity determined based on the state of the media or gaming software application. 7. The method of claim 5 , wherein the one or more entities share an entity class with the entity determined based on the state of the media or gaming software application. 8. The method of claim 5 , wherein the one or more entities share one or more attributes with the entity determined based on the state of the media or gaming software application. 9. The method of claim 1 , wherein determining the corresponding measure of potential interest comprises detecting that a given fact of the one or more facts has been previously referenced in the existing human-to-computer dialog session between the user and the automated assistant or in a previous human-to-computer dialog session between the user and the automated assistant, wherein the measure of potential interest determined for the given fact reflects the detecting. 10. The method of claim 1 , further comprising eliminating, by one or more of the processors, a given fact of the one or more facts from consideration based on detecting that the given fact has been previously referenced in the existing human-to-computer dialog session between the user and the automated assistant or in a previous human-to-computer dialog session between the user and the automated assistant. 11. The method of claim 1 , wherein the entity relates to a video game, and a given fact of the one or more facts is a fact related to the video game. 12. The method of claim 1 , wherein the entity is a person, and a given fact of the one or more facts is an upcoming event involving the person. 13. A system comprising one or more processors and memory operably coupled with the one or more processors, wherein the memory stores instructions that, in response to execution of the instructions by one or more processors, cause the one or more processors to perform the following operations: identifying an entity based on a state of a media or gaming software application executing on a first client device operated by a user, wherein the entity is identified without using explicit input from the user; determining that an automated assistant operating on a second client device associated with the user has no outstanding obligations to the user, wherein the first and second client devices are part of a coordinated ecosystem of client devices; identifying one or more facts about the entity based on entity data contained in one or more databases; determining, for each of the one or more facts, a corresponding measure of potential interest to the user; generating, by one or more of the processors, unsolicited natural language content, wherein the unsolicited natural language content includes one or more of the facts selected based on the corresponding one or more measures of potential interest; and after the determination that the automated assistant has no outstanding obligations to the user, incorporating, by the automated assistant into a new or existing human-to-computer dialog session between the user and the automated assistant, the unsolicited natural language content, wherein the incorporating causes the unsolicited natural language content to be automatically output to the user as part of the new or existing human-to-computer dialog session. 14. The system of claim 13 , wherein determining the corresponding measure of potential interest is based on data associated with a user profile associated the user. 15. The system of claim 14 , wherein the data associated with the user profile includes a search history associated with the user. 16. The system of claim 13 , wherein determining the corresponding measure of potential interest is based on analysis of a textual corpus of one or more previous online conversations between multiple people different than the user. 17. The system of claim 16 , wherein the analysis includes detection in the textual corpus of one or more references to one or more entities, as well as detection of one or references to facts about the one or more entities in the textual corpus within a particular proximity of the one or more references to the one or more entities. 18. The system of claim 17 , wherein the one or more entities include the entity determined based on the state of the media or gaming software application. 19. The system of claim 13 , wherein the entity relates to a video game, and a given fact of the one or more facts is a fact related to the video game. 20. At least one non-transitory computer-readable medium comprising instructions that, in response to execution of the instructions by one or more processors, cause the one or more processors to perform the following operations: identifying an entity based on a state of a media or gaming software application executing on a first client device operated by a user, wherein the entity is identified without using explicit input from the user; determining that an automated assistant operating on a second client device associated with the user has no outstanding obligations to the user, wherein the first and second client devices are part of a coordinated ecosystem of client devices; identifying
Natural language generation · CPC title
Discourse or dialogue representation · CPC title
Named entity recognition · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.