Hierarchical natural language understanding systems

US11880659B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11880659-B2
Application numberUS-202117162318-A
CountryUS
Kind codeB2
Filing dateJan 29, 2021
Priority dateJan 29, 2021
Publication dateJan 23, 2024
Grant dateJan 23, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems for hierarchical natural language understanding are described. A representation of an utterance is inputted to a first machine learning model to obtain information on the first utterance. According to the information on the utterance a determination that the representation of the utterance is to be inputted to a second machine learning model that performs a dedicated natural language task is performed. In response to determining that the representation of the utterance is to be inputted to a second machine learning model, the utterance is inputted to the second machine learning model to obtain an output of the dedicated natural language task.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: inputting a representation of a first utterance to a first machine learning model to obtain information on the first utterance; according to the information on the first utterance, determining that the representation of the first utterance is to be inputted to a second machine learning model that performs a dedicated natural language task; and in response to determining that the representation of the first utterance is to be inputted to a second machine learning model, inputting the representation of the first utterance to the second machine learning model to obtain an output of the dedicated natural language task. 2. The method of claim 1 further comprising: inputting a representation of a second utterance to the first machine learning model to obtain information on the second utterance; determining that the information on the second utterance is an output of a second natural language task; and in response to determining that the information on the second utterance is the output of the second natural language task bypassing the second machine learning model. 3. The method of claim 1 , wherein the dedicated natural language task is one of natural language inference, paraphrasing, named entity recognition, and question answering. 4. The method of claim 1 , wherein the information on the first utterance includes one or a combination of a field associated with the first utterance, an entity associated with the first utterance, one or more filtered historical data associated with the first utterance, a stance of a user associated with the first utterance, a sentiment of the user associated with the first utterance. 5. The method of claim 4 , wherein one or more of the field associated with the first utterance, the entity associated with the first utterance, the one or more filtered historical data associated with the first utterance, the stance of the user associated with the first utterance, the sentiment of the user associated with the first utterance are inputted to the second machine learning model with the representation of the first utterance to obtain the output of the dedicated natural language task. 6. The method of claim 1 , wherein the first machine learning model is a zero-shot model that is operative to observe a representation of an utterance from a class that was not observed during a training phase of the first machine learning model and is operative to predict one or more classes that the representation of the first utterance belongs to. 7. The method of claim 1 , wherein the first machine learning model and the second machine learning model enable a conversational artificial intelligence system for communication with a user. 8. The method of claim 1 , wherein the second machine learning model is for use in a predetermined field. 9. The method of claim 1 , wherein in response to determining that the representation of the first utterance is to be inputted to a second machine learning model, further inputting the information on the first utterance to the second machine learning model to obtain the output of the dedicated natural language task. 10. The method of claim 1 , the determining that the representation of the first utterance is to be inputted to a second machine learning model includes: selecting, based on the information on the first utterance, the second machine learning model from a plurality of second machine learning models. 11. A non-transitory machine-readable storage medium that provides instructions that, if executed by a processor, will cause said processor to perform operations comprising: inputting a representation of a first utterance to a first machine learning model to obtain information on the first utterance; according to the information on the first utterance, determining that the representation of the first utterance is to be inputted to a second machine learning model that performs a dedicated natural language task; and in response to determining that the representation of the first utterance is to be inputted to a second machine learning model, inputting the representation of the first utterance to the second machine learning model to obtain an output of the dedicated natural language task. 12. The non-transitory machine-readable storage medium of claim 11 , wherein the operations further comprise: inputting a representation of a second utterance to the first machine learning model to obtain information on the second utterance; determining that the information on the second utterance is an output of a second natural language task; and in response to determining that the information on the second utterance is the output of the second natural language task bypassing the second machine learning model. 13. The non-transitory machine-readable storage medium of claim 11 , wherein the dedicated natural language task is one of natural language inference, paraphrasing, named entity recognition, and question answering. 14. The non-transitory machine-readable storage medium of claim 11 , wherein the information on the first utterance includes one or a combination of a field associated with the first utterance, an entity associated with the first utterance, one or more filtered historical data associated with the first utterance, a stance of a user associated with the first utterance, a sentiment of the user associated with the first utterance. 15. The non-transitory machine-readable storage medium of claim 14 , wherein one or more of the field associated with the first utterance, the entity associated with the first utterance, the one or more filtered historical data associated with the first utterance, the stance of the user associated with the first utterance, the sentiment of the user associated with the first utterance, are inputted to the second machine learning model with the representation of the first utterance to obtain the output of the dedicated natural language task. 16. The non-transitory machine-readable storage medium of claim 11 , wherein the first machine learning model is a zero-shot model that is operative to observe a representation of an utterance from a class that was not observed during a training phase of the first machine learning model and is operative to predict one or more classes that the representation of the first utterance belongs to. 17. The non-transitory machine-readable storage medium of claim 11 , wherein the first machine learning model and the second machine learning model enable a conversational artificial intelligence system for communication with a user. 18. The non-transitory machine-readable storage medium of claim 11 , wherein the second machine learning model is for use in a predetermined field. 19. The non-transitory machine-readable storage medium of claim 11 , wherein in response to determining that the representation of the first utterance is to be inputted to a second machine learning model, further inputting the information on the first utterance to the second machine learning model to obtain the output of the dedicated natural language task. 20. The non-transitory machine-readable storage medium of claim 11 , the determining that the representation of the first utterance is to be inputted to a second machine learning model includes: selecting, based on the information on the first utterance, the second machine learning model from a plurality of second machine learning models.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11880659B2 cover?
Methods and systems for hierarchical natural language understanding are described. A representation of an utterance is inputted to a first machine learning model to obtain information on the first utterance. According to the information on the utterance a determination that the representation of the utterance is to be inputted to a second machine learning model that performs a dedicated natural…
Who is the assignee on this patent?
Salesforce Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).