What technology area does this patent fall under?

Primary CPC classification G06F40/20. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Sep 26 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Goal-oriented conversation with code-mixed language

US11769004B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11769004-B2
Application number	US-202016732708-A
Country	US
Kind code	B2
Filing date	Jan 2, 2020
Priority date	Jan 2, 2020
Publication date	Sep 26, 2023
Grant date	Sep 26, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer system may create a language model corpus including multilingual alignment for training a combined language model and train (or pre-train) the combined language model. The computer system may create an adverse medication reaction corpus to include adverse medication reaction utterances and label an N-gram of an utterance in the adverse medication reaction utterances as a response to query, for multiple N-grams. The computer system may generate a code-mixed utterance model to perform code-mixed utterances in a turn by turn dialogue, by at least adding additional output layer including at least a start vector, language vector, and a query vector including at least the labeled N-gram, which are additional to the combined language model's predicted next words.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented system comprising: a hardware processor; a memory device coupled with the hardware processor: the hardware processor configured to at least: create a language model corpus including multilingual alignment for training a combined language model, the language model corpus created from data sourced from a computer network and train the combined language model; create an adverse medication reaction corpus by analyzing data from online sources which include adverse medication reaction utterances and labeling an N-gram of an utterance in the adverse medication reaction utterances as a response to query; and generate a code-mixed utterance model to perform code-mixed utterances in a turn by turn dialogue, by at least adding additional output layer including at least a start vector, language vector, and a query vector including at least the labeled N-gram, which are additional to the combined language model's predicted next words, wherein the start vector encodes what word should start a response to an input utterance, and the language vector encodes what language out of multiple languages used in the code-mixed utterances should a next word of the response be in, wherein the code-mixed utterances represent utterances performed using multiple languages in a mixed manner in a conversation; and wherein the hardware processor is configured to align text from multiple languages by adversarial learning of a linear mapping between the multiple languages in an embedding space and refining alignments using geometric transformation. 2. The system of claim 1 , wherein the hardware processor is further configured to augment the adverse drag reaction corpus with at least one paraphrase of at least one of the utterances, the paraphrase being in multiple languages corresponding to the multiple languages. 3. The system of claim 1 , wherein the code-mixed utterance model includes a loss function including a 2-class softmax, in which a probability of a next utterance being a logical utterance with respect to a previous utterance is maximized. 4. The system of claim 1 , wherein the hardware processor is further configured to perform a fuzzification of the n-gram. 5. The system of claim 1 , wherein the combined language model includes a recurrent neural network. 6. The system of claim 1 , wherein the code-mixed utterance model includes a recurrent neural network. 7. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a device to cause the device to: create a language model corpus including multilingual alignment for training a combined language model, the language model corpus created from data sourced from a computer network and train the combined language model; create an adverse medication reaction corpus by analyzing data from online sources which include adverse medication reaction utterances and labeling an N-gram of an utterance in the adverse medication reaction utterances as a response to query; and generate a code-mixed utterance model to perform code-mixed utterances in a turn by turn dialogue, by at least adding additional output layer including at least a start vector, language vector, and a query vector including at least the labeled N-gram, which are additional to the combined language model's predicted next words, wherein the start vector encodes what word should start a response to an input utterance, and the language vector encodes what language out of multiple languages used in the code-mixed utterances should a next word of the response be in, wherein the code-mixed utterances represent utterances performed using multiple languages in a mixed manner in a conversation; and wherein the device is caused to align text from multiple languages by adversarial learning of a linear mapping between the multiple languages in an embedding space and refining alignments using geometric transformation. 8. The computer program product of claim 7 , wherein the device is further caused to augment the adverse drag reaction corpus with at least one paraphrase of at least one of the utterances, the paraphrase being in multiple languages corresponding to the multiple languages. 9. The computer program product of claim 7 , wherein the code-mixed utterance model includes a loss function including a 2-class softmax, in which a probability of a next utterance being a logical utterance with respect to a previous utterance is maximized. 10. The computer program product of claim 7 , wherein the device is further caused to perform a fuzzification of the n-gram. 11. The computer program product of claim 7 , wherein the combined language model includes a recurrent neural network. 12. The computer program product of claim 7 , wherein the code-mixed utterance model includes a recurrent neural network. 13. A computer-implemented method comprising: creating a language model corpus including multilingual alignment for training a combined language model, the language model corpus created from data sourced from a computer network and train the combined language model; creating an adverse medication reaction corpus by analyzing data from online sources which include adverse medication reaction utterances and labeling an N-gram of an utterance in the adverse medication reaction utterances as a response to query; and generating a code-mixed utterance model to perform code-mixed utterances in a turn by turn dialogue, by at least adding additional output layer including at least a start vector, language vector, and a query vector including at least the labeled N-gram, which are additional to the combined language model's predicted next words, wherein the start vector encodes what word should start a response to an input utterance, and the language vector encodes what language out of multiple languages used in the code-mixed utterances should a next word of the response be in, wherein the code-mixed utterances represent utterances performed using multiple languages in a mixed manner in a conversation; and aligning text from multiple languages by adversarial learning of a linear mapping between the multiple languages in an embedding space and refining alignments using geometric transformation. 14. The method of claim 13 , wherein further including augmenting the adverse drag reaction corpus with at least one paraphrase of at least one of the utterances, the paraphrase being in multiple languages corresponding to the multiple languages. 15. The method of claim 13 , wherein the code-mixed utterance model includes a loss function including a 2-class softmax, in which a probability of a next utterance being a logical utterance with respect to a previous utterance is maximized. 16. The method of claim 13 , further including performing a fuzzification of the n-gram. 17. The method of claim 13 , wherein generating the code-mixed utterance model includes training a recurrent neural network.

Assignees

Inventors

Classifications

G06N3/0442
characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title
G06N3/09
Supervised learning · CPC title
G06N3/094
Adversarial learning · CPC title
G06F40/20Primary
Natural language analysis (semantic analysis of natural language G06F40/30) · CPC title
G06N3/044
Recurrent networks, e.g. Hopfield networks · CPC title

Patent family

Related publications grouped by family.

View patent family 76655481

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11769004B2 cover?: A computer system may create a language model corpus including multilingual alignment for training a combined language model and train (or pre-train) the combined language model. The computer system may create an adverse medication reaction corpus to include adverse medication reaction utterances and label an N-gram of an utterance in the adverse medication reaction utterances as a response to …
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06F40/20. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Sep 26 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Methods and systems for informing self-treatment remedy selection

Method and System for Language and Domain Acceleration with Embedding Evaluation

Clause-wise text-to-sql generation

End of query detection

Frequently asked questions