Method for human-machine dialogue, computing device and computer-readable storage medium

US12282747B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12282747-B2
Application numberUS-202217870813-A
CountryUS
Kind codeB2
Filing dateJul 21, 2022
Priority dateDec 29, 2020
Publication dateApr 22, 2025
Grant dateApr 22, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method includes: acquiring an input sentence in a first language in a current round of conversation; translating the input sentence in the first language to obtain an input sentence in a second language, according to dialogue contents in the first language and dialogue contents in the second language that have a mutual translation relationship with the dialogue contents in the first language in historical rounds of conversation; invoking a multi-round conversation generation model to parse the input sentence in the second language in the current round of conversation to generate an output sentence in the second language in the current round of conversation; translating the output sentence in the second language in the current round of conversation to obtain at least one candidate result in the first language; and determining an output sentence in the first language from the at least one candidate result in the first language.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for human-machine dialogue, comprising: acquiring an input sentence in a first language in a current round of conversation; translating, through a first sentence translation model, the input sentence in the first language in the current round of conversation to obtain an input sentence in a second language in the current round of conversation, according to dialogue contents in the first language and dialogue contents in the second language that have a mutual translation relationship with the dialogue contents in the first language in historical rounds of conversation, wherein the dialogue contents in the first language comprise input sentences in the first language and output sentences in the first language in corresponding rounds of conversation, and the dialogue contents in the second language comprise input sentences in the second language and output sentences in the second language in corresponding rounds of conversation, and wherein the first sentence translation model is obtained by performing training parallel corpora between the second language and the first language of multiple rounds of continuous conversation in open domain; invoking a pre-stored multi-round conversation generation model to parse the input sentence in the second language in the current round of conversation, according to the dialogue contents in the second language in the historical rounds of conversation, to generate an output sentence in the second language in the current round of conversation, wherein the multi-round conversation generation module is obtained by perform training based on multi-round dialogue corpora in the second language; translating, through a second sentence translation model, the output sentence in the second language in the current round of conversation, according to the dialogue contents in the first language and in the second language in the historical rounds of conversation, and the input sentence in the first language and the input sentence in the second language in the current round of conversation, to obtain at least one candidate result in the first language, wherein the second sentence translation model is obtained by performing training parallel corpora between the first language and the second language of multiple rounds of continuous conversation in the open domain; and determining an output sentence in the first language in the current round of conversation from the at least one candidate result in the first language for output. 2. The method of claim 1 , wherein determining the output sentence in the first language in the current round of conversation from the at least one candidate result in the first language for output comprises: for each of the at least one candidate result in the first language, invoking a pre-stored coherence evaluation model to calculate an expression coherence of the candidate result in the first language according to the dialogue contents in the first language in the historical rounds of conversation and the input sentence in the first language in the current round of conversation; and selecting one of the at least one candidate result with a largest expressive coherence as the output sentence in the first language in the current round of conversation and outputting the output sentence in the first language in the current round of conversation. 3. The method of claim 2 , further comprising: acquiring a plurality of valid dialogue corpus samples in the first language, wherein each of the valid dialogue corpus samples comprises dialogue contents in the first language in multiple consecutive rounds of conversation; for each of the valid dialogue corpus samples, constructing a negative dialogue corpus sample corresponding to the valid dialogue corpus sample; and training an initial classification model using the plurality of the valid dialogue corpus samples and the negative dialogue corpus samples to obtain the coherence evaluation model. 4. The method of claim 3 , wherein constructing a negative dialogue corpus sample corresponding to the valid dialogue corpus sample comprises: extracting, from the plurality of valid dialogue corpus samples, a target output sentence in the first language corresponding to a last one of the multiple consecutive rounds of conversation; translating the target output sentence in the first language to obtain a corresponding expression sentence in the second language; translating the expression sentence in the second language to a corresponding expression sentence in the first language; calculating a minimum edit distance between the target output sentence in the first language and the expression sentence in the first language; comparing the minimum edit distance with a preset distance threshold to obtain a comparison result, and determining a negative output sentence in the first language matching the target output sentence in the first language according to the comparison result; and replacing the target output sentence in the first language in valid dialogue corpus samples with the negative output sentence in the first language to obtain the negative dialogue corpus sample. 5. The method of claim 4 , wherein determining the negative output sentence in the first language matching the target output sentence in the first language according to the comparison result comprises: in response to the minimum edit distance being greater than the preset distance threshold, determining the expression sentence in the first language as the negative output sentence in the first language; and in response to the minimum edit distance being equal to the preset distance threshold, performing a synonym substitution on at least one word in the expression sentence in the first language to obtain the negative output sentence in the first language. 6. The method of claim 1 , wherein each parallel corpus required for training the first sentence translation model is expressed by a long corpus sequence as follows: [ . . . ,L t-2 ,H t-2 ,L t-1 ,H t-1 ,L t ,H t ], where, t is an integer, H represents sentences in the second language, L represents sentences in the first language, and H t-n represents a translated parallel corpus of L t-n that has same subscript with H t-n , wherein n=0, 1, 2, 3, 4 . . . ; wherein a first-language conversation operation of a same round of conversation comprises: an input operation of an input sentence in the first language, and an output operation of an output sentence in the first language in a round of conversation; and a second-language conversation operation of the same round of conversation comprises: an input operation of an input sentence in the second language, and an output operation of an output sentence in the second language in the round of conversation. 7. The method of claim 6 , wherein, when t is an even number, L t represents an output sentence in the first language in the t/2th round of conversation, L t-1 represents an input sentence in the first language in the t/2th round of conversation, H t represents an output sentence in the second language in the t/2th round of conversation, H t-1 represents an input sentence in the second language in the t/2th round of conversation, H t-2 represents an output sentence in the second language in the (t−2)/2th round of conversation, and L t-2 represents an output sentence in the first language in the (t−2)/2th round of conversation. 8. The method of claim 6 , wherein, when t is an odd number, L t represents an input sentence in the first language in the (t+1)/2th round of conversation, L t-1 represents an output sentence in the first language in the (t−1)/2th round of conversation, H t represents an input sentence in the second language in the (t+1)/2

Assignees

Inventors

Classifications

  • using phonetics · CPC title

  • Machine learning · CPC title

  • Natural language query formulation or dialogue systems · CPC title

  • Translation of the query language, e.g. Chinese to English · CPC title

  • Machine-assisted translation, e.g. using translation memory · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12282747B2 cover?
A method includes: acquiring an input sentence in a first language in a current round of conversation; translating the input sentence in the first language to obtain an input sentence in a second language, according to dialogue contents in the first language and dialogue contents in the second language that have a mutual translation relationship with the dialogue contents in the first language …
Who is the assignee on this patent?
Ubtech Robotics Corp Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/3329. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 22 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).