System and method for adaptive masking and non-directional language understanding and generation

US2022180071A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2022180071-A1
Application numberUS-202117540768-A
CountryUS
Kind codeA1
Filing dateDec 2, 2021
Priority dateDec 4, 2020
Publication dateJun 9, 2022
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided are a system and method for adaptive masking and non-directional language understanding and generation. The system for adaptive masking and non-directional language understanding and generation according to the present invention includes an encoder unit including an adaptive masking block for performing masking on training data, a language generator for restoring masked words, and an encoder for detecting whether or not the restored sentence construction words are original, and a decoder unit including a generation word position detector for detecting a position of a word to be generated next, a language generator for determining a word suitable for the corresponding position, and a non-directional training data generator for decoder training.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system for adaptive masking and non-directional language understanding and generation, the system comprising: an encoder unit including an adaptive masking block for performing masking on training data, a language generator for restoring masked words, and an encoder for detecting whether or not the restored sentence construction words are original; and a decoder unit including a generation word position detector for detecting a position of a word to be generated next, a language generator for determining a word suitable for the corresponding position, and a non-directional training data generator for decoder training. 2 . The system of claim 1 , wherein the adaptive masking block performs masking by converting a predetermined ratio of words into a special symbol. 3 . The system of claim 1 , wherein the language generator restores the masked words to obtain a converted input string. 4 . The system of claim 3 , wherein the encoder compares an input string with a converted input string to perform change token prediction. 5 . The system of claim 1 , wherein the decoder unit generates a word by inputting a context, determines a next word generation position by inputting the context and a pre-generated word, generates a next word by inputting the context and pre-generated word to the determined word generation position, and stops a non-directional language generation procedure when the generated word is a sentence termination symbol. 6 . The system of claim 1 , wherein the generation word position detector derives the position of the word to be generated next by inputting a current context and a generated partial result using non-directional training data having a corresponding language generation order. 7 . The system of claim 1 , wherein the non-directional training data generator derives a language generation order that is highly relevant to input context. 8 . The system of claim 1 , wherein the decoder unit performs parallel decoding at a time of language generation. 9 . The system of claim 1 , wherein, when masking is performed, the encoder adjusts a masking ratio by reflecting characteristics of a language generator in which training is in progress. 10 . The system of claim 1 , wherein, as performance of the language generator is improved, noise of an input sentence is maintained at a predetermined ratio or more by increasing a masking probability value for a construction vocabulary.

Assignees

Inventors

Classifications

  • Natural language generation · CPC title

  • Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

  • Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

  • Lexical analysis, e.g. tokenisation or collocates · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022180071A1 cover?
Provided are a system and method for adaptive masking and non-directional language understanding and generation. The system for adaptive masking and non-directional language understanding and generation according to the present invention includes an encoder unit including an adaptive masking block for performing masking on training data, a language generator for restoring masked words, and an e…
Who is the assignee on this patent?
Electronics & Telecommunications Res Inst
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jun 09 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).