Methods and systems of interchanging code-mixed words and uni-language words

US11151326B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11151326-B2
Application number  US-201916365714-A
Country             US
Kind code           B2
Filing date         Mar 27, 2019
Priority date       Feb 13, 2019
Publication date    Oct 19, 2021
Grant date          Oct 19, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and a system of interchanging code-mixed words and uni-language words are described. In an embodiment, the method may include identifying two or more portions of a target word, wherein the target word is one of a code-mixed word and a uni-language word. The method may further include determining one or more equivalent words corresponding to at least one portion of the two or more portions, wherein the one or more equivalent words are in at least one predetermined target language. The method may further include selecting at least one equivalent word from the one or more equivalent words, based on a context of the target word within a sentence comprising the target word. The method may further include replacing the at least one portion of the two or more portions of the target word with the selected at least one equivalent word.
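The four steps in the abstract (identify portions, determine equivalents, select by context, replace) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and does not come from the patent: the toy word "chaling" (Hindi root "chal" plus the English suffix "-ing"), the lexicon, the context hints, and the function names are all hypothetical. Plain suffix stripping stands in for the portion identification step, which claim 4 says may use a bidirectional LSTM.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy data for illustration only; none of it is from the patent.
SUFFIXES: Tuple[str, ...] = ("ing", "ed")
EQUIVALENTS: Dict[str, List[str]] = {"chal": ["walk", "go"]}
CONTEXT_HINTS: Dict[str, Set[str]] = {
    "walk": {"park", "road"},
    "go": {"home", "school"},
}


def identify_portions(word: str) -> Tuple[str, str]:
    """Split a word into (stem, ending). Simple suffix stripping is a stand-in."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) > len(suffix):
            return word[: -len(suffix)], suffix
    return word, ""


def determine_equivalents(portion: str) -> List[str]:
    """Look up candidate target-language words for one portion."""
    return EQUIVALENTS.get(portion, [])


def select_equivalent(candidates: List[str], sentence: str, target: str) -> str:
    """Pick the candidate whose hint words overlap most with the sentence."""
    context = {w.strip(".,!?").lower() for w in sentence.split()}
    context.discard(target.lower())
    return max(candidates, key=lambda c: len(CONTEXT_HINTS.get(c, set()) & context))


def interchange(word: str, sentence: str) -> str:
    """Replace the stem of a code-mixed word and keep its ending."""
    stem, ending = identify_portions(word)
    candidates = determine_equivalents(stem)
    if not candidates:
        return word  # nothing known to substitute; leave the word unchanged
    return select_equivalent(candidates, sentence, word) + ending


print(interchange("chaling", "He is chaling in the park"))  # walking
print(interchange("chaling", "She is chaling to school"))   # going
```

The same target word resolves differently in the two sentences because the selection step looks at the surrounding words, which is the role the abstract assigns to context.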

First claim

Opening claim text (preview).

What is claimed is:

1. A method of interchanging code-mixed words and uni-language words, the method comprising: identifying, by a translation device, two or more portions of a target word, wherein the target word is one of a code-mixed word and a uni-language word; determining, by the translation device, one or more equivalent words corresponding to at least one portion of the two or more portions, wherein the one or more equivalent words are in at least one predetermined target language, wherein determining the one or more equivalent words comprises identifying a derivative word for a portion of the at least one portion to be replaced, and wherein the derivative word is in a native language associated with the portion; selecting, by the translation device, at least one equivalent word from the one or more equivalent words, based on a context of the target word within a sentence comprising the target word; and replacing, by the translation device, the at least one portion of the two or more portions of the target word with the selected at least one equivalent word.
2. The method of claim 1, wherein at least one remaining portion of the two or more portions is an ending phoneme corresponding to a verb.
3. The method of claim 2, wherein the at least one predetermined target language is determined based on the at least one portion that is an ending phoneme.
4. The method of claim 1, wherein the two or more portions are identified using a bidirectional Long Short Term Memory (LSTM).
5. The method of claim 1, wherein the at least one predetermined target language corresponds to a user preference.
6. The method of claim 1, wherein selecting the at least one equivalent word comprises identifying the at least one equivalent word associated with the derivative word.
7. The method of claim 6 further comprising: computing a first probability score for each of the at least one equivalent word, based on an associated usage probability, wherein the associated usage probability of an equivalent word of the at least one equivalent word is derived based on linguistic rules of the at least one predetermined target language; and computing a second probability score for each of the at least one equivalent word, based on an associated transition probability, wherein the associated transition probability of the equivalent word of the at least one equivalent word is derived based on a placement of the equivalent word within the sentence comprising the target word.
8. The method of claim 7 further comprising: determining a combined probability score for each of the at least one equivalent word based on the associated first probability score and the second probability score; and selecting an equivalent word from the at least one equivalent word based on the combined probability score determined for each of the at least one equivalent word.
9. The method of claim 8 further comprising: receiving an alternate word suggestion for the equivalent word from a user; and training the translation device based on the alternate word suggested by the user.
10. A translation device for interchanging code-mixed words and uni-language words, the translation device comprising: a processor; and a memory communicatively coupled to the processor, wherein the memory stores processor instructions, which, on execution, causes the processor to: identify two or more portions of a target word, wherein the target word is one of a code-mixed word and a uni-language word; determine one or more equivalent words corresponding to at least one portion of the two or more portions, wherein the one or more equivalent words are in at least one predetermined target language, wherein determining the one or more equivalent words comprises identifying a derivative word for a portion of the at least one portion to be replaced, and wherein the derivative word is in a native language associated with the portion; select at least one equivalent word from the one or more equivalent words, based on a context of the target word within a sentence comprising the target word; and replace the at least one portion of the two or more portions of the target word with the selected at least one equivalent word.
11. The translation device of claim 10, wherein at least one remaining portion of the two or more portions is an ending phoneme corresponding to a verb, and wherein the at least one predetermined target language is determined based on the at least one portion that is an ending phoneme.
12. The translation device of claim 10, wherein the two or more portions are identified using a bidirectional Long Short Term Memory (LSTM).
13. The translation device of claim 10, wherein the at least one predetermined target language corresponds to a user preference.
14. The translation device of claim 10, wherein selecting the at least one equivalent word comprises identifying the at least one equivalent word associated with the derivative word.
15. The translation device of claim 14, wherein the processor instructions further cause the processor to: compute a first probability score for each of the at least one equivalent word, based on an associated usage probability, wherein the associated usage probability of an equivalent word of the at least one equivalent word is derived based on linguistic rules of the at least one predetermined target language; and compute a second probability score for each of the at least one equivalent word, based on an associated transition probability, wherein the associated transition probability of the equivalent word of the at least one equivalent word is derived based on a placement of the equivalent word within the sentence comprising the target word.
16. The translation device of claim 15, wherein the processor instructions further cause the processor to: determine a combined probability score for each of the at least one equivalent word based on the associated first probability score and the second probability score; and select an equivalent word from the at least one equivalent word based on the combined probability score determined for each of the at least one equivalent word.
17. The translation device of claim 16, wherein the processor instructions further cause the processor to: receive an alternate word suggestion for the equivalent word from a user; and train the translation device based on the alternate word suggested by the user.
18. A non-transitory computer-readable storage medium having stored thereon, a set of computer-executable instructions causing a computer comprising one or more processors to perform steps comprising: identifying two or more portions of a target word, wherein the target word is one of a code-mixed word and a uni-language word; determining one or more equivalent words corresponding to at least one portion of the two or more portions, wherein the one or more equivalent words are in at least one predetermined target language, wherein determining the one or more equivalent words comprises identifying a derivative word for a portion of the at least one portion to be replaced, and wherein the derivative word is in a native language associated with the portion; selecting at least one equivalent word from the one or more equivalent words, based on a context of the target word within a sentence comprising the target word; and replacing the at least one portion of the two or more portions of the target word with the selected at least one equivalent word.
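Claims 7 and 8 (and their device counterparts, claims 15 and 16) describe scoring each candidate equivalent word with a usage probability and a transition probability, combining the two, and selecting on the combined score. The claims do not state how the scores are combined, so the sketch below assumes a simple product. The function names and the probability values are hypothetical and chosen only to make the arithmetic visible.

```python
from typing import Dict


def combined_scores(usage: Dict[str, float], transition: Dict[str, float]) -> Dict[str, float]:
    """Combine the two per-candidate scores.

    Assumption: the combination is a product. The claims only say the combined
    score is "based on" the first and second probability scores.
    """
    return {word: usage[word] * transition.get(word, 0.0) for word in usage}


def select_best(usage: Dict[str, float], transition: Dict[str, float]) -> str:
    """Return the candidate with the highest combined score."""
    scores = combined_scores(usage, transition)
    return max(scores, key=scores.get)


# Hypothetical numbers: "walk" is the more common word overall (usage),
# but "go" fits this position in the sentence better (transition).
usage = {"walk": 0.6, "go": 0.4}
transition = {"walk": 0.2, "go": 0.7}

print(combined_scores(usage, transition))
print(select_best(usage, transition))  # go
```

With these numbers the combined scores are 0.6 x 0.2 = 0.12 for "walk" and 0.4 x 0.7 = 0.28 for "go", so the sentence-level score overrides the word that is more frequent in isolation. A weighted sum or a sum of log probabilities would be equally compatible with the claim language.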

Assignees

Wipro Ltd

Classifications

  • Grammatical analysis; Style critique (CPC title)

  • G06F40/263 (Primary): Language identification

  • Statistical methods, e.g. probability models (CPC title)

  • Recognition of textual entities (CPC title)

  • G06F40/30 (Primary): Semantic analysis

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11151326B2 cover?
A method and a system of interchanging code-mixed words and uni-language words are described. In an embodiment, the method may include identifying two or more portions of a target word, wherein the target word is one of a code-mixed word and a uni-language word. The method may further include determining one or more equivalent words corresponding to at least one portion of the two or more port…
Who is the assignee on this patent?
Wipro Ltd
What technology area does this patent fall under?
Primary CPC classification G06F40/263. Mapped technology areas include Physics.
When was this patent published?
Publication date: Oct 19, 2021 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).