Encoding and adaptive, scalable accessing of distributed models

US9619465B2 · US · B2

Patent metadata
Publication number: US-9619465-B2
Application number: US-201414285693-A
Country: US
Kind code: B2
Filing date: May 23, 2014
Priority date: Feb 17, 2006
Publication date: Apr 11, 2017
Grant date: Apr 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and apparatus for accessing distributed models in automated machine processing, including using large language models in machine translation, speech recognition and other applications.

First claim

Opening claim text (preview).

What is claimed is:

1. A system comprising: a translation server operable to perform machine translation obtaining translation model data from a translation model for translation between a source language and a target language and language model data from a language model for the target language, the translation server further operable to translate text in the source language into the target language using the obtained translation model data and language model data, the translation server comprising: a request queue operable to store requests for language model data to be obtained for translating a segment in the source language, and a segment translation server cache operable to store language model data obtained by the requests by the translation server, wherein the translation server is further operable to: process the translation of the segment using language model data from a second language model for the target language to produce an initial translation of the segment before the requests for the language data in the language model in the request queue are sent out, update the requests for the language model data of the language model in the request queue based on the initial translation, send out the updated requests in the request queue to obtain language model data from the language model for processing the initial translation, and after the updated requests are served and the data for the updated requests are stored in the segment translation server cache, process the initial translation with the data for the updated requests to produce a final translation.

2. The system of claim 1, wherein: the segment translation server cache is operable to delete the obtained language model data after the segment is translated.

3. The system of claim 1, wherein: the segment translation server cache is operable to delete the obtained language model data periodically.

4. The system of claim 1, wherein the translation server is operable to process translation of the segment before all of the requests in the request queue are served, and the translation server is further operable to finalize translation of the segment using the language model data in the segment translation server cache obtained by the requests.

5. The system of claim 1, wherein: the segment translation server cache is further operable to store history information of translation of an assigned segment.

6. The system of claim 1, wherein the translation model is divided into a plurality of translation model partitions, each translation model partition being less than the entire translation model and being stored on a different translation model server of a plurality of translation model servers, and the respective translation model partitions together constituting the entire translation model.

7. The system of claim 1, wherein language model is divided into a plurality of language model partitions, each language model partition being less than the entire language model and the respective language model partitions together constituting the entire language model.

8. A system comprising: a translation server operable to perform machine translation obtaining translation model data from a translation model for translation between a source language and a target language and language model data from a language model for the target language, the translation server further operable to translate text in the source language into the target language using the obtained translation model data and language model data, the translation server comprising: a request queue operable to store requests for language model data to be obtained for translating a segment in the source language, a segment translation server cache operable to store language model data obtained by the requests by the translation server; and a second segment translation server cache storing a selected portion of the language model, wherein the translation server is operable to: process the translation of the segment using language model data from a second language model for the target language to produce an initial translation of the segment before the requests for the language data in the language model in the request queue are sent out, update the requests for the language model data of the language model in the request queue based on the initial translation, send out the updated requests in the request queue to obtain language model data from the language model for processing the initial translation, and after the updated requests are served and the data for the updated requests are stored in the segment translation server cache, process the initial translation with the data for the updated requests to produce a final translation, after completing translation of the segment, delete data in the segment translation server cache and retain the selected portion of the language model in the second segment translation server cache.

9. The system of claim 8, wherein: the translation server is operable to look up the second segment translation server cache for a piece of language model data needed for translating the segment before generating a request for the piece of language model data, and when the piece of language model data is present in the second segment translation server cache, the translation server is operable to use the piece of language model data for translation without generating the request for the piece of language model data.

10. The system of claim 8, wherein the translation model is divided into a plurality of translation model partitions, each translation model partition being less than the entire translation model and being stored on a different translation model server of a plurality of translation model servers, and the respective translation model partitions together constituting the entire translation model.

11. The system of claim 8, wherein language model is divided into a plurality of language model partitions, each language model partition being less than the entire language model and the respective language model partitions together constituting the entire language model.

12. The system of claim 8, wherein the translation server is further operable to: obtain the translation model data from the translation model based on the segment; translate the segment into a set of possible translations based on the translation model data; obtain the language model data from the language model based on the set of possible translations, the language model data matching at least one token in at least one possible translation of the set of possible translations; and determine a translation of the segment based on the obtained language model data and the set of possible translations.

13. A system comprising: a translation server operable to perform machine translation obtaining translation model data from a translation model for translation between a source language and a target language and language model data from a language model for the target language, the translation server further operable to translate text in the source language into the target language using the obtained translation model data and language model data, the translation server comprising: a request queue operable to store requests for language model data to be obtained for translating a segment in the source language, a segment translation server cache operable to store language model data obtained by the requests by the translation server; and a second segment translation server cache storing a selected portion of the language model, wherein the translation server is operable to: periodically del
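
The flow recited in claims 1, 8 and 9 can be illustrated with a small sketch. This is one possible reading of the claim language, not the patented implementation: every class and function name below (`PartitionedLanguageModel`, `SegmentTranslationServer`, `bigrams`), the bigram scoring, the CRC32-based partition routing, the floor score, and the two-candidate shortlist are assumptions made for illustration. Candidate translations are passed in directly, standing in for the output of the translation model described in claim 12.

```python
from collections import deque
import zlib

FLOOR = -10.0  # assumed score for n-grams a model has never seen


def bigrams(tokens):
    """Return the adjacent token pairs of a candidate translation."""
    return list(zip(tokens, tokens[1:]))


class PartitionedLanguageModel:
    """Hypothetical large LM split into partitions (cf. claims 7 and 11)."""

    def __init__(self, scores, num_partitions=2):
        self.partitions = [{} for _ in range(num_partitions)]
        for ngram, score in scores.items():
            self.partitions[self._shard(ngram)][ngram] = score
        self.lookups = 0  # counts simulated remote lookups

    def _shard(self, ngram):
        # Assumed routing rule: a stable hash of the n-gram picks the partition.
        return zlib.crc32(" ".join(ngram).encode("utf-8")) % len(self.partitions)

    def fetch(self, ngrams):
        """Serve a batch of requests, one simulated lookup per n-gram."""
        self.lookups += len(ngrams)
        return {ng: self.partitions[self._shard(ng)].get(ng, FLOOR) for ng in ngrams}


class SegmentTranslationServer:
    def __init__(self, large_lm, small_lm, pinned, keep=2):
        self.large_lm = large_lm
        self.small_lm = small_lm              # second language model, held locally
        self.persistent_cache = dict(pinned)  # second cache: selected portion of the LM
        self.segment_cache = {}               # first cache: cleared after each segment
        self.request_queue = deque()
        self.keep = keep                      # assumed shortlist size

    @staticmethod
    def _score(tokens, table):
        return sum(table.get(bg, FLOOR) for bg in bigrams(tokens))

    def translate(self, candidates):
        # Queue a request for every n-gram a candidate might need, unless the
        # second cache already holds it (claim 9).
        for cand in candidates:
            for bg in bigrams(cand):
                if bg not in self.persistent_cache and bg not in self.request_queue:
                    self.request_queue.append(bg)

        # First pass: rank with the small local model before any request is sent.
        ranked = sorted(candidates, key=lambda c: self._score(c, self.small_lm),
                        reverse=True)
        shortlist = ranked[:self.keep]

        # Update the queue based on the initial translation: drop requests that
        # the surviving candidates no longer need.
        needed = {bg for cand in shortlist for bg in bigrams(cand)}
        self.request_queue = deque(bg for bg in self.request_queue if bg in needed)

        # Send the updated requests as one batch and store the replies.
        batch = list(self.request_queue)
        self.request_queue.clear()
        self.segment_cache.update(self.large_lm.fetch(batch))

        # Second pass: rescore the shortlist with the large-model data.
        table = {**self.persistent_cache, **self.segment_cache}
        final = max(shortlist, key=lambda c: self._score(c, table))

        # Delete per-segment data, retain the second cache (claim 8).
        self.segment_cache.clear()
        return final


large_lm = PartitionedLanguageModel({
    ("the", "cat"): -1.0, ("cat", "sat"): -1.0,
    ("the", "feline"): -4.0, ("feline", "sat"): -3.0,
})
small_lm = {
    ("the", "cat"): -2.0, ("cat", "sat"): -2.5,
    ("the", "feline"): -1.5, ("feline", "sat"): -2.0,
}
server = SegmentTranslationServer(large_lm, small_lm, pinned={("the", "cat"): -1.0})
candidates = [("the", "cat", "sat"), ("the", "feline", "sat"), ("cat", "the", "sat")]
final = server.translate(candidates)
print(" ".join(final), "| remote lookups:", large_lm.lookups)
```

In this toy run the small model prefers one candidate in the first pass, and the large-model data fetched for the shortlist changes the ranking in the second pass. Pruning the queue before sending means the discarded candidate's n-grams are never requested, which is the saving the two-pass arrangement appears to be aiming at.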

Assignees

Google Inc

Inventors

Classifications

  • G06F40/58 · Primary

    Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

  • Machine-assisted translation, e.g. using translation memory · CPC title

  • G06F40/40 · Primary

    Processing or translation of natural language (natural language analysis G06F40/20; semantic analysis G06F40/30) · CPC title

  • using very large corpora, e.g. the web · CPC title

  • G06F40/44 · Primary

    Statistical methods, e.g. probability models · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9619465B2 cover?
Systems, methods, and apparatus for accessing distributed models in automated machine processing, including using large language models in machine translation, speech recognition and other applications.
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/58. Mapped technology areas include Physics.
When was this patent published?
Publication date: Apr 11, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).