Language identification from short strings

US2016357728A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2016357728-A1
Application number  US-201514845180-A
Country             US
Kind code           A1
Filing date         Sep 3, 2015
Priority date       Jun 4, 2015
Publication date    Dec 8, 2016
Grant date          (not listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and processes for language identification from short strings are provided. In accordance with one example, a method includes, at a first electronic device with one or more processors and memory, receiving user input including an n-gram and determining a similarity between a representation of the n-gram and a representation of a first language. The representation of the first language is based on an occurrence of each of a plurality of n-grams in the first language and an occurrence of each of the plurality of n-grams in a second language. The method further includes determining whether the similarity between the representation of the n-gram and the representation of the first language satisfies a threshold.
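The abstract's three steps (represent the input n-gram, compare it with a language representation built from n-gram occurrences in two languages, test the result against a threshold) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the toy corpora, the character-trigram choice, the weighting formula, the use of cosine similarity, the threshold value, and all function names.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text, n=3):
    """Return the character n-grams of text, padded with one space on each side."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def language_vectors(corpora, n=3):
    """Build one sparse vector per language.

    Assumed weighting: an n-gram's relative frequency in the language,
    multiplied by the share of its occurrences that fall in that language.
    Each language vector therefore depends on counts in every language.
    """
    counts = {
        lang: Counter(g for line in lines for g in char_ngrams(line, n))
        for lang, lines in corpora.items()
    }
    totals = Counter()
    for c in counts.values():
        totals.update(c)
    vectors = {}
    for lang, c in counts.items():
        size = sum(c.values())
        vectors[lang] = {g: (k / size) * (k / totals[g]) for g, k in c.items()}
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(g, 0.0) for g, w in a.items())
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity(text, lang_vector, n=3):
    """Similarity between the input's n-gram counts and a language vector."""
    return cosine(Counter(char_ngrams(text, n)), lang_vector)


# Toy corpora, far too small for real use (hypothetical data).
CORPORA = {
    "en": ["the quick brown fox", "this is the house", "where is the train"],
    "de": ["der schnelle braune fuchs", "das ist das haus", "wo ist der zug"],
}
THRESHOLD = 0.2  # hypothetical value

vectors = language_vectors(CORPORA)
score = similarity("the", vectors["en"])
print("en similarity:", score, "meets threshold:", score >= THRESHOLD)
```

With corpora this small the scores are only indicative; the point is that the English vector is shaped by German counts as well, which is the property the abstract describes.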

First claim

Opening claim text (preview).

What is claimed is:

1. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to: receive user input including an n-gram; determine a similarity between a representation of the n-gram and a representation of a first language, wherein the representation of the first language is based on an occurrence of each of a plurality of n-grams in the first language and an occurrence of each of the plurality of n-grams in a second language; and determine whether the similarity between the representation of the n-gram and the representation of the first language satisfies a threshold.

2. The non-transitory computer-readable storage medium of claim 1, wherein determining the similarity between the representation of the n-gram and the representation of the first language includes determining a proximity of a spatial representation of the n-gram to a spatial representation of the first language.

3. The non-transitory computer-readable storage medium of claim 1, wherein the representation of the first language is based on a respective indexing power of each of the plurality of n-grams.

4. The non-transitory computer-readable storage medium of claim 1, wherein the instructions, which when executed by one or more processors of an electronic device, further cause the electronic device to: in accordance with a determination that the similarity between the representation of the n-gram and the representation of the first language satisfies the threshold, select the first language.

5. The non-transitory computer-readable storage medium of claim 1, wherein the instructions, which when executed by one or more processors of an electronic device, further cause the electronic device to: in accordance with a determination that the similarity between the representation of the n-gram and the representation of the first language does not satisfy the threshold, display a first affordance that, when activated, causes the first language to be selected; and display a second affordance that, when activated, causes the second language to be selected.

6. The non-transitory computer-readable storage medium of claim 5, wherein the instructions, which when executed by one or more processors of an electronic device, further cause the electronic device to: in response to the selection of the second affordance, adjust the representation of the second language.

7. The non-transitory computer-readable storage medium of claim 1, wherein the instructions, which when executed by one or more processors of an electronic device, further cause the electronic device to: determine whether the similarity between the representation of the n-gram and the representation of the first language is greater than a similarity between the representation of the n-gram and a representation of the second language.

8. The non-transitory computer-readable storage medium of claim 1, wherein the n-gram is a first n-gram and wherein the instructions, which when executed by one or more processors of an electronic device, further cause the electronic device to: receive user input including a second n-gram; and select the second language based on the second n-gram.

9. The non-transitory computer-readable storage medium of claim 1, wherein the n-gram is a character n-gram.

10. A method, comprising: at an electronic device with one or more processors and memory: receiving user input including an n-gram; determining a similarity between a representation of the n-gram and a representation of a first language, wherein the representation of the first language is based on an occurrence of each of a plurality of n-grams in the first language and an occurrence of each of the plurality of n-grams in a second language; and determining whether the similarity between the representation of the n-gram and the representation of the first language satisfies a threshold.

11. An electronic device, comprising: one or more processors; a memory; and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for: receiving user input including an n-gram; determining a similarity between a representation of the n-gram and a representation of a first language, wherein the representation of the first language is based on an occurrence of each of a plurality of n-grams in the first language and an occurrence of each of the plurality of n-grams in a second language; and determining whether the similarity between the representation of the n-gram and the representation of the first language satisfies a threshold.
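The decision flow in dependent claims 4 to 7 can be read as: rank the languages by similarity, select the best one automatically when it meets the threshold, otherwise offer the user a choice and adjust the chosen language's representation. The sketch below is one possible reading under stated assumptions. The names `Decision`, `decide`, and `adjust`, the additive update rule, and the `rate` parameter are hypothetical; the claims do not specify how the representation is adjusted.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Optional


@dataclass
class Decision:
    selected: Optional[str]  # language chosen automatically, if any
    choices: List[str]       # languages to offer the user otherwise


def decide(similarities: Dict[str, float], threshold: float) -> Decision:
    """Select the most similar language, or fall back to asking the user."""
    if not similarities:
        raise ValueError("at least one language similarity is required")
    # Compare similarities across languages (in the spirit of claim 7).
    ranked = sorted(similarities, key=similarities.get, reverse=True)
    best = ranked[0]
    if similarities[best] >= threshold:
        # Threshold satisfied: select the language directly (claim 4).
        return Decision(selected=best, choices=[])
    # Threshold not satisfied: offer the top two as user choices (claim 5).
    return Decision(selected=None, choices=ranked[:2])


def adjust(lang_vector: Dict[str, float], ngrams: Iterable[str],
           rate: float = 0.1) -> Dict[str, float]:
    """Return a copy of the vector with the confirmed n-grams boosted.

    Hypothetical update rule standing in for the adjustment in claim 6.
    """
    updated = dict(lang_vector)
    for g in ngrams:
        updated[g] = updated.get(g, 0.0) + rate
    return updated


confident = decide({"en": 0.8, "de": 0.1}, threshold=0.5)
unsure = decide({"en": 0.3, "de": 0.25}, threshold=0.5)
print(confident)
print(unsure)
```

In a user interface, each entry of `choices` would correspond to one affordance, and `adjust` would be called with the input's n-grams for whichever language the user picks.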

Assignees

  • Apple Inc

Inventors

Classifications

  • G06F40/263 (Primary)

    Language identification · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • Semantic analysis · CPC title

  • Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

  • G06F17/275 (Primary)

    Physics · mapped topic

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016357728A1 cover?
Systems and processes for language identification from short strings are provided. In accordance with one example, a method includes, at a first electronic device with one or more processors and memory, receiving user input including an n-gram and determining a similarity between a representation of the n-gram and a representation of a first language. The representation of the first language is…
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/263. Mapped technology areas include Physics.
When was this patent published?
Publication date: Dec 8, 2016 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).