Spell correction with hidden markov models on online social networks
US-2016299883-A1 · Oct 13, 2016 · US
US9785630B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9785630-B2 |
| Application number | US-201514724641-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 28, 2015 |
| Priority date | May 30, 2014 |
| Publication date | Oct 10, 2017 |
| Grant date | Oct 10, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Systems and processes are disclosed for predicting words in a text entry environment. Candidate words and probabilities associated therewith can be determined by combining a word n-gram language model and a unigram language model. Using the word n-gram language model, based on previously entered words, candidate words can be identified and a probability can be calculated for each candidate word. Using the unigram language model, based on a character entered for a new word, candidate words beginning with the character can be identified along with a probability for each candidate word. In some examples, a geometry score can be included in the unigram probability related to typing geometry on a virtual keyboard. The probabilities of the n-gram language model and unigram model can be combined, and the candidate word or words having the highest probability can be displayed for a user.
Opening claim text (preview).
What is claimed is: 1. A method for predicting words, the method comprising: at an electronic device: receiving typed input from a user, wherein the typed input comprises a character associated with a new word; determining, using a word n-gram model, a first probability of a predicted word based on a previously entered word in the typed input; determining, using a unigram model, a second probability of the predicted word based on the character associated with the new word in the typed input, wherein the second probability is determined based on a geometry score associated with the typed input; determining a combined probability of the predicted word based on the first probability and the second probability; and causing the predicted word to be displayed based on the combined probability. 2. The method of claim 1 , wherein determining the first probability comprises: determining a set of predicted words and associated probabilities based on the previously entered word in the typed input. 3. The method of claim 2 , further comprising: generating a subset of the set of predicted words by removing words from the set based on the character associated with the new word in the typed input. 4. The method of claim 3 , wherein the subset of predicted words comprises words having prefixes that comprise the character associated with the new word in the typed input; and wherein the subset of predicted words comprises the predicted word. 5. The method of claim 1 , wherein the geometry score is determined based on a key selection. 6. The method of claim 5 , wherein the geometry score comprises a likelihood of a sequence of characters given the key selection. 7. The method of claim 5 , wherein the key selection comprises key selection on a virtual keyboard. 8. The method of claim 1 , wherein determining the second probability comprises: determining a set of predicted words and associated probabilities based on the character associated with the new word in the typed input. 9. The method of claim 8 , wherein the set of predicted words comprises words having prefixes that comprise the character associated with the new word in the typed input; and wherein the set of predicted words comprises the predicted word. 10. The method of claim 1 , wherein determining the second probability comprises: traversing a unigram trie to determine the second probability. 11. The method of claim 1 , wherein determining the first probability comprises: determining the first probability based on a sequence of previously entered words. 12. The method of claim 1 , wherein determining the second probability comprises: determining the second probability based on a sequence of characters in the typed input associated with the new word. 13. The method of claim 1 , wherein determining the combined probability comprises: determining the product of the first probability and the second probability. 14. The method of claim 1 , wherein the electronic device comprises a phone, a desktop computer, a laptop computer, a tablet computer, a television, a television set top box, or a wearable electronic device. 15. A non-transitory computer-readable storage medium comprising instructions for causing one or more processors to: receive typed input from a user, wherein the typed input comprises a character associated with a new word; determine, using a word n-gram model, a first probability of a predicted word based on a previously entered word in the typed input; determine, using a unigram model, a second probability of the predicted word based on the character associated with the new word in the typed input, wherein the second probability is determined based on a geometry score associated with the typed input; determine a combined probability of the predicted word based on the first probability and the second probability; and cause the predicted word to be displayed based on the combined probability. 16. A system comprising: one or more processors; memory; one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the one or more programs including instructions for: receiving typed input from a user, wherein the typed input comprises a character associated with a new word; determining, using a word n-gram model, a first probability of a predicted word based on a previously entered word in the typed input; determining, using a unigram model, a second probability of the predicted word based on the character associated with the new word in the typed input, wherein the second probability is determined based on a geometry score associated with the typed input; determining a combined probability of the predicted word based on the first probability and the second probability; and causing the predicted word to be displayed based on the combined probability. 17. The computer-readable storage medium of claim 15 , wherein determining the first probability comprises: determining a set of predicted words and associated probabilities based on the previously entered word in the typed input. 18. The computer-readable storage medium of claim 17 , further comprising instructions for causing the one or more processors to: generate a subset of the set of predicted words by removing words from the set based on the character associated with the new word in the typed input. 19. The system of claim 16 , wherein determining the first probability comprises: determining a set of predicted words and associated probabilities based on the previously entered word in the typed input. 20. The system of claim 19 , wherein the one or more programs further include instructions for: generating a subset of the set of predicted words by removing words from the set based on the character associated with the new word in the typed input.
Converting codes to words; Guess-ahead of partial word inputs · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.