Construction of a phonetic representation of a generated string of characters

US10102189B2 · US · B2

Patent metadata
Field: Value
Publication number: US-10102189-B2
Application number: US-201514977090-A
Country: US
Kind code: B2
Filing date: Dec 21, 2015
Priority date: Dec 21, 2015
Publication date: Oct 16, 2018
Grant date: Oct 16, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided are methods, devices, and computer-readable media for generating a string of characters based on a set of rules; parsing the string of characters into string of graphemes; determining one or more phonetic representations for one or more graphemes in the string of graphemes based on a first data structure; determining at least one grapheme representation for one or more of the one or more phonetic representations based on a second data structure; and constructing the phonetic representation of the string of characters based on the grapheme representation that was determined.
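The flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the two lookup tables (`FIRST`, `SECOND`) are toy dictionaries standing in for the first and second data structures, and every function name, the phoneme inventory, and the simple generation rule (fixed length, allowed characters) are assumptions made for the example.

```python
import random

# Toy stand-ins (assumptions, not from the patent):
# FIRST maps graphemes to IPA-like phonemes, SECOND maps phonemes back to a
# preferred grapheme spelling.
FIRST = {"ph": "f", "ck": "k", "c": "k", "k": "k",
         "a": "æ", "o": "ɒ", "i": "ɪ", "t": "t", "n": "n"}
SECOND = {"f": "f", "k": "k", "æ": "a", "ɒ": "o", "ɪ": "i", "t": "t", "n": "n"}


def generate_string(length, alphabet, rng):
    """Generate a string under a simple rule set: fixed length, allowed characters."""
    return "".join(rng.choice(alphabet) for _ in range(length))


def parse_graphemes(text):
    """Split text into graphemes with a greedy longest-match against FIRST."""
    graphemes, i = [], 0
    while i < len(text):
        for size in (2, 1):
            chunk = text[i:i + size]
            if chunk in FIRST:
                graphemes.append(chunk)
                i += len(chunk)
                break
        else:
            raise ValueError(f"no grapheme entry for {text[i]!r}")
    return graphemes


def to_phonemes(graphemes):
    """Look up one phonetic representation per grapheme (first data structure)."""
    return [FIRST[g] for g in graphemes]


def to_graphemes(phonemes):
    """Look up one grapheme representation per phoneme (second data structure)."""
    return [SECOND[p] for p in phonemes]


def phonetic_respelling(text):
    """Construct the phonetic representation as a respelled string."""
    return "".join(to_graphemes(to_phonemes(parse_graphemes(text))))


if __name__ == "__main__":
    print(phonetic_respelling("phonick"))  # fonik
    candidate = generate_string(6, "acikont", random.Random(0))
    print(candidate, "->", phonetic_respelling(candidate))
```

In this toy version each lookup returns a single value; the abstract allows one or more phonetic representations per grapheme, which would turn each lookup into a list of alternatives.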

First claim

Opening claim text (preview).

The invention claimed is:

1. A method, comprising: generating a string of characters based on a set of rules; parsing the string of characters into a first string of graphemes; adding one or more characters to the first string of graphemes to represent missing characters in the string of characters to create a second string of graphemes; grouping the second string of graphemes into a plurality of pseudo-graphemes, wherein two or more graphemes in the second string of graphemes that are phonetized together are grouped to a single pseudo-grapheme; accessing a first data structure that maps each pseudo-grapheme in the plurality of pseudo-graphemes to one or more universal phonetic representations based on an international phonetic alphabet, wherein the first data structure comprises a plurality of first nodes with each first node of the plurality of first nodes having a respective weight assigned that corresponds to a pronunciation of a first grapheme; determining one or more phonetic representations for each pseudo-grapheme in the plurality of pseudo-graphemes based on the first data structure; accessing a second data structure that maps the one or more universal phonetic representations to one or more graphemes in a third string of graphemes, wherein the second data structure comprises a plurality of second nodes with each second node of the plurality of second nodes having a respective weight assigned that corresponds to a likely representation of a second grapheme; determining at least one grapheme representation for one or more of the one or more phonetic representations based on the second data structure; constructing a second phonetic representation of the string of characters based on the at least one grapheme representation that was determined; providing the second phonetic representation to a domain name verifier to determine that the phonetic representation is available to be registered as a domain name; and providing an offer to a user to register the second phonetic representation with a domain name system.

2. The method of claim 1, further comprising: ranking each grapheme representation to produce a ranked list, wherein the ranking is based on a likelihood that a grapheme representation sounds similar to a pronunciation sound of the string of characters; and filtering the ranked list to produce a subset of grapheme representations.

3. The method of claim 2, further comprising determining a first composite weight for the one or more phonetic representations based on the first data structure.

4. The method of claim 2, further comprising determining a second composite weight for the one or more graphemes based on the second data structure.

5. The method of claim 4, wherein the filtering is based on the second composite weight.

6. The method of claim 1, further comprising creating the first data structure and the second data structure as information gain trees.

7. The method of claim 1, wherein the set of rules includes at least one of a length of the string of characters, at least one character in the string of characters, and a position of at least one character in the string of characters.

8. A device, comprising: a memory storing instructions; and at least one processor, operably connected to the memory, implemented at least in part in hardware, and configured to execute the instructions to perform operations comprising: generating a string of characters based on a set of rules; parsing the string of characters into a first string of graphemes; adding one or more characters to the first string of graphemes to represent missing characters in the string of characters to create a second string of graphemes; grouping the second string of graphemes into a plurality of pseudo-graphemes, wherein two or more graphemes in the second string of graphemes that are phonetized together are grouped to a single pseudo-grapheme; accessing a first data structure that maps each pseudo-grapheme in the plurality of pseudo-graphemes to one or more universal phonetic representations based on an international phonetic alphabet, wherein the first data structure comprises a plurality of first nodes with each first node of the plurality of first nodes having a respective weight assigned that corresponds to a likely pronunciation of a first grapheme; determining one or more phonetic representations for each pseudo-grapheme in the plurality of pseudo-graphemes based on the first data structure; accessing a second data structure that maps the one or more universal phonetic representations to one or more graphemes in a third string of graphemes, wherein the second data structure comprises a plurality of second nodes with each second node of the plurality of second nodes having a respective weight assigned that corresponds to a likely representation of a second grapheme; determining at least one grapheme representation for one or more of the one or more phonetic representation based on the second data structure; constructing a second phonetic representation of the string of characters based on the at least one grapheme representation that was determined; providing the second phonetic representation to a domain name verifier to determine that the phonetic representation is available to be registered as a domain name; and providing an offer to a user to register the second phonetic representation with a domain name system.

9. The device of claim 8, the operations further comprising: ranking each grapheme representation to produce a ranked list, wherein the ranking is based on a likelihood that a grapheme representation sounds similar to a pronunciation sound of the string of characters; and filtering the ranked list to produce a subset of grapheme representations.

10. The device of claim 8, the operations further comprising creating the first data structure and the second data structure as information gain trees.

11. The device of claim 8, the operations further comprising determining a first composite weight for the one or more phonetic representations based on the first data structure.

12. The device of claim 8, further comprising determining a second composite weight for the one or more graphemes based on the second data structure.

13. The device of claim 12, wherein the filtering is based on the second composite weight.

14. The device of claim 8, wherein the set of rules includes at least one of a length of the string of characters, at least one character in the string of characters, and a position of at least one character in the string of characters.

15. A non-transitory computer-readable medium comprising computer-interpretable instructions which, when executed by at least one electronic processor, cause the at least one electronic processor to perform a method of converting a string of characters into a phonetic representation, the method comprising: generating a string of characters based on a set of rules; parsing the string of characters into a first string of graphemes; adding one or more characters to the first string of graphemes to represent missing characters in the string of characters to create a second string of graphemes; grouping the second string of graphemes into a plurality of pseudo-graphemes, wherein two or more graphemes in the second string of graphemes that are phonetized together are grouped to a single pseudo-grapheme; accessing a first data structure that maps each pseudo-grapheme in the plurality of pseudo-graphemes to one or more universal phonetic representations based on an international phonetic alphabet, wherein the first data structure comprises a plu…
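The grouping, weighting, ranking, and filtering steps recited in the claims can be illustrated with a small sketch. Everything here is hypothetical: the weighted tables `G2P` and `P2G` are flat dictionaries standing in for the claimed node-based data structures, the weights are invented, and the composite weight is assumed to be the product of the weights along a path (the claims do not say how it is computed).

```python
from itertools import product
from math import prod

# Hypothetical weighted tables (assumptions for illustration only).
# G2P: pseudo-grapheme -> [(phoneme, weight)]; P2G: phoneme -> [(grapheme, weight)].
G2P = {"ph": [("f", 1.0)], "o": [("oʊ", 0.6), ("ɒ", 0.4)], "n": [("n", 1.0)]}
P2G = {"f": [("f", 0.7), ("ph", 0.3)], "oʊ": [("o", 0.8), ("oa", 0.2)],
       "ɒ": [("o", 1.0)], "n": [("n", 1.0)]}
MULTI = ("ph",)  # graphemes that are phonetized together


def group_pseudo_graphemes(graphemes):
    """Merge adjacent graphemes that are phonetized together into one pseudo-grapheme."""
    grouped, i = [], 0
    while i < len(graphemes):
        pair = "".join(graphemes[i:i + 2])
        if pair in MULTI:
            grouped.append(pair)
            i += 2
        else:
            grouped.append(graphemes[i])
            i += 1
    return grouped


def candidates(pseudo_graphemes):
    """Yield (spelling, composite_weight) for every phoneme/grapheme choice path."""
    per_slot = []
    for pg in pseudo_graphemes:
        # Combine the grapheme-to-phoneme weight with the phoneme-to-grapheme weight.
        slot = [(g, pw * gw) for ph, pw in G2P[pg] for g, gw in P2G[ph]]
        per_slot.append(slot)
    for combo in product(*per_slot):
        yield "".join(g for g, _ in combo), prod(w for _, w in combo)


def rank_and_filter(pseudo_graphemes, top_k=3):
    """Rank spellings by composite weight and keep the top_k as the filtered subset."""
    best = {}
    for spelling, weight in candidates(pseudo_graphemes):
        # Keep the highest-weight path for each distinct spelling (an assumption).
        best[spelling] = max(weight, best.get(spelling, 0.0))
    ranked = sorted(best.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_k]


if __name__ == "__main__":
    grouped = group_pseudo_graphemes(list("phon"))
    print(grouped)  # ['ph', 'o', 'n']
    for spelling, weight in rank_and_filter(grouped):
        print(f"{spelling}: {weight:.3f}")
```

With these invented weights the ranked spellings for "phon" come out as fon, phon, foan. In the claimed method, each surviving candidate would then be passed to a domain name verifier; that step is omitted here because it depends on an external service.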

Assignees

Verisign Inc

Inventors

Classifications

  • Character encoding · CPC title

  • Domain name generation or assignment · CPC title

  • Administrative registration, e.g. for domain names at internet corporation for assigned names and numbers [ICANN] · CPC title

  • G06F40/157 · Primary

    using dictionaries or tables · CPC title

  • Parsing · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10102189B2 cover?
Provided are methods, devices, and computer-readable media for generating a string of characters based on a set of rules; parsing the string of characters into string of graphemes; determining one or more phonetic representations for one or more graphemes in the string of graphemes based on a first data structure; determining at least one grapheme representation for one or more of the one or mo…
Who is the assignee on this patent?
Verisign Inc
What technology area does this patent fall under?
Primary CPC classification H04L61/3025. Mapped technology areas include Electricity.
When was this patent published?
Publication date Oct 16, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).