Method to synthesize personalized phonetic transcription

US9990916B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9990916-B2
Application numberUS-201615139141-A
CountryUS
Kind codeB2
Filing dateApr 26, 2016
Priority dateApr 26, 2016
Publication dateJun 5, 2018
Grant dateJun 5, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Technology related to improving synthesis of foreign regional nouns using personalized and culturally correct phonetic transcription is described. The technology includes systems and methods for generating personalized speech by receiving an input, the input including textual data, identifying a regional noun in the textual data, and determining a user accent classification based on a context of the input. The method may further include determining a personalized phonetic transcription of the regional noun corresponding to the user accent classification and using a phonetic inventory stored in a database and outputting the personalized phonetic transcription.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving an input, the input including textual data; identifying a regional noun in the textual data; determining a user accent classification based on a context of the input; accessing a phonetic inventory stored in a database, in which multiple phonetic transcriptions are stored for each of a plurality of regional nouns, including the regional noun; executing a search of the database using the user accent classification and the regional noun as search parameters; determining a personalized phonetic transcription of the regional noun, selected from the multiple phonetic transcriptions stored for the regional noun as a search result of executing the search of the phonetic inventory stored in the database; outputting the personalized phonetic transcription; and synthesizing audio output of the textual data, based on the personalized phonetic transcription. 2. The computer-implemented method of claim 1 , wherein determining the personalized phonetic transcription of the regional noun is based on speech patterns of a user. 3. The computer-implemented method of claim 1 , wherein identifying a regional noun in the textual data includes narrowing a set of regional nouns searched based on a geographic location in which a user device is located, the user device being a device via which the input is received. 4. The computer-implemented method of claim 1 , wherein the context of the input includes a vernacular of a user. 5. The computer-implemented method of claim 4 , further comprising determining the vernacular of the user based on an online presence of the user. 6. The computer-implemented method of claim 1 , wherein the textual data includes orthographic text. 7. A computer-implemented method comprising: receiving an audio input, the audio input including speech; identifying a regional noun in the speech; generating a phonetic transcription of the regional noun using the audio input; determining a user accent classification based on a context of the audio input; executing a search of a phonetic inventory stored in a database, using the generated phonetic transcription and the user accent classification as search parameters, wherein the phonetic inventory stores multiple phonetic transcriptions for each of a plurality of regional nouns, including the regional noun; returning the stored phonetic transcription of the regional noun as a search result of executing the search of the phonetic inventory stored in the database; translating the regional noun into textual data using the stored phonetic transcription; and outputting the textual data. 8. The computer-implemented method of claim 7 , wherein generating the phonetic transcription of the regional noun includes personalizing the phonetic transcription for a user based on speech patterns of the user. 9. The computer-implemented method of claim 7 , wherein identifying a regional noun in the textual data includes narrowing a set of regional nouns searched based on a geographic location in which a user device is located, wherein the user device is a device via which the audio input is received. 10. The computer-implemented method of claim 7 , wherein the textual data includes orthographic text. 11. The computer-implemented method of claim 7 , wherein the context of the input further includes a vernacular of a user. 12. The computer-implemented method of claim 11 , further comprising determining the vernacular based on an online presence of the user. 13. The computer-implemented method of claim 11 , wherein outputting the textual data includes outputting a localized spelling of orthographic text based on the vernacular of the user. 14. A system comprising: one or more processors; and a memory coupled to the one or more processors and storing instructions that, when executed by the one or more processors, cause the system to: receive a textual input including a word; determine a user accent classification based on a context of the textual input; accessing a phonetic inventory stored in a database, in which multiple phonetic transcriptions are stored for each of a plurality of words, including the word; executing a search of the database using the user accent classification and the regional noun as search parameters; determine a personalized phonetic transcription of the word, selected from the multiple phonetic transcriptions stored for the word as a search result of executing the search of the phonetic inventory stored in the database; output the personalized phonetic transcription; and synthesize audio output of the textual input, based on the personalized phonetic transcription. 15. The system of claim 14 , wherein to determine the personalized phonetic transcription of the word is based on speech patterns of a user. 16. The system of claim 14 , wherein the instructions, when executed by the one or more processors, further cause the system to narrow a set of words in the phonetic inventory against which the personalized phonetic transcription is determined using a geographic location in which a user device is located, wherein the user device is a device via which the textual input is received. 17. The system of claim 14 , wherein the context of the textual input includes a vernacular of a user. 18. The system of claim 17 , wherein the instructions, when executed by the one or more processors, further cause the system to determine the vernacular of the user based on a current or past residence of the user. 19. The system of claim 14 , wherein the textual input includes orthographic text.

Assignees

Inventors

Classifications

  • Semantic analysis · CPC title

  • Language identification · CPC title

  • Dictionaries · CPC title

  • G10L15/07Primary

    to the speaker · CPC title

  • G10L13/08Primary

    Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9990916B2 cover?
Technology related to improving synthesis of foreign regional nouns using personalized and culturally correct phonetic transcription is described. The technology includes systems and methods for generating personalized speech by receiving an input, the input including textual data, identifying a regional noun in the textual data, and determining a user accent classification based on a context o…
Who is the assignee on this patent?
Adobe Systems Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/07. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 05 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).