Delay in video for language translation
US-2015046146-A1 · Feb 12, 2015 · US
US2016170970A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016170970-A1 |
| Application number | US-201414569343-A |
| Country | US |
| Kind code | A1 |
| Filing date | Dec 12, 2014 |
| Priority date | Dec 12, 2014 |
| Publication date | Jun 16, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
There is provided an apparatus comprising at least one processor and a memory comprising code that, when executed on the at least one processor, causes the apparatus to receive an input user setting relating to relative volumes of the speech data in a preferred language and speech data in a non-preferred language when the speech data is played-out; and cause play-out of received speech data so that the volume of the played-out speech data is set in dependence on the user input and whether the received speech data is in the preferred language or the non-preferred language.
Opening claim text (preview).
1 . An apparatus comprising: at least one processor; and a memory comprising code that, when executed on the at least one processor, causes the apparatus to: receive an input user setting relating to relative volumes of speech data in a preferred language and speech data in a non-preferred language when speech data is played out; and cause play-out of received speech data so that the volume of the played-out speech data is set in dependence on the user input and whether the received speech data is in the preferred language or the non-preferred language. 2 . An apparatus as claimed in claim 1 , wherein the memory further comprises code that, when executed on the at least one processor, causes the apparatus to: determine that received speech data being played-out to the user comprises speech data in the preferred language and speech data in the non-preferred language; and in response to the determination, automatically adjust the volume of the played-out speech data to output the speech data in the preferred language and the speech data in the non-preferred language to a user at different volumes. 3 . An apparatus comprising: at least one processor; and a memory comprising code that, when executed on the at least one processor, causes the apparatus to: cause play-out of received speech data in a preferred language and received speech data in a non-preferred language to a user simultaneously; determine that speech data in the preferred language and the speech data in the non-preferred language are being played-out to the user simultaneously; and in response to the determination, automatically adjust the relative volumes of the played-out speech data to output the speech data in the preferred language and the speech data in the non-preferred language to a user at different volumes. 4 . An apparatus as claimed in claim 3 , wherein the memory further comprises code that, when executed on the at least one processor, causes the apparatus to: receive an input user setting on the apparatus relating to relative volumes of speech data in a preferred language and speech data in a non-preferred when speech data is played out; wherein the adjustment to the volume of the played-out speech data to output the speech data in the preferred language and the speech data in the non-preferred language to a user at different volumes is dependent on the user setting. 5 . An apparatus as claimed in claim 1 , wherein the apparatus is a user device operatively connected to at least one speaker, and wherein the play-out of the speech data is effected through the at least one speaker. 6 . An apparatus as claimed in claim 5 when dependent on any of claim 1 , wherein the memory further comprises code that, when executed on the at least one processor, causes the apparatus to: cause play-out the speech data in the preferred language at a higher volume than the speech data in the non-preferred language. 7 . An apparatus as claimed in claim 1 , wherein the memory further comprises code that, when executed on the at least one processor, causes the apparatus to: receive speech data in the preferred language and speech data in the non-preferred language in the same audio stream. 8 . An apparatus as claimed in claim 1 , wherein the apparatus is a server located remotely from a source of the speech data, and wherein the memory further comprises code that, when executed on the at least one processor, causes the apparatus to: receive an indication of a preferred language of a recipient of the speech data; and cause the speech data to be translated into the preferred language, thereby forming the speech data in a preferred language. 9 . An apparatus as claimed in claim 8 , wherein the memory further comprises code that, when executed on the at least one processor, causes the apparatus to: transmit at least the translated speech data to an originator of the speech data with an indication of the language of the translated speech data. 10 . An apparatus as claimed in claim 8 , wherein the memory further comprises code that, when executed on the at least one processor, causes the apparatus to: transmit to said recipient the translated speech data with an indication of the language of the translated speech data; and transmit to said recipient the speech data with an indication of the language of the speech data. 11 . An apparatus as claimed in claim 1 , wherein the speech data is real-time audio data originating during a voice call and/or a video call. 12 . A method comprising: receiving an input user setting relating to relative volumes of speech data in a preferred language and speech data in a non-preferred language when speech data is played out; and causing play-out of received speech data so that the volume of the played-out speech data is set in dependence on the user input and whether the received speech data is in the preferred language or the non-preferred language. 13 . A method as claimed in claim 12 , further comprising: determining that received speech data being played-out to the user comprises speech data in a preferred language and speech data in a non-preferred language; and in response to the determining, automatically adjusting the volume of the played-out speech data to output the speech data in the preferred language and the speech data in the non-preferred language to a user at different volumes. 14 . A method as claimed in claim 12 , further comprising effectuating the play-out of the speech data through at least one speaker. 15 . A method as claimed in claim 12 , further comprising: causing play-out the speech data in the preferred language at a higher volume than the speech data in the non-preferred language. 16 . A method as claimed in claim 12 , further comprising: receiving speech data from a microphone operatively connected to the apparatus; transmitting the speech data to a remote server; receiving a translation of the speech data in a non-preferred language from the remote server; and causing play-out the received speech data at a volume associated with the non-preferred language. 17 . A method as claimed in claim 12 , further comprising: receiving an indication of a preferred language of a recipient of the speech data; and causing the speech data to be translated into the preferred language, thereby forming the speech data in a preferred language. 18 . A method as claimed in claim 17 , further comprising: transmitting at least the translated speech data to an originator of the received speech data with an indication of the language of the translated speech data. 19 . A method as claimed in claim 18 , wherein the speech data and the translated speech data are received in the same audio stream. 20 . A method as claimed in claim 12 , wherein the speech data is real-time audio data originating during a voice call and/or a video conference.
Speech to text systems (G10L15/08 takes precedence) · CPC title
Changing voice quality, e.g. pitch or formants · CPC title
Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title
Voice editing, e.g. manipulating the voice of the synthesiser · CPC title
Systems providing special services or facilities to subscribers (specially adapted for wireless communication networks H04W4/00) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.