Who is the assignee on this patent?

Dolby Laboratories Licensing Corp

What technology area does this patent fall under?

Primary CPC classification G06F40/263. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Oct 08 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Translation with conversational overlap

US10437934B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10437934-B2
Application number	US-201715650561-A
Country	US
Kind code	B2
Filing date	Jul 14, 2017
Priority date	Sep 27, 2016
Publication date	Oct 8, 2019
Grant date	Oct 8, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A plurality of utterances of a first user from the language of the first user is translated into a language of a second user. The confidence scores associated with the translated utterances are compared with a confidence threshold. A predetermined utterance gap is adjusted based on the comparison. The predetermined utterance gap is a duration of time that occurs between utterances.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving a plurality of utterances of a first person in a first language; detecting an utterance gap between sequential utterances of the plurality of utterances; determining, prior to translating an utterance, whether the utterance will be translated by comparing the utterance gap after the utterance is completed to a threshold utterance gap and, if it is determined that the utterance will be translated: translating the utterance from the first language to a second language to produce a translated utterance; determining a translation confidence score for the translated utterance; determining whether the confidence score is greater than or equal to a confidence level; determining, based on whether the confidence score is greater than or equal to the confidence level, whether the confidence score is great enough to output the translated utterance; and determining accrued translation confidence scores for a plurality of utterances, wherein the threshold utterance gap is increased if a percentage of the accrued translation confidence scores is less than a confidence threshold. 2. The method of claim 1 , wherein the plurality of utterances include a first utterance of the first person and a second utterance of the first person, wherein the first utterance of the first person and the second utterance of the first person are received at a device associated with second person. 3. The method of claim 2 , wherein the first utterance of the first person and the second utterance of the first person are data associated with spoken utterances transmitted from a device associated with the first person. 4. The method of claim 2 , wherein the first utterance of the first person and the second utterance of the first person are spoken utterances of the first person. 5. The method of claim 1 , wherein the threshold utterance gap is less than a turn threshold duration. 6. The method of claim 1 , further comprising outputting the translated utterance at a device associated with a second person. 7. The method of claim 6 , wherein the device associated with the second person includes a pair of earbuds. 8. The method of claim 7 , wherein the pair of earbuds are configured to occlude a direct sound path associated with the plurality of utterances of the first person by attenuating the plurality of utterances. 9. The method of claim 8 , wherein an amount of attenuation of the plurality of utterances is adjustable. 10. The method of claim 6 , wherein the translated utterance is outputted to appear to come from a predetermined spatial location. 11. The method of claim 6 , wherein the translated utterance is outputted to appear to come from a spatial location of the first person. 12. The method of claim 1 , wherein the threshold utterance gap is adjustable based at least in part on a speech pattern of the first person. 13. The method of claim 1 , wherein the threshold utterance gap is adjustable based at least in part on a cadence of the first person's speech. 14. A system, comprising: a processor configured for: receiving a plurality of utterances of a first person in a first language; detecting an utterance gap between sequential utterances of the plurality of utterances; determining, prior to translating an utterance, whether the utterance will be translated by comparing the utterance gap after the utterance is completed to a threshold utterance gap and, if the processor determines that the utterance will be translated: translating the utterance from the first language to a second language to produce a translated utterance; determining a translation confidence score for the translated utterance; determining whether the confidence score is greater than or equal to a confidence level; determining, based on whether the confidence score is greater than or equal to the confidence level, whether the confidence score is great enough to output the translated utterance; and determining accrued translation confidence scores for a plurality of utterances, wherein the threshold utterance gap is decreased if a percentage of the accrued translation confidence scores is greater than or equal to a confidence threshold and wherein the confidence threshold corresponds with a percentage of translations that are accurate. 15. The system of claim 14 , wherein the threshold utterance gap is increased if a percentage of the accrued translation confidence scores is less than the confidence threshold. 16. The system of claim 14 , wherein the processor is further configured to output the translated plurality of utterances at a device associated with a second person. 17. A computer program product, the computer program product being embodied in a non-transitory computer readable storage medium and comprising computer instructions for: receiving a plurality of utterances of a first person in a first language; detecting an utterance gap between sequential utterances of the plurality of utterances; determining, prior to translating an utterance, whether the utterance will be translated by comparing the utterance gap after the utterance is completed to a threshold utterance gap and, if it is determined that the utterance will be translated: translating the utterance from the first language to a second language to produce a translated utterance; determining a translation confidence score for the translated utterance; determining whether the confidence score is greater than or equal to a confidence level; determining, based on whether the confidence score is greater than or equal to the confidence level, whether the confidence score is great enough to output the translated utterance; and determining accrued translation confidence scores for a plurality of utterances, wherein the threshold utterance gap is increased if a percentage of the accrued translation confidence scores is less than a confidence threshold. 18. The system of claim 14 , wherein the processor is further configured for determining whether a maximum condition is satisfied if the processor determines that the confidence score is not great enough to output the translated utterance. 19. The system of claim 18 , wherein the processor is further configured for combining the translated utterance with a subsequent utterance if the processor determines that the maximum condition is not satisfied.

Assignees

Dolby Laboratories Licensing Corp

Inventors

Classifications

G06F40/263Primary
Language identification · CPC title
G06F40/58
Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title
G06F40/47Primary
Machine-assisted translation, e.g. using translation memory · CPC title
H04R1/1083
Reduction of ambient noise (active noise reduction per se G10K11/175; protective devices for the ear, e.g. providing acoustic protection A61F11/06) · CPC title
H04R3/005
for combining the signals of two or more microphones (specially adapted for hearing aids H04R25/407) · CPC title

Patent family

Related publications grouped by family.

View patent family 59653533

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10437934B2 cover?: A plurality of utterances of a first user from the language of the first user is translated into a language of a second user. The confidence scores associated with the translated utterances are compared with a confidence threshold. A predetermined utterance gap is adjusted based on the comparison. The predetermined utterance gap is a duration of time that occurs between utterances.
Who is the assignee on this patent?: Dolby Laboratories Licensing Corp
What technology area does this patent fall under?: Primary CPC classification G06F40/263. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Oct 08 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Speech translation apparatus and method

Real time multi-language voice translation

In-Call Translation

System and method for translating real-time speech using segmentation based on conjunction locations

Frequently asked questions