What technology area does this patent fall under?

Primary CPC classification G10L15/30. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Mar 16 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).

Source-based automatic speech recognition

US10950239B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10950239-B2
Application number	US-201514920021-A
Country	US
Kind code	B2
Filing date	Oct 22, 2015
Priority date	Oct 22, 2015
Publication date	Mar 16, 2021
Grant date	Mar 16, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Recognizing a user's speech is a computationally demanding task. If a user calls a destination server, little may be known about the user or the user's speech profile. The user's source system (device and/or server) may have an extensive profile of the user. As provided herein, a source device may provide translated text and/or speech attributes to a destination server. As a benefit, the recognition algorithm may be well tuned to the user and provide the recognized content to the destination. Additionally, the destination may provide domain attributes to allow the source recognition engine to better recognize the spoken content.

First claim

Opening claim text (preview).

What is claimed is: 1. A destination server, comprising: a network interface to a communications network; a microprocessor having access to the network interface; and the microprocessor that, via the network interface, engages in a call with a source server, the call comprising a voice channel comprising a spoken portion provided by a source user and a data channel comprising a machine-readable cue of the spoken portion and wherein the machine-readable cue comprises a speech attribute of the source user; wherein the microprocessor executes a speech recognition algorithm to recognize the spoken portion and wherein the speech recognition algorithm is seeded with the machine-readable cue; and wherein the microprocessor executes instructions in accordance with the microprocessor recognized spoken portion on the voice channel and the machine-readable cue received on the data channel. 2. The destination server of claim 1 , wherein the microprocessor receives indicia of source-side speech recognition. 3. The destination server of claim 2 , wherein the microprocessor, in response to receiving the indicia of source-side speech recognition, replies via the data channel with a domain attribute associated with the destination server. 4. The destination server of claim 1 , wherein the microprocessor executes a speech recognition algorithm utilizing an acoustic model, selected in accordance with the speech attribute of the source user in the machine-readable cue, and derives machine-readable content from a waveform portion of the call. 5. The destination server of claim 1 , wherein the machine-readable cue further comprises human-readable text of a machine-readable recognition of the spoken portion. 6. The destination server of claim 1 , wherein the data channel comprises a Real-Time Transport Protocol (RTP) text stream.

Assignees

Avaya Inc

Inventors

Classifications

G10L2015/226
using non-speech characteristics · CPC title
G10L15/32
Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems · CPC title
G10L15/30Primary
Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title
G10L15/22
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
G06F40/58
Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation · CPC title

Patent family

Related publications grouped by family.

View patent family 58558806

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10950239B2 cover?: Recognizing a user's speech is a computationally demanding task. If a user calls a destination server, little may be known about the user or the user's speech profile. The user's source system (device and/or server) may have an extensive profile of the user. As provided herein, a source device may provide translated text and/or speech attributes to a destination server. As a benefit, the recogn…
Who is the assignee on this patent?: Avaya Inc
What technology area does this patent fall under?: Primary CPC classification G10L15/30. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Mar 16 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).