What technology area does this patent fall under?

Primary CPC classification G10L13/08. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Jan 01 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Voice continuation over network with audio quality degradation

US2026004768A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2026004768-A1
Application number	US-202418757866-A
Country	US
Kind code	A1
Filing date	Jun 28, 2024
Priority date	Jun 28, 2024
Publication date	Jan 1, 2026
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for voice continuation over a network with audio quality degradation according to an embodiment includes receiving, by a first computing device, a user’s voice audio captured by a second computing device, receiving, by the first computing device, text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text, determining, by the first computing device, a quality of the user’s voice audio, and performing, by the first computing device, voice restitution to generate a cloned user voice audio speaking the text corresponding with the user’s voice audio based on one or more voice model parameters of the user’s voice in response to determining that the quality of the user’s voice audio is degraded.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for voice continuation over a network with audio quality degradation, the method comprising: receiving, by a first computing device, a user’s voice audio captured by a second computing device; receiving, by the first computing device, text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text; determining, by the first computing device, a quality of the user’s voice audio; and performing, by the first computing device, voice restitution to generate a cloned user voice audio speaking the text corresponding with the user’s voice audio based on one or more voice model parameters of the user’s voice in response to determining that the quality of the user’s voice audio is degraded. 2 . The method of claim 1 , wherein determining the quality of the user’s voice audio comprises determining a bandwidth of a network connection between the first computing device and the second computing device. 3 . The method of claim 2 , wherein determining that the quality of the user’s voice audio is degraded comprises determining the bandwidth of the network connection between the first computing device and the second computing device is below a predefined threshold. 4 . The method of claim 1 , wherein determining the quality of the user’s voice audio comprises determining a latency of a network connection between the first computing device and the second computing device. 5 . The method of claim 4 , wherein determining that the quality of the user’s voice audio is degraded comprises determining the latency of the network connection between the first computing device and the second computing device is above a predefined threshold. 6 . The method of claim 1 , further comprising: receiving, by the second computing device, the user’s voice audio; transforming, by the second computing device, the user’s voice audio into the text corresponding with the user’s voice audio using automatic speech recognition; and transmitting, by the second computing device, the user’s voice audio and the text corresponding with the user’s voice audio to the first computing device. 7 . The method of claim 1 , further comprising: generating, by the second computing system, the one or more voice model parameters of the user’s voice based on an initial user’s voice audio captured by the second computing system; and transmitting, by the second computing system, the one or more voice model parameters to the first computing system. 8 . The method of claim 7 , wherein the user’s voice audio captured by the second computing device and received by the first computing device and the initial user’s voice audio captured by the second computing system occur in a same conversation between a user of the first computing device and a user of the second computing device. 9 . The method of claim 7 , further comprising configuring, by the first computing device, a voice restitution system based on the one or more voice model parameters. 10 . The method of claim 1 , further comprising playing the user’s voice audio on the first computing device in response to determining that the quality of the user’s voice audio is not degraded. 11 . A system for voice continuation over a network with audio quality degradation, the system comprising: a first computing device comprising at least one first processor and at least one first memory comprising a first plurality of instructions stored thereon; and a second computing device comprising at least one second processor and at least one second memory comprising a second plurality of instructions stored thereon; wherein the first plurality of instructions, in response to execution by the at least one first processor, causes the first computing system to: receive a user’s voice audio captured by the second computing device; receive text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text; determine a quality of the user’s voice audio; and perform voice restitution to generate a cloned user voice audio speaking the text corresponding with the user’s voice audio based on one or more voice model parameters of the user’s voice in response to a determination that the quality of the user’s voice audio is degraded. 12 . The system of claim 11 , wherein to determine the quality of the user’s voice audio comprises to determine a bandwidth of a network connection between the first computing device and the second computing device. 13 . The system of claim 12 , wherein the determination that the quality of the user’s voice audio is degraded comprises a determination that the bandwidth of the network connection between the first computing device and the second computing device is below a predefined threshold. 14 . The system of claim 11 , wherein to determine the quality of the user’s voice audio comprises to determine a latency of a network connection between the first computing device and the second computing device. 15 . The system of claim 14 , wherein the determination that the quality of the user’s voice audio is degraded comprises a determination that the latency of the network connection between the first computing device and the second computing device is above a predefined threshold. 16 . The system of claim 11 , wherein the second plurality of instructions, in response to execution by the at least one second processor, causes the second computing system to: receive the user’s voice audio; transform the user’s voice audio into the text corresponding with the user’s voice audio using automatic speech recognition; and transmit the user’s voice audio and the text corresponding with the user’s voice audio to the first computing device. 17 . The system of claim 11 , wherein the second plurality of instructions, in response to execution by the at least one second processor, causes the second computing system to: generate the one or more voice model parameters of the user’s voice based on an initial user’s voice audio captured by the second computing system; and transmit the one or more voice model parameters to the first computing system. 18 . The system of claim 17 , wherein the user’s voice audio captured by the second computing device and received by the first computing device and the initial user’s voice audio captured by the second computing system occur in a same conversation between a user of the first computing device and a user of the second computing device. 19 . The system of claim 17 , wherein the first plurality of instructions, in response to execution by the at least one first processor, causes the first computing system to configure a voice restitution system based on the one or more voice model parameters. 20 . The system of claim 11 , wherein the first plurality of instructions, in response to execution by the at least one first processor, causes the first computing system to play the user’s voice audio in response to a determination that the quality of the user’s voice audio is not degraded.

Assignees

Genesys Cloud Services Inc

Inventors

Classifications

G10L15/26
Speech to text systems (G10L15/08 takes precedence) · CPC title
G10L13/08Primary
Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination · CPC title
H04M7/0057
Services where the data services network provides a telephone service in addition or as an alternative, e.g. for backup purposes, to the telephone service provided by the telephone services network · CPC title
H04M2201/39
using speech synthesis · CPC title
H04M2201/40
using speech recognition · CPC title

Patent family

Related publications grouped by family.

View patent family 96703500

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2026004768A1 cover?: A method for voice continuation over a network with audio quality degradation according to an embodiment includes receiving, by a first computing device, a user’s voice audio captured by a second computing device, receiving, by the first computing device, text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text, determining, by the first comput…
Who is the assignee on this patent?: Genesys Cloud Services Inc
What technology area does this patent fall under?: Primary CPC classification G10L13/08. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Jan 01 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

System and method for contextual analysis and metadata database generation for user-specific speech patterns

Caption assisted calling to maintain connection in challenging network conditions

Voice changer

Frequently asked questions