Voice continuation over network with audio quality degradation

US2026004768A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2026004768-A1
Application numberUS-202418757866-A
CountryUS
Kind codeA1
Filing dateJun 28, 2024
Priority dateJun 28, 2024
Publication dateJan 1, 2026
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for voice continuation over a network with audio quality degradation according to an embodiment includes receiving, by a first computing device, a user’s voice audio captured by a second computing device, receiving, by the first computing device, text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text, determining, by the first computing device, a quality of the user’s voice audio, and performing, by the first computing device, voice restitution to generate a cloned user voice audio speaking the text corresponding with the user’s voice audio based on one or more voice model parameters of the user’s voice in response to determining that the quality of the user’s voice audio is degraded.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for voice continuation over a network with audio quality degradation, the method comprising: receiving, by a first computing device, a user’s voice audio captured by a second computing device; receiving, by the first computing device, text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text; determining, by the first computing device, a quality of the user’s voice audio; and performing, by the first computing device, voice restitution to generate a cloned user voice audio speaking the text corresponding with the user’s voice audio based on one or more voice model parameters of the user’s voice in response to determining that the quality of the user’s voice audio is degraded. 2 . The method of claim 1 , wherein determining the quality of the user’s voice audio comprises determining a bandwidth of a network connection between the first computing device and the second computing device. 3 . The method of claim 2 , wherein determining that the quality of the user’s voice audio is degraded comprises determining the bandwidth of the network connection between the first computing device and the second computing device is below a predefined threshold. 4 . The method of claim 1 , wherein determining the quality of the user’s voice audio comprises determining a latency of a network connection between the first computing device and the second computing device. 5 . The method of claim 4 , wherein determining that the quality of the user’s voice audio is degraded comprises determining the latency of the network connection between the first computing device and the second computing device is above a predefined threshold. 6 . The method of claim 1 , further comprising: receiving, by the second computing device, the user’s voice audio; transforming, by the second computing device, the user’s voice audio into the text corresponding with the user’s voice audio using automatic speech recognition; and transmitting, by the second computing device, the user’s voice audio and the text corresponding with the user’s voice audio to the first computing device. 7 . The method of claim 1 , further comprising: generating, by the second computing system, the one or more voice model parameters of the user’s voice based on an initial user’s voice audio captured by the second computing system; and transmitting, by the second computing system, the one or more voice model parameters to the first computing system. 8 . The method of claim 7 , wherein the user’s voice audio captured by the second computing device and received by the first computing device and the initial user’s voice audio captured by the second computing system occur in a same conversation between a user of the first computing device and a user of the second computing device. 9 . The method of claim 7 , further comprising configuring, by the first computing device, a voice restitution system based on the one or more voice model parameters. 10 . The method of claim 1 , further comprising playing the user’s voice audio on the first computing device in response to determining that the quality of the user’s voice audio is not degraded. 11 . A system for voice continuation over a network with audio quality degradation, the system comprising: a first computing device comprising at least one first processor and at least one first memory comprising a first plurality of instructions stored thereon; and a second computing device comprising at least one second processor and at least one second memory comprising a second plurality of instructions stored thereon; wherein the first plurality of instructions, in response to execution by the at least one first processor, causes the first computing system to: receive a user’s voice audio captured by the second computing device; receive text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text; determine a quality of the user’s voice audio; and perform voice restitution to generate a cloned user voice audio speaking the text corresponding with the user’s voice audio based on one or more voice model parameters of the user’s voice in response to a determination that the quality of the user’s voice audio is degraded. 12 . The system of claim 11 , wherein to determine the quality of the user’s voice audio comprises to determine a bandwidth of a network connection between the first computing device and the second computing device. 13 . The system of claim 12 , wherein the determination that the quality of the user’s voice audio is degraded comprises a determination that the bandwidth of the network connection between the first computing device and the second computing device is below a predefined threshold. 14 . The system of claim 11 , wherein to determine the quality of the user’s voice audio comprises to determine a latency of a network connection between the first computing device and the second computing device. 15 . The system of claim 14 , wherein the determination that the quality of the user’s voice audio is degraded comprises a determination that the latency of the network connection between the first computing device and the second computing device is above a predefined threshold. 16 . The system of claim 11 , wherein the second plurality of instructions, in response to execution by the at least one second processor, causes the second computing system to: receive the user’s voice audio; transform the user’s voice audio into the text corresponding with the user’s voice audio using automatic speech recognition; and transmit the user’s voice audio and the text corresponding with the user’s voice audio to the first computing device. 17 . The system of claim 11 , wherein the second plurality of instructions, in response to execution by the at least one second processor, causes the second computing system to: generate the one or more voice model parameters of the user’s voice based on an initial user’s voice audio captured by the second computing system; and transmit the one or more voice model parameters to the first computing system. 18 . The system of claim 17 , wherein the user’s voice audio captured by the second computing device and received by the first computing device and the initial user’s voice audio captured by the second computing system occur in a same conversation between a user of the first computing device and a user of the second computing device. 19 . The system of claim 17 , wherein the first plurality of instructions, in response to execution by the at least one first processor, causes the first computing system to configure a voice restitution system based on the one or more voice model parameters. 20 . The system of claim 11 , wherein the first plurality of instructions, in response to execution by the at least one first processor, causes the first computing system to play the user’s voice audio in response to a determination that the quality of the user’s voice audio is not degraded.

Assignees

Inventors

Classifications

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • G10L13/08Primary

    Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination · CPC title

  • Services where the data services network provides a telephone service in addition or as an alternative, e.g. for backup purposes, to the telephone service provided by the telephone services network · CPC title

  • using speech synthesis · CPC title

  • using speech recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2026004768A1 cover?
A method for voice continuation over a network with audio quality degradation according to an embodiment includes receiving, by a first computing device, a user’s voice audio captured by a second computing device, receiving, by the first computing device, text corresponding with the user’s voice audio, wherein the user’s voice audio is transformed into the text, determining, by the first comput…
Who is the assignee on this patent?
Genesys Cloud Services Inc
What technology area does this patent fall under?
Primary CPC classification G10L13/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jan 01 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).