Text-driven editor for audio and video assembly
US-2022130427-A1 · Apr 28, 2022 · US
US12206930B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12206930-B2 |
| Application number | US-202318154412-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 13, 2023 |
| Priority date | Jan 13, 2023 |
| Publication date | Jan 21, 2025 |
| Grant date | Jan 21, 2025 |
Official abstract text for this publication.
Embodiments of the present disclosure provide a method, a system, and computer storage media that provide mechanisms for multimedia effect addition and editing support for text-based video editing tools. The method includes generating a user interface (UI) displaying a transcript of an audio track of a video and receiving, via the UI, input identifying selection of a text segment from the transcript. The method also includes receiving, via the UI, input identifying selection of a particular type of text stylization or layout for application to the text segment. In response, the method identifies a video effect corresponding to the particular type of text stylization or layout, applies the video effect to a video segment corresponding to the text segment, and applies the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript.
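The flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `Word`, `Project`, `STYLE_TO_EFFECTS`, `effect_options`, and `apply_style`, the effect names, and the per-word timestamps are all assumptions made for illustration; the source only says that a stylization maps to a video effect and that both are applied to the matching text and video segments.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Word:
    text: str
    start: float  # seconds into the audio track (assumed word-level timing)
    end: float


@dataclass
class AppliedEffect:
    effect: str
    start: float
    end: float


@dataclass
class Project:
    words: List[Word]
    # Stylization per word index, used when rendering the transcript.
    styles: Dict[int, str] = field(default_factory=dict)
    effects: List[AppliedEffect] = field(default_factory=list)


# Hypothetical mapping from a text stylization to candidate video effects.
# The effect names are invented for this example.
STYLE_TO_EFFECTS = {
    "bold": ["zoom_in", "punch_in"],
    "italic": ["slow_motion"],
    "underline": ["lower_third"],
}


def effect_options(style, mapping=STYLE_TO_EFFECTS):
    """Return the effect options to show in the UI for a stylization."""
    return list(mapping.get(style, []))


def apply_style(project, first, last, style, chosen, mapping=STYLE_TO_EFFECTS):
    """Apply `chosen` to the video span of words[first..last] and style the text."""
    if chosen not in effect_options(style, mapping):
        raise ValueError(f"{chosen!r} is not mapped to style {style!r}")
    # The video segment is the time span covered by the selected words.
    start = project.words[first].start
    end = project.words[last].end
    applied = AppliedEffect(chosen, start, end)
    project.effects.append(applied)
    # Style the transcript text so the effect is visible in the transcript.
    for i in range(first, last + 1):
        project.styles[i] = style
    return applied
```

Calling `apply_style(project, 1, 2, "bold", "zoom_in")` on a three-word transcript records one effect spanning the second and third words and marks both words bold. Passing a different `mapping` argument stands in for the configurable mapping.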
Opening claim text (preview).
What is claimed is:
1. One or more computer storage media storing computer-useable instructions that, when executed by one or more computing devices, cause the one or more computing devices to perform operations comprising: generating a user interface (UI) displaying a transcript of an audio track of a video; receiving, via the UI, input identifying selection of a text segment from the transcript; and in response to receiving, via the UI, input identifying selection of a particular type of text stylization or layout for application to the text segment: identifying a video effect corresponding to the particular type of text stylization or layout, wherein the video effect is mapped to the particular type of text stylization or layout based on a configurable mapping; in response to identifying the video effect, displaying, via the UI, video effect options; receiving, via the UI, input identifying selection of the video effect from the video effect options; applying the video effect to a video segment corresponding to the text segment; and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript.
2. The one or more computer storage media of claim 1, the operations further comprising: in response to receiving, via the UI, input identifying selection of a second type of text stylization or layout for application to the text segment: identifying a second video effect corresponding to the second type of text stylization or layout; applying the second video effect to the video segment corresponding to the text segment; and applying the second type of text stylization or layout to the text segment to visually represent the second video effect in the transcript.
3. The one or more computer storage media of claim 1, where identifying the video effect comprises: displaying, via the UI, video effect options; and receiving, via the UI, input identifying selection of the video effect from the video effect options.
4. The one or more computer storage media of claim 1, the operations further comprising: in response to receiving, via the UI, selection of a track visualization representing the video segment: displaying, via the UI, a video track of the video segment and an effects track of the video effect.
5. The one or more computer storage media of claim 4, the operations further comprising: receiving, via the UI, selection of an adjustment of a current time indicator associated with the video track; and displaying, via the UI, a cursor caret within the transcript at a current time as indicated by the current time indicator in the video track.
6. The one or more computer storage media of claim 4, wherein the video track includes adjustable handles for trimming the video effect.
7. The one or more computer storage media of claim 1, the operations further comprising: upon detection of a video effect type corresponding to the particular type of text stylization or layout, displaying a video effects panel with selectable video effects associated with the video effect type.
8. The one or more computer storage media of claim 1, wherein the configurable mapping is generated by a user.
9. A method comprising: generating a user interface (UI) displaying a transcript of an audio track of a video; and in response to receiving, via the UI, selection of a particular type of text stylization or layout for application to a text segment of the transcript: identifying a video effect corresponding to the particular type of text stylization or layout; in response to identifying the video effect, displaying, via the UI, video effect options; receiving, via the UI, input identifying selection of the video effect from the video effect options; applying the video effect to a video segment corresponding to the text segment; and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript.
10. The method of claim 9, wherein the video effect and the particular type of text stylization or layout are associated via a configurable mapping.
11. The method of claim 10, wherein the configurable mapping is generated by a user.
12. The method of claim 9, further comprising: in response to receiving, via the UI, input identifying selection of a second type of text stylization or layout for application to the text segment: identifying a second video effect corresponding to the second type of text stylization or layout; applying the second video effect to the video segment corresponding to the text segment; and applying the second type of text stylization or layout to the text segment to visually represent the second video effect in the transcript.
13. The method of claim 9, further comprising: in response to receiving, via the UI, selection of a tracks viewing mode representing the video segment: displaying, via the UI, a video track of the video segment and a video effect track of the video effect.
14. The method of claim 13, further comprising: receiving, via the UI, selection of an adjustment of a current time indicator associated with the video track; and displaying, via the UI, a cursor caret within the transcript at a current time as indicated by the current time indicator in the video track.
15. The method of claim 13, wherein the video effect track includes adjustable handles for trimming the video effect.
16. The method of claim 9, further comprising: upon detection of a video effect type corresponding to the particular type of text stylization or layout, displaying a video effects panel with selectable video effects associated with the video effect type.
17. A computer system comprising one or more processors and memory configured to provide computer program instructions to the one or more processors, the computer program instructions comprising: generating a user interface (UI) displaying a transcript of an audio track of a video; receiving, via the UI, input identifying selection of a text segment from the transcript; and in response to receiving, via the UI, input identifying selection of a particular type of text stylization or layout for application to the text segment: displaying, via the UI, video effect options associated with a video effect type for application to a video segment corresponding to the text segment, wherein the video effect type and the particular type of text stylization or layout are associated via a configurable mapping; receiving, via the UI, input identifying selection of a video effect from the video effect options; applying the video effect to the video segment corresponding to the text segment; and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript.
18. The computer system of claim 17, the computer program instructions further comprising: in response to receiving, via the UI, input identifying selection of a second type of text stylization or layout for application to the text segment: identifying a second video effect corresponding to the second type of text stylization or layout; applying the second video effect to the video segment corresponding to the text segment; and applying the second type of text stylization or layout to the text segment to visually represent the second video effect in the transcript.
19. The computer system of claim 17, the computer program instructions further comprising: upon detection of the
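Claims 5 and 14 recite placing a cursor caret in the transcript at the time shown by the current time indicator in the video track. One way this lookup could work, assuming the transcript carries a sorted list of word start times (an assumption, as the claims do not specify the data structure), is a binary search. The function name `caret_index` is hypothetical.

```python
from bisect import bisect_right


def caret_index(word_starts, current_time):
    """Return the index of the word that contains or precedes current_time.

    word_starts must be sorted in ascending order (seconds). Times before
    the first word clamp to index 0.
    """
    # bisect_right counts how many words start at or before current_time;
    # subtracting one gives the index of the last such word.
    i = bisect_right(word_starts, current_time) - 1
    return max(i, 0)
```

With word starts `[0.0, 0.5, 0.9]`, a current time of `0.6` places the caret at word index 1, and any time past the last start stays on the final word.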
by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text · CPC title
Interaction techniques based on cursor appearance or behaviour, e.g. being affected by the presence of displayed objects · CPC title
Interaction with lists of selectable items, e.g. menus · CPC title
involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations · CPC title