Video editing using transcript text stylization and layout

US12206930B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12206930-B2
Application numberUS-202318154412-A
CountryUS
Kind codeB2
Filing dateJan 13, 2023
Priority dateJan 13, 2023
Publication dateJan 21, 2025
Grant dateJan 21, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure provide, a method, a system, and a computer storage media that provide mechanisms for multimedia effect addition and editing support for text-based video editing tools. The method includes generating a user interface (UI) displaying a transcript of an audio track of a video and receiving, via the UI, input identifying selection of a text segment from the transcript. The method also includes in response to receiving, via the UI, input identifying selection of a particular type of text stylization or layout for application to the text segment. The method further includes identifying a video effect corresponding to the particular type of text stylization or layout, applying the video effect to a video segment corresponding to the text segment, and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript.

First claim

Opening claim text (preview).

What is claimed is: 1. One or more computer storage media storing computer-useable instructions that, when executed by one or more computing devices, cause the one or more computing devices to perform operations comprising: generating a user interface (UI) displaying a transcript of an audio track of a video; receiving, via the UI, input identifying selection of a text segment from the transcript; and in response to receiving, via the UI, input identifying selection of a particular type of text stylization or layout for application to the text segment: identifying a video effect corresponding to the particular type of text stylization or layout, wherein the video effect is mapped to the particular type of text stylization or layout based on a configurable mapping; in response to identifying the video effect, displaying, via the UI, video effect options; receiving, via the UI, input identifying selection of the video effect from the video effect options; applying the video effect to a video segment corresponding to the text segment; and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript. 2. The one or more computer storage media of claim 1 , the operations further comprising: in response to receiving, via the UI, input identifying selection of a second type of text stylization or layout for application to the text segment: identifying a second video effect corresponding to the second type of text stylization or layout; applying the second video effect to the video segment corresponding to the text segment; and applying the second type of text stylization or layout to the text segment to visually represent the second video effect in the transcript. 3. The one or more computer storage media of claim 1 , where identifying the video effect comprises: displaying, via the UI, video effect options; and receiving, via the UI, input identifying selection of the video effect from the video effect options. 4. The one or more computer storage media of claim 1 , the operations further comprising: in response to receiving, via the UI, selection of a track visualization representing the video segment: displaying, via the UI, a video track of the video segment and an effects track of the video effect. 5. The one or more computer storage media of claim 4 , the operations further comprising: receiving, via the UI, selection of an adjustment of a current time indicator associated with the video track; and displaying, via the UI, a cursor caret within the transcript at a current time as indicated by the current time indicator in the video track. 6. The one or more computer storage media of claim 4 , wherein the video track includes adjustable handles for trimming the video effect. 7. The one or more computer storage media of claim 1 , the operations further comprising: upon detection of a video effect type corresponding to the particular type of text stylization or layout, displaying a video effects panel with selectable video effects associated with the video effect type. 8. The one or more computer storage media of claim 1 , wherein the configurable mapping is generated by a user. 9. A method comprising: generating a user interface (UI) displaying a transcript of an audio track of a video; and in response to receiving, via the UI, selection of a particular type of text stylization or layout for application to a text segment of the transcript: identifying a video effect corresponding to the particular type of text stylization or layout; in response to identifying the video effect, displaying, via the UI, video effect options; receiving, via the UI, input identifying selection of the video effect from the video effect options; applying the video effect to a video segment corresponding to the text segment; and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript. 10. The method of claim 9 , wherein the video effect and the particular type of text stylization or layout are associated via a configurable mapping. 11. The method of claim 10 , wherein the configurable mapping is generated by a user. 12. The method of claim 9 , further comprising: in response to receiving, via the UI, input identifying selection of a second type of text stylization or layout for application to the text segment: identifying a second video effect corresponding to the second type of text stylization or layout; applying the second video effect to the video segment corresponding to the text segment; and applying the second type of text stylization or layout to the text segment to visually represent the second video effect in the transcript. 13. The method of claim 9 , further comprising: in response to receiving, via the UI, selection of a tracks viewing mode representing the video segment: displaying, via the UI, a video track of the video segment and a video effect track of the video effect. 14. The method of claim 13 , further comprising: receiving, via the UI, selection of an adjustment of a current time indicator associated with the video track; and displaying, via the UI, a cursor caret within the transcript at a current time as indicated by the current time indicator in the video track. 15. The method of claim 13 , wherein the video effect track includes adjustable handles for trimming the video effect. 16. The method of claim 9 , further comprising: upon detection of a video effect type corresponding to the particular type of text stylization or layout, displaying a video effects panel with selectable video effects associated with the video effect type. 17. A computer system comprising one or more processors and memory configured to provide computer program instructions to the one or more processors, the computer program instructions comprising: generating a user interface (UI) displaying a transcript of an audio track of a video; receiving, via the UI, input identifying selection of a text segment from the transcript; and in response to receiving, via the UI, input identifying selection of a particular type of text stylization or layout for application to the text segment: displaying, via the UI, video effect options associated with a video effect type for application to a video segment corresponding to the text segment, wherein the video effect type and the particular type of text stylization or layout are associated via a configurable mapping; receiving, via the UI, input identifying selection of a video effect from the video effect options; applying the video effect to the video segment corresponding to the text segment; and applying the particular type of text stylization or layout to the text segment to visually represent the video effect in the transcript. 18. The computer system of claim 17 , the computer program instructions further comprising: in response to receiving, via the UI, input identifying selection of a second type of text stylization or layout for application to the text segment: identifying a second video effect corresponding to the second type of text stylization or layout; applying the second video effect to the video segment corresponding to the text segment; and applying the second type of text stylization or layout to the text segment to visually represent the second video effect in the transcript. 19. The computer system of claim 17 , the computer program instructions further comprising: upon detection of the

Assignees

Inventors

Classifications

  • by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text · CPC title

  • Interaction techniques based on cursor appearance or behaviour, e.g. being affected by the presence of displayed objects · CPC title

  • Interaction with lists of selectable items, e.g. menus · CPC title

  • involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12206930B2 cover?
Embodiments of the present disclosure provide, a method, a system, and a computer storage media that provide mechanisms for multimedia effect addition and editing support for text-based video editing tools. The method includes generating a user interface (UI) displaying a transcript of an audio track of a video and receiving, via the UI, input identifying selection of a text segment from the tr…
Who is the assignee on this patent?
Adobe Inc
What technology area does this patent fall under?
Primary CPC classification H04N21/440236. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Jan 21 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).