User interface method and apparatus for video navigation using captions

US12574608B2 · US · B2

Patent metadata
Publication number: US-12574608-B2
Application number: US-202318144907-A
Country: US
Kind code: B2
Filing date: May 9, 2023
Priority date: May 9, 2023
Publication date: Mar 10, 2026
Grant date: Mar 10, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for navigating a video via interaction with text overlaid on the video are described. In one example, a method includes generating for display a video, and generating for display at least one line of text overlaid over the video. Then, in response to receiving a directional user interface input for at least a portion of the at least one line of text, the method includes modifying a play position of the video based on a direction of the directional user interface input for the at least a portion of the at least one line of text.
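To make the abstract concrete, here is a minimal sketch of direction-based navigation over caption text. It is an illustration under stated assumptions, not the patented implementation: the `Cue` type, the `navigate` function, and the mapping of "up" to forward and "down" to backward are hypothetical choices (the direction mapping mirrors the upward and downward drags described in claim 3).

```python
from bisect import bisect_right
from dataclasses import dataclass


@dataclass(frozen=True)
class Cue:
    """One caption line and the time (in seconds) at which its video segment starts."""
    start: float
    text: str


def cue_index_at(cues: list[Cue], position: float) -> int:
    """Index of the cue whose segment contains `position` (cues sorted by start time)."""
    starts = [c.start for c in cues]
    return max(0, bisect_right(starts, position) - 1)


def navigate(cues: list[Cue], position: float, direction: str) -> float:
    """Return the new play position after a directional input on the caption text.

    Assumed mapping (hypothetical): "up" jumps to the following caption line,
    "down" jumps to the preceding one. The result is clamped to the cue list.
    """
    step = {"up": 1, "down": -1}[direction]
    target = cue_index_at(cues, position) + step
    target = min(max(target, 0), len(cues) - 1)
    return cues[target].start


# Hypothetical caption track used only for illustration.
CUES = [Cue(0.0, "Hello."), Cue(4.0, "How are you?"), Cue(9.5, "Fine, thanks.")]
```

Calling `navigate(CUES, 5.0, "up")` moves playback from the second caption line to the start of the third; an input past either end of the track leaves playback on the first or last line.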

First claim

Opening claim text (preview).

What is claimed is:

1. A method for media navigation comprising: generating for display a video; generating for display at least one line of text overlaid over the video; in response to receiving a first directional swipe in a first direction and having a first speed, for at least a portion of the at least one line of text: modifying a play position of the video in a first manner based on the first direction and the first speed of the first directional swipe for the at least a portion of the at least one line of text; and modifying the display of the at least one line of text overlaid over the video based on the first direction and the first speed of the first directional swipe; and in response to receiving a selection of a specific portion of text of the at least one line of text overlaid over the video: identifying from the video a plurality of scenes related to the specific portion of text; and providing a plurality of selectable options corresponding to the plurality of scenes wherein a selection of a respective selectable option causes a respective scene to be generated for display; and in response to receiving a second directional swipe in a second direction orthogonal to the first direction and having a second speed, for at least a portion of the at least one line of text: modifying the play position of the video in a second manner based on the second direction and the second speed of the second directional swipe for the at least a portion of the at least one line of text; and modifying the display of the at least one line of text overlaid over the video based on the second direction and the second speed of the second directional swipe.

2. The method of claim 1, wherein modifying the play position of the video in the first manner comprises modifying the play position at a rate proportional to the first speed of the first directional swipe, and wherein modifying the play position of the video in the second manner comprises modifying the play position at a rate proportional to the second speed of the second directional swipe.

3. The method of claim 1, further comprising: in response to receiving a directional swipe comprising a selection and upward drag of the at least a portion of the at least one line of text, modifying the play position of the video forward proportional to a speed of the upward drag; and in response to receiving a directional swipe comprising a selection and downward drag of the at least a portion of the at least one line of text, modifying the play position of the video backward proportional to a speed of the downward drag.

4. The method of claim 3, wherein the directional swipe is received via a touch screen, and wherein the selection and upward drag comprises a tap and swipe.

5. The method of claim 1, further comprising: generating for display a plurality of lines of text overlaid over the video; generating for display a current line of text that corresponds to a current video segment in a middle position, wherein the current line of text is visually different from adjacent lines of text; generating for display a first adjacent line of text that corresponds to an adjacent preceding video segment above the middle position; and generating for display a second adjacent line of text that corresponds to an adjacent following video segment below the middle position.

6. The method of claim 1, further comprising: in response to receiving a selection of a selected line of text of the at least one line of text overlaid over the video, modifying the play position of the video to a video segment corresponding to the selected line of text.

7. The method of claim 1, wherein the at least one line of text overlaid over the video comprises a first page of text generated for display in a page format and comprising a plurality of lines of text, the method further comprising: in response to receiving the first directional swipe in the first direction and having the first speed, for at least a portion of the first page of text: generating for display a second page of text comprising a second plurality of lines of text; and modifying the play position of the video in the first manner to a first video segment corresponding to a first line of text of the second plurality of lines of text of the second page of text; and in response to receiving the second directional swipe in the second direction orthogonal to the first direction and having the second speed, for at least a portion of the first page of text: generating for display a third page of text comprising a third plurality of lines of text; and modifying the play position of the video in the second manner to a second video segment corresponding to the first line of text of the third plurality of lines of text of the third page of text.

8. The method of claim 1, further comprising: generating for display credits corresponding to the video; and in response to receiving a user interface input selecting a selected portion of the credits: generating for display an indication of one or more video segments that correspond to the selected portion of the credits; loading the one or more video segments of the video that correspond to the selected portion of the credits; and setting a current play position of the video to a first video segment of the one or more video segments that correspond to the selected portion of the credits.

9. The method of claim 1, further comprising: generating for display song information for one or more songs played during the video; and in response to receiving a user interface input selecting a selected portion of the song information: generating for display an indication of one or more video segments that correspond to the selected portion of the song information; and causing an audio output of audio corresponding to the selected portion of the song information.

10. A system for media navigation comprising: input/output circuitry configured to: generate for display a video; and generate for display at least one line of text overlaid over the video; and control circuitry configured to: in response to receiving a first directional swipe in a first direction and having a first speed, for at least a portion of the at least one line of text: modify a play position of the video in a first manner based on the first direction and the first speed of the first directional swipe for the at least a portion of the at least one line of text; and modify the display of the at least one line of text overlaid over the video based on the first direction and the first speed of the first directional swipe; and in response to receiving a selection of a specific portion of text of the at least one line of text overlaid over the video: identify from the video a plurality of scenes related to the specific portion of text; and provide a plurality of selectable options corresponding to the plurality of scenes wherein a selection of a respective selectable option causes a respective scene to be generated for display; and in response to receiving a second directional swipe in a second direction orthogonal to the first direction and having a second speed, for at least a portion of the at least one line of text: modify the play position of the video in a second manner based on the second direction and the second speed of the second directional swipe for the at least a portion of the at least one line of text; and modify the display of the at least one line of text overlaid over the video based on the second direction and the second speed of the second directional swipe.

11. The system of claim 10, wherein the control circuitry is co
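Claims 2, 3 and 5 describe behavior that is easier to follow in code: a scrub rate proportional to swipe speed, with upward drags moving forward and downward drags moving backward, and a three-line caption display with the current line in the middle. The sketch below is one possible reading, not the claimed implementation. The `Swipe` type, the `GAIN` constant, and all function names are hypothetical, and the use of screen coordinates (negative `dy` means an upward drag) is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Swipe:
    """A drag on the caption text, in screen coordinates (negative dy = upward)."""
    dy: float        # vertical displacement in pixels
    duration: float  # how long the drag lasted, in seconds


# Hypothetical tuning constant: video seconds scrubbed per second of wall time,
# for each pixel/second of swipe speed.
GAIN = 0.01


def scrub_rate(swipe: Swipe) -> float:
    """Signed rate of change of the play position, proportional to swipe speed."""
    if swipe.duration <= 0:
        raise ValueError("swipe duration must be positive")
    speed = abs(swipe.dy) / swipe.duration
    direction = 1.0 if swipe.dy < 0 else -1.0  # assumed: up = forward, down = backward
    return direction * GAIN * speed


def apply_scrub(position: float, swipe: Swipe, dt: float, video_length: float) -> float:
    """Move the play position for `dt` seconds at the swipe's scrub rate, clamped to the video."""
    new_position = position + scrub_rate(swipe) * dt
    return min(max(new_position, 0.0), video_length)


def caption_window(lines: list[str], current: int) -> tuple[Optional[str], str, Optional[str]]:
    """Lines to show above, in, and below the middle position (None at either end)."""
    above = lines[current - 1] if current > 0 else None
    below = lines[current + 1] if current + 1 < len(lines) else None
    return above, lines[current], below
```

Under these assumptions, a swipe that is four times faster scrubs four times faster, and the caption window for the first line of a video has nothing above the middle position.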

Assignees

Adeia Guides Inc

Inventors

Classifications

  • for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks (specific graphical features in visual interfaces H04N21/4312) · CPC title

  • using a touch-screen or digitiser, e.g. input of commands through traced gestures · CPC title

  • Sound input; Sound output (speech processing G10L) · CPC title

  • for displaying subtitles · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12574608B2 cover?
Systems and methods for navigating a video via interaction with text overlaid on the video are described. In one example, a method includes generating for display a video, and generating for display at least one line of text overlaid over the video. Then, in response to receiving a directional user interface input for at least a portion of the at least one line of text, the method includes modi…
Who is the assignee on this patent?
Adeia Guides Inc
What technology area does this patent fall under?
Primary CPC classification H04N21/4884. Mapped technology areas include Electricity.
When was this patent published?
Publication date Mar 10, 2026 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).