Method and apparatus for determining live streaming keyframe, storage medium, and electronic device

US2026095601A1 · US · A1

Patent metadata
Publication number: US-2026095601-A1
Application number: US-202519317170-A
Country: US
Kind code: A1
Filing date: Sep 3, 2025
Priority date: Sep 29, 2024
Publication date: Apr 2, 2026
Grant date: not listed

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of this specification disclose a method and an apparatus for determining a live streaming keyframe, a storage medium, and an electronic device. Feature extraction is performed on a current captured frame of target live streaming by using a feature extraction module, to obtain a feature vector of the current captured frame. The feature vector of the current captured frame is stored in a cache, and a target historical captured frame that is in the cache and that corresponds to the current captured frame is deleted. A feature vector of a plurality of captured frames is read from the cache by using a keyframe calculation module, and at least one keyframe is determined based on the feature vector of the plurality of captured frames. If the current captured frame is in the at least one keyframe, it is determined that the current captured frame is a keyframe.
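The flow described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say how the keyframe calculation module scores frames, so the `farthest_from_centroid` rule below is a stand-in chosen for simplicity, and the names `KeyframeDetector`, `process`, `extract`, and `select` are hypothetical. The eviction and read ranges follow the sequence-number arithmetic stated in claim 1.

```python
import math
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]


def farthest_from_centroid(window: Dict[int, Vector]) -> List[int]:
    """Assumed keyframe rule (not from the patent): pick the frame whose
    feature vector lies farthest from the mean of the window.

    Ties go to the earliest sequence number, so a run of identical
    frames yields a single keyframe.
    """
    seqs = sorted(window)
    dim = len(window[seqs[0]])
    centroid = [sum(window[s][i] for s in seqs) / len(seqs) for i in range(dim)]
    best = seqs[0]
    for s in seqs[1:]:
        if math.dist(window[s], centroid) > math.dist(window[best], centroid):
            best = s
    return [best]


class KeyframeDetector:
    """Hypothetical wrapper around the extract / cache / calculate steps."""

    def __init__(
        self,
        target_quantity: int,
        extract: Callable[[object], Vector],
        select: Callable[[Dict[int, Vector]], List[int]] = farthest_from_centroid,
    ) -> None:
        self.target_quantity = target_quantity
        self.extract = extract          # stands in for the feature extraction module
        self.select = select            # stands in for the keyframe calculation module
        self.cache: Dict[int, Vector] = {}

    def process(self, seq: int, frame: object) -> bool:
        """Handle the current captured frame; return True if it is a keyframe."""
        # Store the feature vector of the current frame (first sequence number).
        self.cache[seq] = self.extract(frame)
        # Delete the historical frame whose sequence number is `target_quantity` behind.
        self.cache.pop(seq - self.target_quantity, None)
        # Read frames from the second sequence number up to the first one.
        second = seq - (self.target_quantity - 1)
        window = {s: v for s, v in self.cache.items() if second <= s <= seq}
        return seq in self.select(window)


# Usage: frames are already plain vectors here, so extraction is the identity.
detector = KeyframeDetector(target_quantity=3, extract=lambda frame: frame)
frames = [[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [1.0, 0.0], [1.0, 0.0]]
flags = [detector.process(seq, f) for seq, f in enumerate(frames, start=1)]
print(flags)  # [True, False, True, False, False]
```

With this assumed rule, the first frame and the frame where the content changes are flagged, while repeated frames are not. A real feature extraction module would replace the identity function with a model that maps an image to a vector.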

First claim

Opening claim text (preview).

1. A method for determining a live streaming keyframe, comprising: performing, by using a feature extraction module, feature extraction on a current captured frame of target live streaming, to obtain a feature vector of the current captured frame, wherein a frame sequence number corresponding to the current captured frame is a first sequence number; storing the feature vector of the current captured frame in a cache, and deleting a target historical captured frame that is in the cache and that corresponds to the current captured frame, wherein a difference between the first sequence number and a target sequence number corresponding to the target historical captured frame is a target quantity; reading, from the cache by using a keyframe calculation module, a feature vector of a plurality of captured frames whose corresponding frame sequence numbers are from a second sequence number to the first sequence number, and determining at least one keyframe based on the feature vector of the plurality of captured frames, wherein a difference between the first sequence number and the second sequence number is the target quantity minus one, and a capture time of a captured frame corresponding to the second sequence number is earlier than a capture time of a captured frame corresponding to the first sequence number; and if the current captured frame is in the at least one keyframe, determining that the current captured frame is a keyframe.

2. The method according to claim 1, further comprising: obtaining a sliding window corresponding to a historical captured frame of the target live streaming, wherein a window length of the sliding window is the target quantity, and a current right boundary of the sliding window is a previous historical captured frame corresponding to the current captured frame; the deleting a target historical captured frame that is in the cache and that corresponds to the current captured frame comprises: sliding the sliding window rightward by one step, so that the current right boundary of the sliding window becomes the current captured frame, and deleting a previous target historical captured frame of a historical captured frame that is in the cache and that corresponds to a current left boundary of the sliding window; and the reading, from the cache by using a keyframe calculation module, a feature vector of a plurality of captured frames whose corresponding frame sequence numbers are from a second sequence number to the first sequence number comprises: reading, from the cache based on the sliding window by using the keyframe calculation module, the feature vector of the plurality of captured frames whose corresponding frame sequence numbers are from the second sequence number to the first sequence number.

3. The method according to claim 2, wherein the sliding the sliding window rightward by one step, so that the current right boundary of the sliding window becomes the current captured frame, and deleting a previous target historical captured frame of a historical captured frame that is in the cache and that corresponds to a current left boundary of the sliding window comprises: obtaining a frame quantity corresponding to the historical captured frame that is of the target live streaming and that is currently cached in the cache; and if the frame quantity is greater than or equal to the window length of the sliding window, sliding the sliding window rightward by one step, so that the current right boundary of the sliding window becomes the current captured frame, and deleting the previous target historical captured frame of the historical captured frame that is in the cache and that corresponds to the current left boundary of the sliding window.

4. The method according to claim 2, further comprising: determining the window length of the sliding window based on the target live streaming.

5. The method according to claim 4, wherein the determining the window length of the sliding window based on the target live streaming comprises: determining the window length of the sliding window based on streamer information and/or product information corresponding to the target live streaming.

6. The method according to claim 5, wherein the determining the window length of the sliding window based on streamer information and/or product information that correspond/corresponds to the target live streaming comprises: determining the window length of the sliding window based on the streamer information and/or the product information corresponding to the target live streaming and with reference to audience information corresponding to the target live streaming.

7. The method according to claim 2, further comprising: performing content detection on the current captured frame by using a content detection module, to obtain a content detection result corresponding to the current captured frame.

8. The method according to claim 7, further comprising: determining, based on the content detection result corresponding to the current captured frame, whether the window length of the sliding window needs to be adjusted; and sliding the current right boundary of the sliding window rightward if it is determined that the window length of the sliding window needs to be increased; or sliding the current left boundary of the sliding window rightward if it is determined that the window length of the sliding window needs to be decreased.

9. The method according to claim 8, wherein the determining, based on the content detection result corresponding to the current captured frame, whether the window length of the sliding window needs to be adjusted comprises: determining, based on the content detection result corresponding to the current captured frame and a content detection result corresponding to a historical keyframe that corresponds to the target live streaming and that is located before the current captured frames, whether the window length of the sliding window needs to be adjusted.

10-11. (canceled)

12. An electronic device, comprising a processor and a memory, wherein the memory stores a computer program, and when the processor executes the computer program, the electronic device is caused to: perform, by using a feature extraction module, feature extraction on a current captured frame of target live streaming, to obtain a feature vector of the current captured frame, wherein a frame sequence number corresponding to the current captured frame is a first sequence number; store the feature vector of the current captured frame in a cache, and delete a target historical captured frame that is in the cache and that corresponds to the current captured frame, wherein a difference between the first sequence number and a target sequence number corresponding to the target historical captured frame is a target quantity; read, from the cache by using a keyframe calculation module, a feature vector of a plurality of captured frames whose corresponding frame sequence numbers are from a second sequence number to the first sequence number, and determine at least one keyframe based on the feature vector of the plurality of captured frames, wherein a difference between the first sequence number and the second sequence number is the target quantity minus one, and a capture time of a captured frame corresponding to the second sequence number is earlier than a capture time of a captured frame corresponding to the first sequence number; and if the current captured frame is in the at least one keyframe, determine that the current captured frame is a keyframe.

13. (canceled)

14. The electronic device according to claim 12 , is further caused to:
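
Claims 2, 3, and 8 describe the cache as a sliding window whose boundaries move rightward. The sketch below is one possible reading, not the applicant's implementation: the class `SlidingWindowCache` and its methods `push`, `resize`, and `bounds` are hypothetical names, and the choice of a `deque` is an assumption. It slides and deletes only once the cache already holds a full window (claim 3), grows by letting the right boundary advance without eviction, and shrinks by moving the left boundary rightward (claim 8).

```python
from collections import deque
from typing import Deque, List, Sequence, Tuple

Vector = Sequence[float]


class SlidingWindowCache:
    """Hypothetical cache of (sequence number, feature vector) pairs.

    The left end of the deque is the window's left boundary and the
    right end is its right boundary.
    """

    def __init__(self, window_length: int) -> None:
        self.window_length = window_length
        self.frames: Deque[Tuple[int, Vector]] = deque()

    def push(self, seq: int, vector: Vector) -> List[int]:
        """Make `seq` the new right boundary; return evicted sequence numbers."""
        evicted = []
        # Slide (and delete on the left) only if the cache is already full.
        while len(self.frames) >= self.window_length:
            evicted.append(self.frames.popleft()[0])
        self.frames.append((seq, vector))
        return evicted

    def resize(self, new_length: int) -> List[int]:
        """Change the window length; return evicted sequence numbers.

        Growing evicts nothing: later pushes move only the right boundary
        until the window is full again. Shrinking moves the left boundary
        rightward immediately.
        """
        self.window_length = new_length
        evicted = []
        while len(self.frames) > self.window_length:
            evicted.append(self.frames.popleft()[0])
        return evicted

    def bounds(self) -> Tuple[int, int]:
        """Return (left boundary, right boundary) as sequence numbers."""
        return self.frames[0][0], self.frames[-1][0]


# Usage with a target quantity (window length) of 3.
cache = SlidingWindowCache(window_length=3)
evictions = [cache.push(seq, [float(seq)]) for seq in range(1, 6)]
print(evictions)       # [[], [], [], [1], [2]]
print(cache.bounds())  # (3, 5)
```

In the usage above, when the current frame has sequence number 5, frame 2 is deleted (5 - 2 = 3, the target quantity) and frames 3 through 5 are read (5 - 3 = 2, the target quantity minus one), which matches the arithmetic in claim 1. How the window length itself is chosen from streamer, product, or audience information (claims 4 to 6) is not specified in this preview, so it is left as a plain integer here.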

Assignees

Alipay Hangzhou Inf Tech Co Ltd

Classifications

  • involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream (arrangements characterised by components specially adapted for monitoring, identification or recognition of video in broadcast systems H04H60/59)

  • Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

  • Live feed

  • communicating with other users, e.g. chatting {(arrangements for providing for computer conferences, e.g. chat rooms, to substation in data switching networks H04L12/1813; distributed application using peer-to-peer [P2P] networks H04L67/104)}

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2026095601A1 cover?
Embodiments of this specification disclose a method and an apparatus for determining a live streaming keyframe, a storage medium, and an electronic device. Feature extraction is performed on a current captured frame of target live streaming by using a feature extraction module, to obtain a feature vector of the current captured frame. The feature vector of the current captured frame is stored i…
Who is the assignee on this patent?
Alipay Hangzhou Inf Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04N21/2187. Mapped technology areas include Electricity.
When was this patent published?
Publication date: Apr 2, 2026 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).