Video analytics accuracy using transfer learning

US2024037778A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024037778-A1
Application numberUS-202318361340-A
CountryUS
Kind codeA1
Filing dateJul 28, 2023
Priority dateJul 30, 2022
Publication dateFeb 1, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are provided for increasing accuracy of video analytics tasks in real-time by acquiring a video using video cameras, and identifying fluctuations in the accuracy of video analytics applications across consecutive frames of the video. The identified fluctuations are quantified based on an average relative difference of true-positive detection counts across consecutive frames. Fluctuations in accuracy are reduced by applying transfer learning to a deep learning model initially trained using images, and retraining the deep learning model using video frames. A quality of object detections is determined based on an amount of track-ids assigned by a tracker across different video frames. Optimization of the reduction of fluctuations includes iteratively repeating the identifying, the quantifying, the reducing, and the determining the quality of object detections until a threshold is reached. Model predictions for each frame in the video are generated using the retrained deep learning model.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for increasing accuracy of video analytics tasks in real-time, comprising: acquiring a video using one or more video cameras, and identifying fluctuations in the accuracy of video analytics applications across consecutive frames of the video; quantifying the identified fluctuations by determining an average relative difference of true-positive detection counts across the consecutive frames; reducing the fluctuations in accuracy by applying transfer learning to a deep learning model initially trained using images, and retraining the deep learning model using video frames captured for a plurality of different scenarios; determining a quality of object detections based on an amount of track-ids assigned by a tracker across different video frames; optimizing the reducing the fluctuations by iteratively repeating the identifying fluctuations, the quantifying the identified fluctuations, the reducing the fluctuations in accuracy, and the determining a quality of object detections until a threshold is reached; and generating model predictions for each frame in the video using the retrained deep learning model for the video analytics tasks. 2 . The method as recited in claim 1 , further comprising performing object and person detection across the different video frames of the video in real-time with increased detection speed and accuracy by using the deep learning model retrained using the video frames. 3 . The method as recited in claim 1 , further comprising performing object and person counts for a particular area of interest based on the captured video using the retrained deep learning model, and generating a list of the predicted object and person counts in the area of interest. 4 . The method as recited in claim 1 , wherein the retraining of the deep learning model using transfer learning includes extracting a plurality of video frames from the video and pre-processing the extracted frames to match input requirements of the deep learning model. 5 . The method as recited in claim 1 , wherein the fluctuations in accuracy are reduced by adjusting a confidence threshold in the deep learning model based on a difficulty level associated with detection in particular frames of the video. 6 . The method as recited in claim 1 , wherein the fluctuations in accuracy result from an adversarial effect caused by automatic, dynamic camera parameter changes in a video camera. 7 . The method as recited in claim 1 , wherein the fluctuations in accuracy are quantified by determining the average relative difference of the true-positive object detection counts across the consecutive frames by: ∥ tp ( i )− tp ( i+ 1)∥/mean( gt ( i , gt ( + 1), where i represents a video frame, tp(i) represents a true positive object detection count on frame i, and gt(i) represents a ground-truth object count on frame i, on a moving window of 2 frames. 8 . The method as recited in claim 1 , wherein the fluctuations in accuracy are quantified by determining the average relative difference of the true-positive object detection counts across the consecutive frames by:  max ⁢ ( tp ⁡ ( i ) , … , tp ⁡ ( i + 9 ) ) - min ⁡ ( tp ⁡ ( i ) , … , tp ⁡ ( i + 9 ) )  mean ( gt ⁡ ( i ) , … , gt ⁡ ( i + 9 ) ) , where i represents a video frame, tp(i) represents a true positive object detection count on frame i, and gt(i) represents a ground-truth object count on frame i, on a moving window of 10 frames. 9 . A system for increasing accuracy of video analytics tasks in real-time, comprising: a processor operatively coupled to a non-transitory computer-readable storage medium, the processor being configured for: acquiring a video using one or more video cameras, and identifying fluctuations in the accuracy of video analytics applications across consecutive frames of the video; quantifying the identified fluctuations by determining an average relative difference of true-positive detection counts across the consecutive frames; reducing the fluctuations in accuracy by applying transfer learning to a deep learning model initially trained using images, and retraining the deep learning model using video frames captured for a plurality of different scenarios; determining a quality of object detections based on an amount of track-ids assigned by a tracker across different video frames; optimizing the reducing the fluctuations by iteratively repeating the identifying fluctuations, the quantifying the identified fluctuations, the reducing the fluctuations in accuracy, and the determining a quality of object detections until a threshold is reached; and generating model predictions for each frame in the video using the retrained deep learning model for the video analytics tasks. 10 . The system as recited in claim 9 , wherein the processor is further configured for performing object and person detection across the different video frames of the video in real-time with increased detection speed and accuracy by using the deep learning model retrained using the video frames. 11 . The system as recited in claim 9 , wherein the proces

Assignees

Inventors

Classifications

  • G06T7/70Primary

    Determining position or orientation of objects or cameras (camera calibration G06T7/80) · CPC title

  • Inspection of images, e.g. flaw detection · CPC title

  • G06V10/25Primary

    Determination of region of interest [ROI] or a volume of interest [VOI] · CPC title

  • Static body considered as a whole, e.g. static pedestrian or occupant recognition · CPC title

  • Target detection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024037778A1 cover?
Systems and methods are provided for increasing accuracy of video analytics tasks in real-time by acquiring a video using video cameras, and identifying fluctuations in the accuracy of video analytics applications across consecutive frames of the video. The identified fluctuations are quantified based on an average relative difference of true-positive detection counts across consecutive frames.…
Who is the assignee on this patent?
Nec Lab America Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/70. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Feb 01 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).