System and method for calibrating moving cameras capturing broadcast video

US11935247B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11935247-B2
Application numberUS-202318175278-A
CountryUS
Kind codeB2
Filing dateFeb 27, 2023
Priority dateFeb 28, 2019
Publication dateMar 19, 2024
Grant dateMar 19, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method of calibrating moving cameras capturing a sporting event is disclosed herein. A computing system retrieves a broadcast video feed for a sporting event. The broadcast video feed includes a plurality of video frames. The computing system labels, via a neural network, components of a playing surface captured in each video frame. The computing system matches a subset of labeled video frames to a set of templates with various camera perspectives. The computing system fits a playing surface model to the set of labeled video frames that were matched to the set of templates. The computing system identifies camera motion in each video frame using an optical flow model. The computing system generates a homography matrix for each video frame based on the fitted playing surface model and camera motion. The computing system calibrates each camera based on the homography matrix generated for each video frame.

First claim

Opening claim text (preview).

What is claimed: 1. A method of calibrating a moving camera capturing a sporting event, comprising: identifying, by a computing system, a live broadcast video feed for a sporting event, the live broadcast video feed comprising a plurality of video frames captured by a camera located in a sporting venue; receiving, by the computing system, pose data for one or more players; removing, by the computing system, the one or more players from the plurality of video frames based on the pose data to generate a plurality of empty video frames; analyzing, by the computing system, the plurality of empty video frames to identify a flow field between successive video frames of the plurality of empty video frames; generating, by the computing system, a homography matrix for each empty video frame based on the flow field between the successive video frames; and calibrating, by the computing system over a network, the camera based on the homography matrix generated for each empty video frame of the plurality of empty video frames by applying each homography matrix to a corresponding video frame of the plurality of video frames. 2. The method of claim 1 , wherein removing, by the computing system, the players from the plurality of video frames to generate the plurality of empty video frames comprises: identifying one or more pixels corresponding to at least one of the one or more players in a video frame; and removing the one or more pixels from the video frame. 3. The method of claim 1 , wherein the homography matrix indicates a transform that is used to project the players' locations from the plurality of video frames to real world coordinates on a playing surface. 4. The method of claim 1 , wherein analyzing, by the computing system, the plurality of empty video frames to identify the flow field between the successive video frames of the plurality of empty video frames comprises: identifying an object in a first video frame; and tracking motion of the object in at least one second video frame succeeding the first video frame. 5. The method of claim 1 , wherein identifying, by the computing system, the live broadcast video feed for the sporting event comprises: identifying a video frame of the plurality of video frames captured by a second camera remotely located in the sporting venue. 6. The method of claim 5 , wherein calibrating, by the computing system over the network, the camera based on the homography matrix generated for each empty video frame of the plurality of empty video frames comprises: calibrating the second camera based on the homography matrix generated for an empty video frame corresponding to the video frame. 7. A non-transitory computer readable medium comprising one or more sequences of instructions, which, when executed by one or more processors, causes a computing system to perform operations, comprising: identifying, by the computing system, a live broadcast video feed for a sporting event, the live broadcast video feed comprising a plurality of video frames captured by a camera located in a sporting venue; receiving, by the computing system, pose data for one or more players; removing, by the computing system, the one or more players from the plurality of video frames based on the pose data to generate a plurality of empty video frames; analyzing, by the computing system, the plurality of empty video frames to identify a flow field between successive video frames of the plurality of empty video frames; generating, by the computing system, a homography matrix for each empty video frame based on the flow field between the successive video frames; and calibrating, by the computing system over a network, the camera based on the homography matrix generated for each empty video frame of the plurality of empty video frames by applying each homography matrix to a corresponding video frame of the plurality of video frames. 8. The non-transitory computer readable medium of claim 7 , wherein removing, by the computing system, the players from the plurality of video frames to generate the plurality of empty video frames comprises: identifying one or more pixels corresponding to at least one of the one or more players in a video frame; and removing the one or more pixels from the video frame. 9. The non-transitory computer readable medium of claim 7 , wherein the homography matrix indicates a transform that is used to project the players' locations from the plurality of video frames to real world coordinates on a playing surface. 10. The non-transitory computer readable medium of claim 7 , wherein analyzing, by the computing system, the plurality of empty video frames to identify the flow field between the successive video frames of the plurality of empty video frames comprises: identifying an object in a first video frame; and tracking motion of the object in at least one second video frame succeeding the first video frame. 11. The non-transitory computer readable medium of claim 7 , wherein identifying, by the computing system, the live broadcast video feed for the sporting event comprises: identifying a video frame of the plurality of video frames captured by a second camera remotely located in the sporting venue. 12. The non-transitory computer readable medium of claim 11 , wherein calibrating, by the computing system over the network, the camera based on the homography matrix generated for each empty video frame of the plurality of empty video frames comprises: calibrating the second camera based on the homography matrix generated for an empty video frame corresponding to the video frame. 13. A system comprising: a processor; and a memory comprising one or more sequences of instructions, which, when executed by the processor, causes the system to perform operations comprising: identifying a live broadcast video feed for a sporting event, the live broadcast video feed comprising a plurality of video frames captured by a camera located in a sporting venue; receiving pose data for one or more players; removing the one or more players from the plurality of video frames based on the pose data to generate a plurality of empty video frames; analyzing the plurality of empty video frames to identify a flow field between successive video frames of the plurality of empty video frames; generating a homography matrix for each empty video frame based on the flow field between the successive video frames; and calibrating, over a network, the camera based on the homography matrix generated for each empty video frame of the plurality of empty video frames by applying each homography matrix to a corresponding video frame of the plurality of video frames. 14. The system of claim 13 , wherein removing the players from the plurality of video frames to generate the plurality of empty video frames comprises: identifying one or more pixels corresponding to at least one of the one or more players in a video frame; and removing the one or more pixels from the video frame. 15. The system of claim 13 , wherein the homography matrix indicates a transform that is used to project the players' locations from the plurality of video frames to real world coordinates on a playing surface. 16. The system of claim 13 , wherein analyzing the plurality of empty video frames to identify the flow field between the successive video frames of the plurality of empty video frames comprises: identifying an object in a first video frame; and tracking motion of the object in at least one second video frame succeeding the first video frame. 17. The system of claim 13 , w

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Proximity, similarity or dissimilarity measures · CPC title

  • involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream (arrangements characterised by components specially adapted for monitoring, identification or recognition of video in broadcast systems H04H60/59) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11935247B2 cover?
A system and method of calibrating moving cameras capturing a sporting event is disclosed herein. A computing system retrieves a broadcast video feed for a sporting event. The broadcast video feed includes a plurality of video frames. The computing system labels, via a neural network, components of a playing surface captured in each video frame. The computing system matches a subset of labeled …
Who is the assignee on this patent?
Stats Llc
What technology area does this patent fall under?
Primary CPC classification G06V40/23. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 19 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).