Compressed video processing system

US12518530B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12518530-B2
Application number	US-202318205696-A
Country	US
Kind code	B2
Filing date	Jun 5, 2023
Priority date	Jun 5, 2023
Publication date	Jan 6, 2026
Grant date	Jan 6, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title: What the patent document calls the invention.
Abstract: A short plain-language summary of the technical disclosure.
Assignees and inventors: Who owns or filed the patent and who is credited as inventor.
Key dates: Filing, priority, publication, and grant dates set the timeline.
First independent claim: The legal scope of protection; read this for what is actually claimed.
CPC / IPC classifications: Technology tags used to group this patent with similar filings.
Citations and related patents: Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems are disclosed for applying machine learning models to compressed videos. The system receives a video, depicting an object, that has previously been compressed using one or more video compression processes. The system analyzes, using one or more machine learning models, the video that has previously been compressed to generate a prediction corresponding to the object depicted in the video, with one or more artifacts resulting from application of the one or more machine learning models to the video that has been previously compressed being absent from the prediction. The system generates a visual output based on the prediction in which the one or more artifacts are absent.

First claim

Opening claim text (preview).

What is claimed is:
1. A method comprising: receiving a compressed video, depicting an object, the compressed video having been previously compressed using one or more video compression processes; analyzing, using one or more machine learning models, the compressed video to generate a prediction corresponding to the object depicted in the compressed video, one or more artifacts resulting from application of the one or more machine learning models to the compressed video that has been previously compressed being absent from the prediction; generating, by the one or more machine learning models based on the prediction, an augmented version of the compressed video in which a virtual object is overlaid on the object depicted in the compressed video; and generating a visual output comprising the augmented version of the compressed video in which the one or more artifacts are absent.
2. The method of claim 1, wherein the one or more machine learning models comprise a classifier, and wherein the prediction comprises a classification of the object.
3. The method of claim 2, wherein the classification indicates a type of the object.
4. The method of claim 2, wherein the classification indicates whether an object is real or fake.
5. The method of claim 1, wherein the one or more machine learning models comprise a convolutional neural network associated with a fashion item extended reality experience.
6. The method of claim 5, further comprising: receiving a target fashion item associated with the fashion item extended reality experience, wherein the prediction comprises a new video that depicts the object wearing the target fashion item.
7. The method of claim 1, wherein the one or more machine learning models are trained by performing training operations comprising: accessing training data comprising a training pair of a training compressed video depicting a training object and a ground truth data associated with the training compressed video; and analyzing, using the one or more machine learning models, the training compressed video to estimate a prediction for the training object.
8. The method of claim 7, the training operations comprising: computing a loss based on a deviation between the estimated prediction for the training object and the ground truth data associated with the compressed video; and updating one or more parameters of the one or more machine learning models based on the computed loss.
9. The method of claim 8, further comprising repeating the training operations for additional training data until a stopping criterion is met.
10. The method of claim 9, wherein the one or more artifacts resulting from application of the one or more machine learning models to the training compressed video are excluded.
11. The method of claim 8, further comprising: accessing a first training image; generating a training video using the first training image; applying the one or more video compression processes to the training video to generate the training compressed video comprising a set of artifacts; storing the training video as the ground truth data for the training compressed video in which the set of artifacts is absent; and forming the training pair comprising the training compressed video and the ground truth data.
12. The method of claim 8, wherein the ground truth data associated with the training compressed video comprises a first sequence of frames depicting a person wearing a fashion item, and wherein the training compressed video comprises a second sequence of frames depicting the person wearing the fashion item with a set of artifacts.
13. The method of claim 12, wherein the second sequence of frames is generated by applying the one or more video compression processes to the first sequence of frames.
14. The method of claim 13, wherein the fashion item comprises a virtual fashion item, further comprising generating the first sequence of frames by: applying a fashion item machine learning model to an individual image to overlay the virtual fashion item on the person depicted in the individual image; and replicating the individual image overlaid with the virtual fashion item a threshold quantity of times corresponding to a quantity of the first sequence of frames.
15. A system comprising: at least one processor; and at least one memory component having instructions stored thereon that, when executed by the at least one processor, cause the at least one processor to perform operations comprising: receiving a compressed video, depicting an object, the compressed video having been previously compressed using one or more video compression processes; analyzing, using one or more machine learning models, the compressed video to generate a prediction corresponding to the object depicted in the compressed video, one or more artifacts resulting from application of the one or more machine learning models to the compressed video that has been previously compressed being absent from the prediction; generating, by the one or more machine learning models based on the prediction, an augmented version of the compressed video in which a virtual object is overlaid on the object depicted in the compressed video; and generating a visual output comprising the augmented version of the compressed video in which the one or more artifacts are absent.
16. The system of claim 15, wherein the one or more machine learning models comprise a classifier, and wherein the prediction comprises a classification of the object.
17. The system of claim 16, wherein the classification indicates a type of the object.
18. The system of claim 16, wherein the one or more machine learning models are trained by performing training operations comprising: accessing training data comprising a training pair of a training compressed video depicting a training object and a ground truth data associated with the training compressed video; analyzing, using the one or more machine learning models, the training compressed video to estimate a prediction for the training object; computing a loss based on a deviation between the estimated prediction for the training object and the ground truth data associated with the compressed video; and updating one or more parameters of the one or more machine learning models based on the computed loss.
19. The system of claim 18, wherein the training operations comprise: accessing a first training image; generating a training video using the first training image; applying the one or more video compression processes to the training video to generate the training compressed video comprising a set of artifacts; storing the training video as the ground truth data for the training compressed video in which the set of artifacts is absent; and forming the training pair comprising the training compressed video and the ground truth data.
20. A non-transitory computer-readable storage medium having stored thereon instructions that, when executed by at least one processor, cause the at least one processor to perform operations comprising: receiving a compressed video, depicting an object, the compressed video having been previously compressed using one or more video compression processes; analyzing, using one or more machine learning models, the compressed video to generate a prediction corresponding to the object depicted in the compressed video, one or more artifacts resulting from application of the one or more machine lea…
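
The training operations recited in claims 7 to 9, 11 and 14 (replicate one image into a video, compress it, keep the uncompressed video as ground truth, then update model parameters from a loss until a stopping criterion is met) can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not taken from the patent: the function names are hypothetical, coarse pixel quantization stands in for a real video codec, and a two-parameter linear model stands in for the one or more machine learning models.

```python
import random

def replicate_image(image, num_frames):
    """Build a still 'video' by copying one image num_frames times (cf. claims 11, 14)."""
    return [[row[:] for row in image] for _ in range(num_frames)]

def simulate_compression(video, step=32):
    """Hypothetical stand-in for a codec: floor-quantize pixels to multiples of step."""
    return [[[(p // step) * step for p in row] for row in frame] for frame in video]

def build_training_pair(image, num_frames=4, step=32):
    """Return (compressed video with artifacts, artifact-free ground truth)."""
    ground_truth = replicate_image(image, num_frames)
    compressed = simulate_compression(ground_truth, step)
    return compressed, ground_truth

def flatten(video):
    """Flatten a video to a list of pixel values scaled to [0, 1]."""
    return [p / 255.0 for frame in video for row in frame for p in row]

def mse_loss(a, b, xs, ys):
    """Mean squared deviation between predictions a*x + b and ground truth y."""
    return sum((a * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def train(pairs, lr=0.3, max_steps=200, tol=1e-6):
    """Fit y = a*x + b by gradient descent; stop when the loss stops improving."""
    xs, ys = [], []
    for compressed, truth in pairs:
        xs += flatten(compressed)
        ys += flatten(truth)
    n = len(xs)
    a, b = 1.0, 0.0  # start from the identity mapping
    history = [mse_loss(a, b, xs, ys)]
    for _ in range(max_steps):
        grad_a = sum(2 * (a * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (a * x + b - y) for x, y in zip(xs, ys)) / n
        a -= lr * grad_a
        b -= lr * grad_b
        history.append(mse_loss(a, b, xs, ys))
        if abs(history[-2] - history[-1]) < tol:  # stopping criterion
            break
    return a, b, history

# Usage: one random 8x8 grayscale image becomes one training pair.
rng = random.Random(0)
image = [[rng.randrange(256) for _ in range(8)] for _ in range(8)]
compressed, truth = build_training_pair(image)
a, b, history = train([(compressed, truth)])
print(f"loss before: {history[0]:.5f}, after: {history[-1]:.5f}")
```

The sketch only shows the data flow: the compressed video is the model input, the uncompressed video is the target, and the loss measures their deviation. A production pipeline would use an actual encoder and a neural network in place of the placeholders above.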

Assignees

Snap Inc

Inventors

Classifications

G06V10/82: using neural networks
H04N5/272: Means for inserting a foreground image in a background image, i.e. inlay, outlay
H04N2005/2726: for simulating a person's appearance, e.g. hair style, glasses, clothes
H04N5/265: Mixing
G06T11/60: Creating or editing images; Combining images with text

Patent family

Related publications grouped by family.

Patent family ID: 91738458

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12518530B2 cover? Methods and systems are disclosed for applying machine learning models to compressed videos. The system receives a video, depicting an object, that has previously been compressed using one or more video compression processes. The system analyzes, using one or more machine learning models, the video that has previously been compressed to generate a prediction corresponding to the object depicted…
Who is the assignee on this patent? Snap Inc
What technology area does this patent fall under? Primary CPC classification H04N19/86. Mapped technology areas include Electricity.
When was this patent published? Publication date Jan 6, 2026 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb? We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Related patents

Three-dimensional models of users wearing clothing items

Catalog normalization and segmentation for fashion images

Selectively enhancing compressed digital content
