What technology area does this patent fall under?

Primary CPC classification G06V10/82. Mapped technology areas include Physics.

When was this patent published?

Publication date: Aug 10, 2017 (kind code A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Video classification method and apparatus

Patent metadata
Field	Value
Publication number	US-2017228618-A1
Application number	US-201715495541-A
Country	US
Kind code	A1
Filing date	Apr 24, 2017
Priority date	Oct 24, 2014
Publication date	Aug 10, 2017
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A video classification method and apparatus are provided in embodiments of the present invention. The method includes: establishing a neural network classification model according to a relationship between features of video samples and a semantic relationship of the video samples; obtaining a feature combination of a to-be-classified video file; and classifying the to-be-classified video file by using the neural network classification model and the feature combination of the to-be-classified video file. The neural network classification model is established according to the relationship between the features of the video samples and the semantic relationship of the video samples, and the relationship between the features and the semantic relationship are fully considered. Therefore, video classification accuracy is improved.

First claim

Opening claim text (preview).

1. A video classification method, comprising: establishing a neural network classification model according to (a) a relationship between features of video samples and (b) a semantic relationship of the video samples; obtaining a feature combination of a to-be-classified video file; and classifying the to-be-classified video file by using the neural network classification model and the feature combination of the to-be-classified video file.

2. The method according to claim 1, wherein the establishing the neural network classification model according to the relationship between the features of the video samples and the semantic relationship of the video samples comprises: obtaining a weight matrix of a neural network classification model fusion layer and a weight matrix of a neural network classification model classification layer according to the relationship between the features of the video samples and the semantic relationship of the video samples; and establishing the neural network classification model according to the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer.

3. The method according to claim 2, wherein the obtaining the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer according to the relationship between the features of the video samples and the semantic relationship of the video samples comprises: obtaining the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer by optimizing a target function, wherein the target function is:

$$\min_{W,\,\Omega}\ \zeta + \frac{\lambda_1}{2}\,\lVert W_E \rVert_{2,1} + \frac{\lambda_2}{2}\,\operatorname{tr}\left(W_{L-1}\,\Omega\,W_{L-1}^{T}\right) \quad \text{s.t.} \quad \Omega \geq 0,\ \operatorname{tr}(\Omega) = 1$$

wherein $\zeta$ represents a deviation between a predictor and a real value of the video samples, $\lambda_1$ represents a preset first weight coefficient, $\lambda_2$ represents a preset second weight coefficient, $W_E$ represents the weight matrix of the neural network classification model fusion layer, each column of $W_E$ corresponds to a type of feature, $W_{L-1}$ represents the weight matrix of the neural network classification model classification layer, $W_{L-1}^{T}$ represents transposition of $W_{L-1}$, $\lVert W_E \rVert_{2,1}$ represents an $L_{2,1}$ norm of $W_E$, $\Omega$ represents a positive semi-definite symmetric matrix used to represent the semantic relationship, and an initial value of $\Omega$ is an identity matrix.

4. The method according to claim 3, wherein the obtaining the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer by optimizing the target function comprises: obtaining the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer by optimizing the target function by using a proximal gradient method.

5. The method according to claim 4, wherein the optimizing the target function by using the proximal gradient method comprises: initializing the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer that are in the target function; obtaining a deviation between an output predictor and an actual value by inputting the features of the video samples; and adjusting the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer according to the deviation until the deviation is less than a preset threshold.

6. A video classification apparatus, comprising: a processor; and a memory coupled to the processor, where the memory stores processor-executable instructions which, when executed, cause the processor to implement operations including: establishing a neural network classification model according to a relationship between features of video samples and a semantic relationship of the video samples; obtaining a feature combination of a to-be-classified video file; and classifying the to-be-classified video file by using the neural network classification model and the feature combination of the to-be-classified video file.

7. The apparatus according to claim 6, wherein the operations further include: obtaining a weight matrix of a neural network classification model fusion layer and a weight matrix of a neural network classification model classification layer according to the relationship between the features of the video samples and the semantic relationship of the video samples; and establishing the neural network classification model according to the weight matrix of the neural network classification model fusion layer and the weight matrix of the neural network classification model classification layer.

8. The apparatus according to claim 7, wherein the operations further include: obtaining the weight matrix of the neural network
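The two regularization terms of the target function in claim 3, and the kind of proximal step a proximal gradient method (claim 4) could apply to the L2,1 term, can be sketched in NumPy. This is an illustrative sketch under stated assumptions, not the patent's implementation. The function names, the matrix shapes, and the choice to group the L2,1 norm by column (suggested by "each column of W E corresponds to a type of feature") are assumptions. The claims do not spell out a proximal operator, so the standard group soft-thresholding operator is used here. The trace term follows the claim text as it appears on this page.

```python
import numpy as np

def l21_norm(W):
    """L2,1 norm, taken here as the sum of the Euclidean norms of the columns of W.
    Column grouping is an assumption: one column per feature type."""
    return float(np.sum(np.linalg.norm(W, axis=0)))

def regularizer(W_E, W_cls, Omega, lam1, lam2):
    """(lam1 / 2) * ||W_E||_{2,1} + (lam2 / 2) * tr(W_cls @ Omega @ W_cls.T).

    Assumed shapes: W_cls is (d, c) and Omega is (c, c), where c is the
    number of classes. The data term (the deviation) is not included."""
    trace_term = float(np.trace(W_cls @ Omega @ W_cls.T))
    return 0.5 * lam1 * l21_norm(W_E) + 0.5 * lam2 * trace_term

def prox_l21(W, step):
    """Proximal operator of step * ||W||_{2,1} (column-wise group soft-thresholding).
    Columns whose norm is at most `step` are set to zero; the rest are shrunk."""
    norms = np.linalg.norm(W, axis=0, keepdims=True)
    scale = np.maximum(0.0, 1.0 - step / np.maximum(norms, 1e-12))
    return W * scale

# Small hypothetical example: two feature types (columns), two classes.
W_E = np.array([[3.0, 0.0],
                [4.0, 0.1]])
W_cls = np.eye(2)
Omega = np.eye(2) / 2.0          # symmetric, positive semi-definite, trace 1

print(l21_norm(W_E))                              # column norms are 5.0 and 0.1
print(regularizer(W_E, W_cls, Omega, 2.0, 4.0))
print(prox_l21(W_E, 0.5))                         # second column is zeroed out
```

In this sketch the proximal step removes a whole column of the fusion-layer matrix when its norm falls below the threshold, which is the usual motivation for a group-sparse penalty of this form: an entire feature type can be suppressed at once.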

Assignees

Huawei Tech Co Ltd, Univ Fudan

Inventors

Classifications

Code	CPC title
G06V10/806	of extracted features
G06V10/82 (Primary)	using neural networks
G06F18/24133	Distances to prototypes
G06F18/253	of extracted features
G06F18/22	Matching criteria, e.g. proximity measures

Patent family

Related publications grouped by family.

View patent family 52406169

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017228618A1 cover?: A video classification method and apparatus are provided in embodiments of the present invention. The method includes: establishing a neural network classification model according to a relationship between features of video samples and a semantic relationship of the video samples; obtaining a feature combination of a to-be-classified video file; and classifying the to-be-classified video file b…
Who is the assignee on this patent?: Huawei Tech Co Ltd, Univ Fudan