Systems and methods for identifying trees and estimating tree heights and other tree parameters
US-2024395033-A1 · Nov 28, 2024 · US
US12347159B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12347159-B2 |
| Application number | US-202017122041-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 15, 2020 |
| Priority date | May 22, 2020 |
| Publication date | Jul 1, 2025 |
| Grant date | Jul 1, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
This disclosure relates generally to action recognition and more particularly to system and method for real-time radar-based action recognition. The classical machine learning techniques used for learning and inferring human actions from radar images are compute intensive, and require volumes of training data, making them unsuitable for deployment on network edge. The disclosed system utilizes neuromorphic computing and Spiking Neural Networks (SNN) to learn human actions from radar data captured by radar sensor(s). In an embodiment, the disclosed system includes a SNN model having a data pre-processing layer, Convolutional SNN layers and a Classifier layer. The preprocessing layer receives radar data including doppler frequencies reflected from the target and determines a binarized matrix. The CSNN layers extracts features (spatial and temporal) associated with the target's actions based on the binarized matrix. The classifier layer identifies a type of the action performed by the target based on the features.
Opening claim text (preview).
What is claimed is: 1. A processor implemented method for real-time radar based recognition of an action performed by a target by employing a spiking neural network (SNN) model, the method comprising: receiving, by a data preprocessing layer of a spiking neural network (SNN), radar data acquired by one or more radar sensors, wherein the radar data is indicative of one or more actions performed by the target, wherein the radar data comprises a plurality of Doppler frequencies reflected from the target upon motion of the target with respect to the one or more radar sensors, wherein the spiking neural network (SNN) model is employed for real-time radar based recognition of an action performed by a target, the SNN model comprising a data pre-processing layer, a plurality of Convolutional Spiking neural network (CSNN) layers and a classifier layer; determining, by the data preprocessing layer, a first binarized matrix associated with the radar data; extracting, by the plurality of CSNN layers pre-trained on training data, a set of features associated with the one or more actions of the target based on the first binarized matrix, the set of features comprising a first set of spatial features and a first set of temporal features, wherein each CSNN layer of the plurality of CSNN layers comprises a set of class-wise filter-blocks connected via a lateral inhibition mechanism, which forms a convolutional spiking layer that takes the preprocessed radar data as input, wherein each class-wise filter-block of the set of class-wise filter-blocks comprises a set of filters controlled by a switcher node to enable each CSNN layer of the plurality of CSNN layers to capture spatially collocated patterns within a spike frame of a single action class associated with the action, wherein the switcher node applies inhibition to force all but one filter in the each class-wise filter-block to an inactive state for a predetermined inactivity duration, wherein the predetermined inactivity duration depends on strength of inhibition, wherein the spike frame is a 1-D binary image binarized radar spectrogram representing an action, wherein each pixel within the spike frame is connected to a single neuron of the class-wise filter-block; extracting the first set of spatial features hierarchically by convolving over the plurality of CSNN layers, wherein convolving over the plurality of CSNN layers increases complexity in the first set of spatial features from an initial CSNN layer to a last CSNN layer of the plurality of CSNN layers; wherein convolving over the plurality of CSNN layers comprises iteratively selecting a filter from amongst the set of filters to cause a spike upon an elapse of the predetermined inactivity duration thereby enabling the spatially collocated but temporally separable features to appear on distinct filters from the set of filters, wherein a filter from amongst the set of filters causing a maximum spike is selected, and wherein iteratively selecting the filter of the set of filters comprises: activating a class-wise filter-block from amongst the set of class-wise filter-blocks at a time for a frame sequence associated with the action; and applying a long-term inhibition to disable more than one class-wise filter-blocks from amongst the set of class-wise filter-blocks to learn a redundant pattern and enable lateral inhibition among the set of class-wise filter-blocks and allowing the set of class-wise filter-blocks to compete for a plurality of action classes, wherein applying the long-term inhibition comprises: initializing weights associated with the filters in the set of class-wise filter-blocks randomly; determining a class-wise filter-block from amongst the set of class-wise filter-blocks selected for a distinct action class from amongst the plurality of action classes; and sending an inhibition signal to remaining class-wise filter-blocks from amongst the set of class-wise filter-blocks to prevent the remaining class-wise filter-blocks from being activated; and identifying, by the classifier layer, a type of the action from amongst the one or more actions performed by the target based on the set of features, wherein the spatial features and the temporal features from the CSNN layer corresponding to respective actions is taken as an input to the classifier layer, using the type of action identified by the classifier layer in at least one of monitoring, surveillance and healthcare applications, wherein the plurality of CSNN layers are trained using an unsupervised training technique for identifying the plurality of actions using the training data, wherein training the plurality of CSNN layers comprises: receiving, by the data preprocessing layer, the training data acquired by the one or more radar sensors, the training data indicative of the plurality of actions performed by one or more targets, wherein the training data comprises time series data comprising a plurality of Doppler frequencies reflected from the plurality of targets upon motion of the plurality of targets with respect to the one or more radar sensors; determining a plurality of second binarized matrices associated with each of the plurality of actions by the preprocessing layer, wherein determining the plurality of second binarized matrices comprises: computing a plurality of spectrograms for the plurality of actions as a time-frequency domain representation of the time series data by using a Short-time Fourier Transform (STFT) model; performing a modulus operation on the STFT model to obtain a real valued matrix; consecutively converting the real matrix into a grayscale image into a second binary matrix of the plurality of second binary matrices using a threshold; extracting, by the plurality CSNN layers, a set of training features associated with the plurality of actions of the target based on the second plurality of binarized matrices, the set of training features comprising a second set of spatial features and a second set of temporal features; and identifying, by the classifier layer, a type of the action from amongst the plurality of actions performed by the target based on the second set of features. 2. The method of claim 1 , wherein determining the first binarized matrix comprises: transforming the radar data to obtain an input spectrogram using a Short-time Fourier Transform (STFT) model; and performing a greyscale conversion and binarization of the input spectrogram to obtain the first binarized matrix. 3. The method of claim 1 , wherein a radar sensor from amongst the one or more radar sensors comprises a Continuous Wave (CW) type radar sensor. 4. The method of claim 1 , wherein a radar sensor from amongst the one or more radar sensors comprises a frequency modulated continuous wave (FMCW) type radar sensor. 5. A system for real-time radar based recognition of an action performed by a target by employing a spiking neural network (SNN) model, the system, comprising: one or more memories; and one or more hardware processors, the one or more memories coupled to the one or more hardware processors, wherein the one or more hardware processors are configured to execute programmed instructions stored in the one or more memories, to: receive, by a data preprocessing layer of a spiking neural network (SNN), radar data acquired by one or more radar sensors, the radar data is indicative of one or more actions performed by the target, wherein the radar data comprises a plurality of Doppler frequencies reflected from the target upon motion of the target with respect to the one or more radar sensors, wherein the spiking neural network (SNN) model is employed for real-time radar based recognition of an action performed by a target, the SNN model comprising a data pre-processing layer, a plurality of Convolutional Spiking neural network (CSNN) lay
Convolutional networks [CNN, ConvNet] · CPC title
Movements or behaviour, e.g. gesture recognition (recognition of facial expressions G06V40/16) · CPC title
using neural networks · CPC title
Classification techniques · CPC title
Learning methods · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.