Feature compression and localization for autonomous devices

US11715012B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11715012-B2
Application numberUS-201916598561-A
CountryUS
Kind codeB2
Filing dateOct 10, 2019
Priority dateNov 16, 2018
Publication dateAug 1, 2023
Grant dateAug 1, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, tangible non-transitory computer-readable media, and devices associated with object localization and generation of compressed feature representations are provided. For example, a computing system can access source data and target data. The source data can include a source representation of an environment including a source object. The target data can include a compressed target feature representation of the environment. The compressed target feature representation can be based on compression of a target feature representation of the environment produced by machine-learned models. A source feature representation can be generated based on the source representation and the machine-learned models. The machine-learned models can include machine-learned feature extraction models or machine-learned attention models. A localized state of the source object with respect to the environment can be determined based on the source feature representation and the compressed target feature representation.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for localization of objects, the computer-implemented method comprising: accessing source data and target data, the source data comprising a source representation of an environment comprising a source object, the target data comprising a compressed target feature representation of the environment, wherein the compressed target feature representation is based at least in part on compression of a target feature representation of the environment produced by one or more machine-learned feature extraction models; generating a source feature representation based at least in part on the source representation and the one or more machine-learned feature extraction models; and generating a decompressed target feature representation based at least in part on one or more lossless binary decoding operations; generating a reconstructed target feature representation based at least in part on the decompressed target feature representation and a machine-learned decoding model; and determining a localized state of the source object with respect to the environment based at least in part on one or more comparisons of the source feature representation to the reconstructed target feature representation. 2. The computer-implemented method of claim 1 , wherein the determining the localized state of the source object with respect to the environment based at least in part on the source feature representation and the compressed target feature representation comprises: generating a reconstructed target feature representation based at least in part on the compressed target feature representation and a machine-learned reconstruction model, wherein the reconstructed target feature representation is a reconstruction of the target feature representation; and determining the localized state of the source object based at least in part on one or more comparisons of the source feature representation to the reconstructed target feature representation. 3. The computer-implemented method of claim 2 , wherein the determining the localized state of the source object based at least in part on one or more comparisons of the source feature representation to the reconstructed target feature representation comprises: determining one or more correlations between the reconstructed target feature representation and the source feature representation based at least in part on a probabilistic inference model configured to encode agreement between the source feature representation and the reconstructed target feature representation indexed at the localized state of the source object. 4. The computer-implemented method of claim 2 , wherein the compressed target feature representation is based at least in part on an encoding of the target feature representation using one or more lossless compression operations, and wherein the generating the reconstructed target feature representation based at least in part on the compressed target feature representation and the machine-learned reconstruction model, wherein the reconstructed target feature representation is a reconstruction of the target feature representation comprises: generating a decoded target feature representation of the compressed target feature representation based at least in part on the one or more lossless compression operations, wherein the one or more lossless compression operations comprise one or more lossless binary encoding operations; and generating the target feature representation based at least in part on the decoded target feature representation and the machine-learned reconstruction model. 5. The computer-implemented method of claim 1 , wherein the determining the localized state of the source object with respect to the environment based at least in part on the source feature representation and the compressed target feature representation comprises: rotating the source feature representation to a plurality of candidate angles; and determining at the plurality of candidate angles, whether the source feature representation matches the compressed target feature representation. 6. The computer-implemented method of claim 1 , wherein the compressed target feature representation of the environment is based at least in part on an attended feature representation of the target feature representation generated by a machine-learned attention model configured to mask one or more portions of the target feature representation. 7. The computer-implemented method of claim 1 , wherein the source data is based at least in part on one or more sensor outputs from one or more sensors comprising at least one of: one or more light detection and ranging (LiDAR) devices, one or more sonar devices, one or more radar devices, or one or more cameras. 8. The computer-implemented method of claim 1 , wherein the one or more machine-learned feature extraction models comprise a first machine-learned extraction model configured to generate the source feature representation and a second machine-learned model configured to generate the target feature representation. 9. A computing system comprising: one or more processors; one or more machine-learned feature extraction models configured to access training data comprising one or more representations of a training environment and generate one or more feature extracted representations of the training environment; and one or more tangible non-transitory computer-readable media storing computer-readable instructions that are executable by one or more processors to cause the one or more processors to perform operations, the operations comprising: accessing training data comprising a source representation of the training environment and a target representation of the training environment, wherein the source representation is associated with a ground-truth state of a source object in the training environment; generating a source feature representation and a target feature representation based at least in part on the one or more machine-learned feature extraction models accessing the source representation and the target representation respectively; generating a compressed target feature representation of the target feature representation based at least in part on one or more machine-learned compression models; generating a decompressed target feature representation based at least in part on one or more lossless binary decoding operations; generating a reconstructed target feature representation based at least in part on the decompressed target feature representation and a machine-learned decoding model; determining a localized state of the source object within the target representation of the environment based at least in part on one or more comparisons of the source feature representation to the reconstructed target feature representation; determining a loss based at least in part on one or more comparisons of the localized state of the source object to the ground-truth state of the source object; and adjusting one or more parameters of the one or more machine-learned compression models based at least in part on the loss. 10. The computing system of claim 9 , wherein the generating the compressed target feature representation of the target feature representation based at least in part on the one or more machine-learned compression models comprises: generating an encoded target feature representation based at least in part the target feature representation and a machine-learned encoding model; generating the compressed target feature representation based at least in part on use of one or more lossless binary encoding operations on the encoded target feature representation; and wherein adjusting the one or more parame

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Learning methods · CPC title

  • Quantised networks; Sparse networks; Compressed networks · CPC title

  • Supervised learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11715012B2 cover?
Systems, methods, tangible non-transitory computer-readable media, and devices associated with object localization and generation of compressed feature representations are provided. For example, a computing system can access source data and target data. The source data can include a source representation of an environment including a source object. The target data can include a compressed targe…
Who is the assignee on this patent?
Uatc Llc
What technology area does this patent fall under?
Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 01 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).