Device and method for performing simultaneous localization and mapping

US2020334841A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2020334841-A1
Application numberUS-202016920058-A
CountryUS
Kind codeA1
Filing dateJul 2, 2020
Priority dateSep 7, 2018
Publication dateOct 22, 2020
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device and method perform Simultaneous Localization and Mapping (SLAM). The device includes at least one processor configured to perform the SLAM method, which includes the following operations. Preprocess, in a first processing stage, a received data sequence including multiple images recorded by a camera and sensor readings from multiple sensors in order to obtain a frame sequence. Each frame of the frame sequence includes a visual feature set related to one of the images at a determined time instance and sensor readings from that time instance. Sequentially process, in a second processing stage, each frame of the frame sequence based on the visual feature set and the sensor readings included in that frame in order to generate a sequence mapping graph. Merge, in a third processing stage, the sequence mapping graph with at least one other graph, in order to generate or update a full graph.

First claim

Opening claim text (preview).

What is claimed is: 1 . A device for performing simultaneous localization and mapping (SLAM), the device comprising at least one processor configured to: preprocess, in a first processing stage, a received data sequence comprising multiple images recorded by a camera and sensor readings from multiple sensors in order to obtain a frame sequence, each frame of the frame sequence comprising a visual feature set related to one of the images at a determined time and the respective sensor readings from the determined time; sequentially process, in a second processing stage, each frame of the frame sequence based on the visual feature set and the sensor readings comprised in that frame in order to generate a sequence mapping graph; and merge, in a third processing stage, the sequence mapping graph with at least one other graph in order to generate or update a full graph. 2 . The device according to claim 1 , wherein: the visual feature set comprises an image feature set comprising one or more 2D key points extracted from the related one of the images, descriptors corresponding to the 2D key points, and disparity or depth information of the 2D key points. 3 . The device according to claim 2 , wherein the at least one processor is configured to, in the first processing stage: extract an image from the data sequence, the image being one of the multiple images; rectify the image; extract the 2D key points from the rectified image; and generate the image feature set based on the extracted 2D key points. 4 . The device according to claim 3 , wherein the at least one processor is configured to, in the first processing stage: assign one or more semantic labels to pixels of the rectified image; and filter the image feature set based on the semantic labels to remove the 2D key points from the image feature set related to objects labelled as dynamic objects. 5 . The device according to claim 4 , wherein the at least one processor is further configured to, in the first processing stage: generate the visual feature set by adding a bag-of-words descriptor to the filtered image feature set, and generate a respective frame of the frame sequence by combining the visual feature set with the sensor readings from a same time instance of the image. 6 . The device according to claim 1 , wherein the at least one processor is configured to, in the second processing stage: perform camera tracking based on the visual feature set included in a respective frame of the frame set by matching 2D key points in the visual feature set to locally stored 3D key points, in order to obtain a camera pose associated with the respective frame. 7 . The device according to claim 6 , wherein the at least one processor is configured to: determine whether the frame is a key frame based on a number of the matched 2D key points. 8 . The device according to claim 7 , wherein the at least one processor is further configured to, in the second processing stage, based upon determining that the frame is the key frame: perform a first local bundle adjustment (LBA) based on a camera pose in order to obtain visual odometry information and a LBA graph; calculate a fused camera pose based on the visual odometry information and the sensor readings included in the frame; and perform a second LBA based on the fused camera pose and the LBA graph in order to obtain the sequence mapping graph. 9 . The device according to claim 1 , wherein the at least one processor is further configured to, in the third processing stage: detect a presence of one or more loops or overlapping areas shared among the sequence mapping graph and the at least one further graph; merge the sequence mapping graph and the at least one further graph in order to obtain an intermediate graph; and perform a graph optimization on the intermediate graph based on the detected loops or the overlapping areas in order to obtain the full graph. 10 . The device according to claim 1 , wherein at least two of the first processing stage, the second processing stage, or the third processing stage are performed in different processors of the at least one processor. 11 . The device according to claim 1 , wherein: the device is a distributed device and comprises at least one terminal device and at least one network device, a processor of the terminal device is configured to perform the first processing stage and transmit the obtained frame sequence to the network device, a processor of the network device is configured to perform the second and third processing stages, and the at least one processor comprises the processor of the terminal device and the processor of the network device. 12 . The device according to claim 11 , wherein the processor of the terminal device is further configured to: perform a real-time localization based on the frame sequence obtained in the first processing stage. 13 . The device according to claim 12 , wherein the processor of the terminal device is further configured to, in the second processing stage, based upon determining that a frame of the frame sequence is a key frame: perform a first local bundle adjustment (LBA) based on a camera pose in order to obtain visual odometry information and a LBA graph; calculate a fused camera pose based on the visual odometry information and the sensor readings included in the frame; and perform a fusion tracking procedure based on the fused camera pose, the LBA graph, and a current full graph in order to obtain a current camera pose 14 . The device according to claim 11 , wherein the terminal device is located in a vehicle, and the vehicle comprises the at least one camera comprising the camera and at least one of the multiple sensors. 15 . A method for performing simultaneous localization and mapping (SLAM), the method comprising: preprocessing, in a first processing stage, a received data sequence comprising multiple images recorded by a camera and sensor readings from multiple sensors in order to obtain a frame sequence, each frame of the frame sequence comprises a visual feature set related to one of the images at a determined time and sensor readings from the determined time; sequentially processing, in a second processing stage, each frame of the frame sequence based on the visual feature set and the sensor readings comprised in that frame in order to generate a sequence mapping graph; and merging, in a third processing stage, the sequence mapping graph with at least one other graph in order to generate or update a full graph. 16 . The device according to claim 5 , wherein the at least one processor is further configured to, in the first processing stage: generate the visual feature set by adding a hash table for searching the 2D key points.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020334841A1 cover?
A device and method perform Simultaneous Localization and Mapping (SLAM). The device includes at least one processor configured to perform the SLAM method, which includes the following operations. Preprocess, in a first processing stage, a received data sequence including multiple images recorded by a camera and sensor readings from multiple sensors in order to obtain a frame sequence. Each fra…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T7/579. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 22 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).