Who is the assignee on this patent?

Beijing Baidu Netcom Sci & Tech Co Ltd

What technology area does this patent fall under?

Primary CPC classification G06V20/588. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Sep 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Method and apparatus for correcting perspective of road image

US12423853B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12423853-B2
Application number	US-202217976367-A
Country	US
Kind code	B2
Filing date	Oct 28, 2022
Priority date	Dec 21, 2021
Publication date	Sep 23, 2025
Grant date	Sep 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and apparatus for processing an image. The method may include: acquiring a top view of a road; identifying a position of a lane line from the top view; cutting the top view into at least two areas, and determining, according to the position of the lane line in each area, a width of a lane in the each area and an average width of the lane in the top view; calculating a first perspective correction matrix by optimizing a first loss function, the first loss function being used to represent a difference between the width of the lane in the each area and the average width of the lane in the top view; and performing a lateral correction on the top view through the first perspective correction matrix to obtain a first corrected image.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for processing an image, comprising: acquiring a top view of a road; identifying a position of a lane line from the top view; cutting the top view into at least two areas, and determining, according to the position of the lane line in each area, a width of a lane in the each area and an average width of the lane in the top view; calculating a first perspective correction matrix by optimizing a first loss function, the first loss function being used to represent a difference between the width of the lane in the each area and the average width of the lane in the top view; and performing a lateral correction on the top view through the first perspective correction matrix to obtain a first corrected image. 2. The method according to claim 1 , further comprising: identifying a dashed lane line from the top view; determining a length of the dashed lane line in the each area and an average length of the dashed lane line in the top view; calculating a second perspective correction matrix by optimizing a second loss function, the second loss function being used to represent a difference between the length of the dashed lane line in the each area and the average length of the dashed lane line in the top view; and performing a longitudinal correction on the first corrected image through the second perspective correction matrix to obtain a second corrected image. 3. The method according to claim 1 , wherein the identifying the position of the lane line from the top view comprises: identifying a pixel and type of the lane line from the top view through a semantic segmentation model, to generate a semantic segmentation image; and extracting the position of the lane line from the semantic segmentation image. 4. The method according to claim 3 , wherein the extracting the position of the lane line from the semantic segmentation image comprises: transforming the semantic segmentation image into a binary image; performing a contour detection on the binary image to obtain a rectangular contour; splitting the rectangular contour into a plurality of segments along a direction of a long side of the rectangular contour, and performing the contour detection on each segment again to generate a plurality of sub-contours; extracting a center line of each sub-contour rectangle as a linear vector of the lane line; and fitting the linear vector of the lane line through a quadratic curve, and predicting and supplementing a missing part of the lane line. 5. The method according to claim 1 , wherein the determining, according to the position of the lane line in the each area, the width of the lane in the each area and the average width of the lane in the top view comprises: performing a near neighbor search on the identified lane line, and pairing each two lane lines to obtain a matching pair set, wherein each matching pair corresponds to one lane; calculating, for each lane, a lane width of a middle position of the each area as a width of the lane in the each area; and calculating the average width of the lane in the top view based on the width of the lane in the each area. 6. The method according to claim 2 , wherein the determining the length of the dashed lane line in the each area and the average length of the dashed lane line in the top view comprises: using, for the each area, a length of a complete dashed lane line in middle of the area as the length of the dashed lane line in the area; and calculating the average length of the dashed lane line based on the length of the dashed lane line in the each area. 7. The method according to claim 1 , wherein the acquiring the top view of the road comprises: acquiring a panoramic view of the road; and transforming the panoramic view into the top view through a perspective projection method. 8. An electronic device, comprising: at least one processor; and a memory, communicatively connected to the at least one processor, wherein the memory stores an instruction executable by the at least one processor, and the instruction is executed by the at least one processor, to enable the at least one processor to perform operations, the operations comprising: acquiring a top view of a road; identifying a position of a lane line from the top view; cutting the top view into at least two areas, and determining, according to the position of the lane line in each area, a width of a lane in the each area and an average width of the lane in the top view; calculating a first perspective correction matrix by optimizing a first loss function, the first loss function being used to represent a difference between the width of the lane in the each area and the average width of the lane in the top view; and performing a lateral correction on the top view through the first perspective correction matrix to obtain a first corrected image. 9. The electronic device according to claim 8 , further comprising: identifying a dashed lane line from the top view; determining a length of the dashed lane line in the each area and an average length of the dashed lane line in the top view; calculating a second perspective correction matrix by optimizing a second loss function, the second loss function being used to represent a difference between the length of the dashed lane line in the each area and the average length of the dashed lane line in the top view; and performing a longitudinal correction on the first corrected image through the second perspective correction matrix to obtain a second corrected image. 10. The electronic device according to claim 8 , wherein the identifying the position of the lane line from the top view comprises: identifying a pixel and type of the lane line from the top view through a semantic segmentation model, to generate a semantic segmentation image; and extracting the position of the lane line from the semantic segmentation image. 11. The electronic device according to claim 10 , wherein the extracting the position of the lane line from the semantic segmentation image comprises: transforming the semantic segmentation image into a binary image; performing a contour detection on the binary image to obtain a rectangular contour; splitting the rectangular contour into a plurality of segments along a direction of a long side of the rectangular contour, and performing the contour detection on each segment again to generate a plurality of sub-contours; extracting a center line of each sub-contour rectangle as a linear vector of the lane line; and fitting the linear vector of the lane line through a quadratic curve, and predicting and supplementing a missing part of the lane line. 12. The electronic device according to claim 8 , wherein the determining, according to the position of the lane line in the each area, the width of the lane in the each area and the average width of the lane in the top view comprises: performing a near neighbor search on the identified lane line, and pairing each two lane lines to obtain a matching pair set, wherein each matching pair corresponds to one lane; calculating, for each lane, a lane width of a middle position of the each area as a width of the lane in the each area; and calculating the average width of the lane in the top view based on the width of the lane in the each area. 13. The electronic device according to claim 9 , wherein the determining the length of the dashed lane line in the each area and the average length of the dashed lane line in the top view comprises: using, for the each area, a length of a complete dashed lane line in middle of the area as the length of the dashed lane line in the area; and calc

Assignees

Beijing Baidu Netcom Sci & Tech Co Ltd

Inventors

Classifications

G06T2207/30256
Lane; Road marking · CPC title
G06V20/588Primary
Recognition of the road, e.g. of lane markings; Recognition of the vehicle driving pattern in relation to the road · CPC title
G06V10/44
Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components · CPC title
G06T5/80
Geometric correction · CPC title
G06V10/24
Aligning, centring, orientation detection or correction of the image · CPC title

Patent family

Related publications grouped by family.

View patent family 80828469

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12423853B2 cover?: A method and apparatus for processing an image. The method may include: acquiring a top view of a road; identifying a position of a lane line from the top view; cutting the top view into at least two areas, and determining, according to the position of the lane line in each area, a width of a lane in the each area and an average width of the lane in the top view; calculating a first perspective…
Who is the assignee on this patent?: Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?: Primary CPC classification G06V20/588. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Sep 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).