Methods and apparatuses for processing image, methods and apparatuses for training image recognition network and methods and apparatuses for recognizing image

US12002182B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12002182-B2
Application numberUS-202117490754-A
CountryUS
Kind codeB2
Filing dateSep 30, 2021
Priority dateOct 29, 2020
Publication dateJun 4, 2024
Grant dateJun 4, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure relates to methods and apparatuses for processing an image, training an image recognition network and recognizing an image. The method of processing an image includes: obtaining a plurality of original images from an original image set, where at least one of the plurality of original images includes an annotation area; obtaining at least one first image by splicing the plurality of original images; for each of the at least one first image, adjusting a shape and/or size of the first image based on the plurality of original images to form a second image; obtaining respective positions of the at least one annotation area in the second image by converting respective positions of the at least one annotation area in the plurality of original images.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of processing an image, comprising: obtaining a plurality of original images from an original image set, wherein at least one of the plurality of original images comprises an annotation area; obtaining at least one first image by splicing the plurality of original images; for each of the at least one first image, adjusting a shape and/or size of the first image based on the plurality of original images to form a second image; obtaining respective positions of the at least one annotation area in the second image by converting respective positions of the at least one annotation area in the plurality of original images; wherein adjusting the shape and/or size of the first image based on the plurality of original images to form the second image comprises: determining a scaling ratio of an annotation area in the plurality of original images and an annotation area in the second image according to a parameter of an image acquisition device corresponding to the plurality of original images and a parameter of an image acquisition device corresponding to the second image; scaling the first image based on the scaling ratio to form a scaled first image; and adjusting a shape and/or size of the scaled first image to be consistent with those of the plurality of original images to form the second image; wherein obtaining respective positions of the at least one annotation area in the second image by converting respective positions of the at least one annotation area in the plurality of original images comprises: for each of the at least one annotation area, obtaining coordinates of the annotation area in the first image by converting coordinates of the annotation area in the plurality of original images; obtaining coordinates of the annotation area in the scaled first image by converting the coordinates of the annotation area in the first image; and obtaining coordinates of the annotation area in the second image by converting the coordinates of the annotation area in the scaled first image; wherein obtaining the coordinates of the annotation area in the first image by converting the coordinates of the annotation area in the plurality of original images comprises: in response to spaces corresponding to the plurality of original images being unoverlapped, obtaining coordinates of the annotation area in a first coordinate system of the first image by converting the coordinates of the annotation area in the plurality of original images, wherein, the first coordinate system takes a center of the first image as an origin; in response to spaces corresponding to two or more of the plurality of original images being overlapped, obtaining coordinates of the annotation area in a second coordinate system of the first image by converting the coordinates of the annotation area in the plurality of original images, wherein, the second coordinate system takes a center of duplicate area as an origin. 2. The method according to claim 1 , wherein the original image set comprises a plurality of process subsets, each of the process subsets comprises a plurality of type nodes, each of the type nodes comprises a multitude of original images, and annotation areas of original images in each of the type nodes are annotated with labels corresponding to the type node; and obtaining the plurality of original images from the original image set comprises: obtaining the plurality of original images from a same type node in a same process subset, or obtaining the plurality of original images from different type nodes in a same process subset, or obtaining the plurality of original images from same type of type nodes in different process subsets. 3. The method according to claim 2 , wherein original images in one type node of each of the process subsets are unannotated images. 4. The method according to claim 1 , wherein obtaining the plurality of original images from the original image set comprises: obtaining N 2 original images, wherein N is an integer greater than or equal to 2. 5. The method according to claim 4 , wherein obtaining the at least one first image by splicing the plurality of original images comprises: in response to spaces corresponding to the plurality of original images being unoverlapped, forming the at least one first image by arranging the plurality of original images into N rows and N columns and splicing; in response to spaces corresponding to two or more of the plurality of original images being overlapped, forming the at least one first image by superposing duplicate areas of the two or more of the plurality of original images. 6. A method of training an image recognition network, comprising: training the image recognition network using an image training set, wherein images in the image training set are processed using the method of processing an image according to claim 1 . 7. A method of recognizing an image, comprising: recognizing an image by using an image recognition network, wherein the image recognition network is trained using the training method according to claim 6 . 8. The method according to claim 1 , wherein adjusting the shape and/or size of the scaled first image to be consistent with those of the plurality of original images to form the second image comprises: padding the scaled first image and/or cropping the scaled first image to form the second image in a shape and size consistent with the plurality of original images. 9. The method according to claim 1 , wherein obtaining the coordinates of the annotation area in the second image by converting the coordinates of the annotation area in the scaled first image comprises: obtaining coordinates of the annotation area in a third coordinate system by converting coordinates of the annotation area in the first coordinate system scaled or the second coordinate system scaled, wherein the third coordinate system takes an upper left corner of the second image as an origin. 10. A device comprising: a memory, and a processor, wherein the memory is configured to store computer instructions executable on the processor, and the processor is configured to execute the computer instructions to implement: obtaining a plurality of original images from an original image set, wherein at least one of the plurality of original images comprises an annotation area; obtaining at least one first image by splicing the plurality of original images; for each of the at least one first image, adjusting a shape and/or size of the first image based on the plurality of original images to form a second image; obtaining respective positions of the at least one annotation area in the second image by converting respective positions of the at least one annotation area in the plurality of original images; wherein adjusting the shape and/or size of the first image based on the plurality of original images to form the second image comprises: determining a scaling ratio of an annotation area in the plurality of original images and an annotation area in the second image according to a parameter of an image acquisition device corresponding to the plurality of original images and a parameter of an image acquisition device corresponding to the second image; scaling the first image based on the scaling ratio to form a scaled first image; and adjusting a shape and/or size of the scaled first image to be consistent with those of the plurality of original images to form the second image; wherein obtaining respective positions of the at least one annotation area in the second image by converting respective positions of the at least one annotation area in the plurality of original images comprises: for each of the at least one annotati

Assignees

Inventors

Classifications

  • G06T3/40Primary

    Scaling of whole images or parts thereof, e.g. expanding or contracting · CPC title

  • using affine transformations · CPC title

  • using two or more images, e.g. averaging or subtraction · CPC title

  • Creating or editing images; Combining images with text · CPC title

  • Training; Learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12002182B2 cover?
The present disclosure relates to methods and apparatuses for processing an image, training an image recognition network and recognizing an image. The method of processing an image includes: obtaining a plurality of original images from an original image set, where at least one of the plurality of original images includes an annotation area; obtaining at least one first image by splicing the pl…
Who is the assignee on this patent?
Beijing Zhongxiangying Tech Co Ltd, Boe Technology Group Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T3/40. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 04 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).