Object of interest based image processing

US9247203B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9247203-B2
Application numberUS-201113994077-A
CountryUS
Kind codeB2
Filing dateApr 11, 2011
Priority dateApr 11, 2011
Publication dateJan 26, 2016
Grant dateJan 26, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus, a method and a system are provided, wherein the system includes an encoding engine to encode and/or compress one or more objects of interest within individual image frames with higher bit densities than the bit density employed to encode and/or compress their background. The system may further include a context engine to identify a region of interest including at least a part of the one or more objects of interest, and scale the region of interest within individual image frames to emphasize the objects of interest.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus comprising: an encoding engine configured to: receive a plurality of image frames of a video signal; identify a region of interest, ROI, associated with one or more objects of interest, OOI, within respective image frames where a center point of the ROI is determined based at least in part on center points of the OOI, and where the center point of the ROI is identified as the weighted center of all of the OOI within the ROI region as identified by P c = ∑ i = 0 N ⁢ P i / N  where P i is the center position of each of the detected OOI regions, and N is the number of detected OOI regions; encode the ROI associated with one or more OOI within respective image frames based on one or more bit densities that are higher than a bit density of a background region, wherein the background region surrounds the one or more ROI associated with the one or more OOI; and a transmitter coupled to the encoding engine and configured to transmit the encoded plurality of image frames to one or more recipients. 2. The apparatus of claim 1 , wherein the one or more OOI include one or more faces of participants of a video conference. 3. The apparatus of claim 1 , further comprising a context engine coupled to the encoding engine and configured to identify the one or more OOI within the respective image frames. 4. The apparatus of claim 3 , wherein the context engine is further configured to identify a region of interest within the respective image frames, wherein the ROI includes at least partially the one or more OOI. 5. The apparatus of claim 4 , wherein the context engine is further configured to enlarge the ROI within the respective image frames to display the ROI more prominently within the respective image frames. 6. The apparatus of claim 4 , wherein the context engine is further configured to adjust the ROI to place at least one of the one or more OOI centrally within the respective image frames. 7. The apparatus of claim 4 , wherein the context engine is further configured to adjust, based on a context information, the ROI to place at least one of the one or more OOI in an off-center position within the respective image frames. 8. The apparatus of claim 6 , wherein the one or more OOI include one or more faces of participants of a video conference, and wherein the context information includes face orientations of the one or more faces. 9. The apparatus of claim 1 , wherein the transmitter is further configured to transmit the one or more OOI and the background region separately. 10. The method of claim 9 , further comprising transmitting the encoded plurality of image frames to one or more recipients, wherein said transmitting includes transmitting the one or more OOI and the background region separately. 11. The apparatus of claim 1 , wherein the center point of the ROI is identified as the center point of the biggest OOI region within the ROI as identified by P c =arg max(size(P i )) where arg max (size(P i )) is the center point of the OOI region that has the biggest size. 12. A method comprising: receiving a plurality of image frames of a video signal; identifying a region of interest, ROI, associated with one or more objects of interest, OOI, within respective image frames where a center point of the ROI is determined based at least in part on center points of the OOI, and where the center point of the ROI is identified as the weighted center of all of the OOI within the ROI region as identified by P c = ∑ i = 0 N ⁢ P i / N where P i is the center position of each of the detected OOI regions, and N is the number of detected OOI regions; encoding the ROI associated with one or more OOI within respective image frames based on one or more bit densities higher than a bit density of a background region, wherein the background region surrounds the one or more ROI associated with the one or more OOI; and transmitting the encoded plurality of image frames to one or more recipients. 13. The method of claim 12 , further comprising identifying the one or more OOI within the respective image frames. 14. The method of claim 12 , further comprising identifying a ROI within the respective image frames, wherein the ROI includes at least partially the one or more OOI. 15. The method of claim 14 , further comprising enlarging the ROI within the respective image frames to display the ROI more prominently within the respective image frames. 16. The method of claim 14 , further comprising adjusting the ROI within the respective image frames to place at least one of the one or more OOI centrally within the respective image frames. 17. The method of claim 14 , further comprising adjusting, based on a context information, the ROI to place at least one of the one or more OOI at an off-center position within the respective image frames. 18. The method of claim 17 , wherein the one or more OOI include one or more faces of participants of a video conference, and wherein the context information includes face orientations of the one or more faces. 19. The method of claim 12 , wherein the center point of the ROI is identified as the center point of the biggest OOI region within the ROI as identified by P c =arg max(size(P i )) where arg max (size(P i )) is the center point of the OOI region that has the biggest size. 20. A system comprising: a camera configured to capture a video signal having a plurality of image frames; an encoding engine operatively coupled to the camera and configured to: receive the plurality of captured image frames; identify a region of interest, ROI, associated with one or more objects of interest, OOI, within respective image frames where a center point of the ROI is determined based at least in part on center points of the OOI, and where the center point of the ROI is identified as the weighted center of all of the OOI within the ROI region as identified by P c = ∑ i = 0 N ⁢ P i

Assignees

Inventors

Classifications

  • H04N7/15Primary

    Conference systems · CPC title

  • involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution · CPC title

  • the unit being an image region, e.g. an object · CPC title

  • H04N19/115Primary

    Selection of the code volume for a coding unit prior to coding · CPC title

  • Position within a video image, e.g. region of interest [ROI] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9247203B2 cover?
An apparatus, a method and a system are provided, wherein the system includes an encoding engine to encode and/or compress one or more objects of interest within individual image frames with higher bit densities than the bit density employed to encode and/or compress their background. The system may further include a context engine to identify a region of interest including at least a part of t…
Who is the assignee on this patent?
Li Qiang, Li Wenlong, Wang Peng, and 4 more
What technology area does this patent fall under?
Primary CPC classification H04N7/15. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Jan 26 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).