Automatically identifying and displaying objects of interest in a graphic novel

US2017365083A1 · US · A1

Patent metadata
Field: Value
Publication number: US-2017365083-A1
Application number: US-201615186208-A
Country: US
Kind code: A1
Filing date: Jun 17, 2016
Priority date: Jun 17, 2016
Publication date: Dec 21, 2017
Grant date: (none listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Locations and presentation orders of objects of interest (e.g., speech bubbles) in digital graphic novel content are identified such that expanded versions of the objects of interest can be presented to a reader. Specifically, digital graphic novel content is received and locations of interest regions (e.g., rectangular text regions of speech bubbles) in the content are identified by applying a machine-learned model to the content. Locations and presentation orders of objects of interest in the digital graphic novel content are identified based on the identified locations of the interest regions. The digital graphic novel content and presentation metadata including the locations and presentation orders of the objects of interest are provided to a reading device such that expanded versions of the objects of interest are presented to the user in accordance with the presentation metadata.
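The abstract does not specify a format for the presentation metadata. As a minimal sketch, assuming a JSON payload with one bounding box and one order index per object of interest, the server-side packaging and the reader-side lookup might look like the following. The names `ObjectOfInterest`, `build_presentation_metadata`, and `next_object`, and the `(left, top, right, bottom)` box layout, are hypothetical and do not come from the patent.

```python
import json
from dataclasses import dataclass, asdict
from typing import List, Optional, Tuple


@dataclass
class ObjectOfInterest:
    # Hypothetical layout: (left, top, right, bottom) in page pixels.
    bbox: Tuple[int, int, int, int]
    # 1-based presentation order on the page.
    order: int


def build_presentation_metadata(page: int, objects: List[ObjectOfInterest]) -> str:
    """Server side: serialize locations and presentation orders for one page."""
    ordered = sorted(objects, key=lambda o: o.order)
    return json.dumps({"page": page, "objects": [asdict(o) for o in ordered]})


def next_object(metadata_json: str, current_order: int) -> Optional[list]:
    """Reader side: return the box to expand after current_order, or None."""
    for obj in json.loads(metadata_json)["objects"]:
        if obj["order"] > current_order:
            return obj["bbox"]
    return None


meta = build_presentation_metadata(
    3,
    [
        ObjectOfInterest(bbox=(200, 40, 320, 110), order=2),
        ObjectOfInterest(bbox=(20, 30, 150, 90), order=1),
    ],
)
first = next_object(meta, 0)  # box of the object with order 1
```

A reading device could call `next_object` each time the reader advances, then zoom to the returned box to show the expanded version of that object.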

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method of providing digital graphic novel content to a reading device, the method comprising: receiving digital graphic novel content; identifying locations of a plurality of interest regions of the digital graphic novel content by applying a model to the digital graphic novel content; identifying locations and presentation orders of a plurality of objects of interest in the digital graphic novel content based on the identified locations of the plurality of interest regions; creating presentation metadata for the digital graphic novel content indicating the identified locations and presentation orders of the plurality of objects of interest; and providing the digital graphic novel content and the presentation metadata to the reading device for presentation of expanded versions of the plurality of objects of interest in accordance with the presentation metadata.

2. The computer-implemented method of claim 1, wherein the model is a machine-learned model, and further comprising building the machine-learned model, the building comprising: selecting a set of images; tagging interest regions in the set of images to generate training data of tagged images; and building the machine-learned model based on the tagged images of the training data, the machine-learned model capable of receiving the digital graphic novel content and generating the locations of the plurality of interest regions in the digital graphic novel content.

3. The computer-implemented method of claim 1, wherein the plurality of objects comprise speech bubble objects in the digital graphic novel content that contain text associated with characters in the digital graphic novel content.

4. The computer-implemented method of claim 3, wherein the plurality of interest regions comprise text regions of the speech bubble objects in the digital graphic novel content that encompass the text of the speech bubble objects.

5. The computer-implemented method of claim 1, wherein identifying locations of the plurality of objects of interest comprises, for each identified interest region, identifying a set of points surrounding the interest region indicative of the location of the corresponding object of interest, the set of points identified based on a color gradient between a color associated with the interest region and colors of points surrounding the interest region.

6. The computer-implemented method of claim 1, wherein identifying presentation orders of the plurality of objects of interest comprises, for each object of interest: identifying a reference point associated with the object of interest indicating coordinates of a distinctive feature of the object of interest; determining a panel containing the object of interest based on a spatial relationship between the reference point and location of the panel; and determining the presentation order of the object of interest within the panel based on spatial relationships between the reference point and reference points of other objects of interest contained within the panel.

7. The computer-implemented method of claim 6, wherein the object of interest is a speech bubble object in the digital graphic novel content and the distinctive feature is an anchor point of the speech bubble object.

8. The computer-implemented method of claim 1, further comprising: obtaining feedback data on presentation of the digital graphic novel content; and updating the machine-learned model based on the obtained feedback data to improve presentation metadata associated with the digital graphic novel content.

9. The computer-implemented method of claim 8, wherein the feedback data includes portions of the digital graphic novel content that have been zoomed-in on the reader device.

10. A non-transitory computer-readable storage medium storing executable computer program instructions for providing digital graphic novel content to a reading device, the computer program instructions comprising: receiving digital graphic novel content; identifying locations of a plurality of interest regions of the digital graphic novel content by applying a model to the digital graphic novel content; identifying locations and presentation orders of a plurality of objects of interest in the digital graphic novel content based on the identified locations of the plurality of interest regions; creating presentation metadata for the digital graphic novel content indicating the identified locations and presentation orders of the plurality of objects of interest; and providing the digital graphic novel content and the presentation metadata to the reading device for presentation of expanded versions of the plurality of objects of interest in accordance with the presentation metadata.

11. The computer-readable storage medium of claim 10, wherein the model is a machine-learned model, and the computer program instructions further comprise building the machine-learned model, the building comprising: selecting a set of images; tagging interest regions in the images to generate training data of tagged images; and building the machine-learned model based on the tagged images of the training data, the machine-learned model capable of receiving the digital graphic novel content and generating the locations of the plurality of interest regions in the digital graphic novel content.

12. The computer-readable storage medium of claim 10, wherein the plurality of objects comprise speech bubble objects in the digital graphic novel content that contain text associated with characters in the digital graphic novel content.

13. The computer-readable storage medium of claim 10, wherein identifying locations of the plurality of objects of interest comprises, for each identified interest region, identifying a set of points surrounding the interest region indicative of the location of the corresponding object of interest, the set of points identified based on a color gradient between a color associated with the interest region and colors of points surrounding the interest region.

14. The computer-readable storage medium of claim 10, wherein identifying presentation orders of the plurality of objects of interest comprises, for each object of interest: identifying a reference point associated with the object of interest indicating coordinates of a distinctive feature of the object of interest; determining a panel containing the object of interest based on a spatial relationship between the reference point and location of the panel; and determining the presentation order of the object of interest within the panel based on spatial relationships between the reference point and reference points of other objects of interest contained within the panel.

15. The computer-readable storage medium of claim 14, wherein the object of interest is a speech bubble object in the digital graphic novel content and the distinctive feature is an anchor point of the speech bubble object.

16. A server for providing digital graphic novel content to a reading device, comprising: a processor for executing computer program instructions; and a non-transitory computer-readable storage medium storing computer program instructions executable to perform steps comprising: receiving digital graphic novel content; identifying locations of a plurality of interest regions of the digital graphic novel content by applying a model to the digital graphic novel content; identifying locations and presentation orders of a plurality of objects of interest in the digital graphic novel
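Claims 6 and 14 describe ordering objects of interest by assigning each one to a panel through a reference point and then comparing reference points within that panel. The claims do not state the comparison rule, so the sketch below makes explicit assumptions: panels are axis-aligned boxes already listed in reading order, a reference point belongs to the first panel that contains it, and objects within a panel are read top-to-bottom, then left-to-right. The names `containing_panel` and `presentation_order` are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float]               # (x, y) reference point, e.g. a bubble anchor
Box = Tuple[float, float, float, float]   # (left, top, right, bottom)


def containing_panel(ref: Point, panels: List[Box]) -> Optional[int]:
    """Return the index of the first panel whose box contains the reference point."""
    x, y = ref
    for index, (left, top, right, bottom) in enumerate(panels):
        if left <= x <= right and top <= y <= bottom:
            return index
    return None


def presentation_order(ref_points: Dict[str, Point], panels: List[Box]) -> List[str]:
    """Order object names by panel index, then by y, then by x (assumed rule)."""
    keyed = []
    for name, ref in ref_points.items():
        panel = containing_panel(ref, panels)
        if panel is None:
            continue  # reference point lies outside every panel; skip it
        x, y = ref
        keyed.append((panel, y, x, name))
    return [name for _, _, _, name in sorted(keyed)]


panels = [(0, 0, 100, 100), (100, 0, 200, 100)]
refs = {"a": (150, 20), "b": (60, 50), "c": (10, 10)}
order = presentation_order(refs, panels)
```

A right-to-left work such as manga would need a different sort key, for example negating `x`; the rule used here is only one plausible choice.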

Assignees

Google Inc

Inventors

Classifications

  • Active pattern learning

  • Obtaining sets of training patterns; Bootstrap methods, e.g. bagging or boosting

  • using classification, e.g. of video objects

  • based on feedback of a supervisor

  • based on distances to training or reference patterns

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017365083A1 cover?
It covers identifying the locations and presentation orders of objects of interest (e.g., speech bubbles) in digital graphic novel content so that expanded versions can be presented to a reader. A machine-learned model locates interest regions (e.g., rectangular text regions of speech bubbles), and the resulting presentation metadata is sent to the reading device along with the content.
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
The primary CPC classification is G06T11/60, which falls under CPC section G (Physics).
When was this patent published?
It was published on Dec 21, 2017 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).