Extracting a portion of a document, such as a web page

US9753926B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9753926-B2
Application numberUS-201213460632-A
CountryUS
Kind codeB2
Filing dateApr 30, 2012
Priority dateApr 30, 2012
Publication dateSep 5, 2017
Grant dateSep 5, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A facility for extracting visual contents of a page is described. The facility renders a page including visual contents, the rendered page including an extraction control. In response to user input activating the extraction control, the facility (1) selects a proper subset of the visual contents of the page that are specified by a publisher on the page for extraction using the extraction control, and (2) transmits a representation of the selected subset of visual contents for storage independent of the page.

First claim

Opening claim text (preview).

We claim: 1. A computing system comprising one or more processors configurable to cause: displaying a page on a user device, the page comprising a plurality of page elements, at least a portion of the page elements defining visual features of the page; determining a first location of a pointer displayed in relation to the page according to adjusted coordinates of the pointer; identifying a first page element based on the first location of the pointer; displaying, on the user device, responsive to identifying the first page element, a first extraction control corresponding to the first page element; identifying a first indication that the first extraction control has not been activated; determining a second location of the pointer, the second location being different than the first location; identifying a second page element based on the second location of the pointer; responsive to identifying the second page element: displaying, on the user device, a second extraction control corresponding to the second page element, and removing, on the user device, the displayed first extraction control; identifying a second indication that the second extraction control is activated; extracting, responsive to identifying the second indication, a portion of the page; providing the extracted portion of the page for storage; and displaying, on the user device, an updated second extraction control corresponding to the second page element, the updated second extraction control indicating that the extracted portion of the page was successfully provided for storage. 2. The computing system of claim 1 , the page further comprising: an inclusion reference specifying inclusion in the page of code for extracting the identified first page element in response to activation of the second extraction control. 3. The computing system of claim 2 , wherein the page includes information specifying a hierarchical organization of at least a portion of the page elements, and wherein the information explicitly identifying one or more of the page elements as being an object of the second extraction control comprises information referencing a page element of the hierarchical organization, wherein the identified elements are two or more elements contained by a subtree defined by the referenced page element. 4. The computing system of claim 1 , wherein the page includes information explicitly identifying one or more of the page elements as being an object of the second extraction control. 5. The computing system of claim 4 , wherein the information explicitly identifying one or more of the page elements as being an object of the second extraction control is information contained in a page element specified by information in the page that specifies a pair of page elements in order to identify a range of page elements defined by a specified pair of page elements. 6. A method for conveying a page, the method comprising: causing, by a computing system, display of the page on a user device, the page comprising a plurality of page elements, at least a portion of the page elements defining visual features of the page; determining a first location of a pointer displayed in relation to the page according to adjusted coordinates of the pointer; identifying a first page element based on the first location of the pointer; causing, by the computing system, responsive to identifying the first page element, display on the user device of a first extraction control corresponding to the first page element; identifying a first indication that the first extraction control has not been activated; determining a second location of the pointer, the second location being different than the first location; identifying a second page element based on the second location of the pointer; responsive to identifying the second page element: causing display on the user device of a second extraction control corresponding to the second page element, and removing, on the user device, the displayed first extraction control; identifying a second indication that the second extraction control is activated; extracting, responsive to identifying the second indication, a portion of the page; providing the extracted portion of the page for storage; and causing, by the computing system, display of an updated second extraction control corresponding to the second page element, the updated second extraction control indicating that the extracted portion of the page was successfully provided for storage. 7. The method of claim 6 , the page further comprising: an inclusion reference specifying inclusion in the page of code for extracting the identified first page element in response to activation of the second extraction control. 8. The method of claim 7 , wherein the page includes information specifying a hierarchical organization of at least a portion of the page elements, and wherein the information explicitly identifying one or more of the page elements as being an object of the second extraction control comprises information referencing a page element of the hierarchical organization, wherein the identified elements are two or more elements contained by a subtree defined by the referenced page element. 9. The method of claim 6 , wherein the page includes information explicitly identifying one or more of the page elements as being an object of the second extraction control. 10. The method of claim 9 , wherein the information explicitly identifying one or more of the page elements as being an object of the second extraction control is information contained in a page element specified by information in the page that specifies a pair of page elements in order to identify a range of page elements defined by a specified pair of page elements. 11. A computer program product comprising computer-readable program code to be executed by at least one processor when retrieved from a non-transitory computer-readable medium, the program code comprising instructions configurable to cause: displaying a page on a user device, the page comprising a plurality of page elements, at least a portion of the page elements defining visual features of the page; determining a first location of a pointer displayed in relation to the page according to adjusted coordinates of the pointer; identifying a first page element based on the first location of the pointer; displaying, on the user device, responsive to identifying the first page element, a first extraction control corresponding to the first page element; identifying a first indication that the first extraction control has not been activated; determining a second location of the pointer, the second location being different than the first location; identifying a second page element based on the second location of the pointer; responsive to identifying the second page element: displaying, on the user device, a second extraction control corresponding to the second page element, and removing, on the user device, the displayed first extraction control; identifying a second indication that the second extraction control is activated; extracting, responsive to identifying the second indication, a portion of the page; providing the extracted portion of the page for storage; and displaying, on the user device, an updated second extraction control corresponding to the second page element, the updated second extraction control indicating that the extracted portion of the page was successfully provided for storage. 12. The computer program product of claim 11 , the page further comprising: an inclusion reference specifying inclusion in the page of code for extracting the identified first pa

Assignees

Inventors

Classifications

  • Indexing; Web crawling techniques · CPC title

  • Query formulation, e.g. graphical querying · CPC title

  • G06F16/434Primary

    using image data, e.g. images, photos, pictures taken by a user · CPC title

  • Physics · mapped topic

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9753926B2 cover?
A facility for extracting visual contents of a page is described. The facility renders a page including visual contents, the rendered page including an extraction control. In response to user input activating the extraction control, the facility (1) selects a proper subset of the visual contents of the page that are specified by a publisher on the page for extraction using the extraction contro…
Who is the assignee on this patent?
Flake Gary W, Montgomery Thomas J, Pascale Gregory T, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06F16/434. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 05 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).