Url purification method and url purification apparatus
US-2016306893-A1 · Oct 20, 2016 · US
US2016342500A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2016342500-A1 |
| Application number | US-201514719834-A |
| Country | US |
| Kind code | A1 |
| Filing date | May 22, 2015 |
| Priority date | May 22, 2015 |
| Publication date | Nov 24, 2016 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Template identification techniques for control of testing are described. In one or more implementations, a method is described to control testing of one or more services by one or more computing devices using inferred template identification. Templates are inferred, by the one or more computing devices, that are likely used for documents for respective services of a service provider that are available via corresponding universal resource locators (URLs) to form an inferred dataset. Overlaps are identified by the one or computing devices in the inferred dataset to cluster services together that have likely used corresponding templates. Testing is controlled by the one or more computing devices of the one or more services based at least in part on the clusters.
Opening claim text (preview).
What is claimed is: 1 . A method to control testing of one or more services by one or more computing devices using inferred template identification, the method comprising: inferring templates, by the one or more computing devices, that are likely used for documents for respective services of a service provider that are available via corresponding universal resource locators (URLs) to form an inferred dataset; identifying overlaps by the one or more computing devices in the inferred dataset to cluster services together that have likely used corresponding templates; and controlling testing by the one or more computing devices of the one or more services based at least in part on the clustered services. 2 . The method of claim 1 , wherein the inferring includes applying template inference techniques to symbolically represent the likely templates used by the respective services. 3 . The method of claim 2 , wherein the applying is performed using a set of known templated documents and the documents obtained from the clustered services, an output of which is the symbolic representations. 4 . The method of claim 1 , wherein the identifying of overlaps in the inferred dataset to cluster services together that have likely used corresponding templates is performed using machine learning. 5 . The method of claim 5 , wherein the machine learning uses known template documents as a ground truth to process the documents obtained from the clustered services. 6 . The method of claim 1 , wherein the inferring uses a meta-domain descriptor that is generated to describe structural components of the documents obtained from the clustered services independent of content included in the documents. 7 . The method as described in claim 1 , wherein the documents are webpages or web documents. 8 . The method of claim 1 , further comprising repeating the inferring and the identifying for a subsequent said inferred dataset formed by removing one or more duplicates from the clustered services. 9 . The method as described in claim 8 , wherein the repeating includes at least some duplicate templates that are used to validate quality. 10 . A service testing system comprising: a template inference module implemented at least partially in hardware, the template inference module configured to: infer templates that are likely used for documents for respective services of a service provider that are available via corresponding universal resource locators (URLs) to form an inferred dataset; and identify overlaps by the one or more computing devices in the inferred dataset to cluster services together that have likely used corresponding templates; and a synthetic test generator configured to control testing of the one or more services based at least in part on the clustered services. 11 . The system of claim 10 , wherein the templates are inferred by applying template inference techniques to symbolically represent the likely templates used by the respective services. 12 . The system of claim 11 , wherein the applying is performed using a set of known templated documents and the documents obtained from the clustered services, an output of which is the symbolic representations. 13 . The system of claim 10 , wherein the identifying of overlaps in the inferred dataset to cluster services together that have likely used corresponding templates is performed using machine learning. 14 . The system of claim 13 , wherein the machine learning uses known template documents as a ground truth to process the documents obtained from the clustered services. 15 . The system of claim 10 , wherein the templates are inferred through use of a meta-domain descriptor that is generated to describe structural components of the documents obtained from the clustered services independent of content included in the documents. 16 . A computing device to infer template usage by services for use in testing, the computing device comprising: one or more processors; and one or more computer-readable storage media comprising instructions stored thereon that, responsive to execution by the one or more processors, causes the one or more processors to perform operations comprising: applying template inference techniques to symbolically represent likely templates used by respective services of a service provider to infer templates that are likely used for documents for the respective services that are available via corresponding universal resource locators (URLs) to form an inferred dataset; identifying overlaps in the inferred dataset to cluster services together that have likely used corresponding templates; and removing one or more duplicates from the inferred dataset based on the identified overlaps to form a de-duplicated dataset to be used to test the clustered services. 17 . The computing device of claim 16 , wherein the applying is performed using a set of known templated documents and the documents obtained from the clustered services, an output of which is the symbolic representations. 18 . The computing device of claim 16 , wherein the identifying of overlaps in the inferred dataset to cluster URLs together that have likely used corresponding templates is performed using machine learning. 19 . The computing device of claim 18 , wherein the machine learning uses known template documents as a ground truth to process the documents obtained from the clustered services. 20 . The computing device of claim 16 , wherein the applying uses a meta-domain descriptor that is generated to describe structural components of the documents obtained from the clustered services independent of content included in the documents.
for test execution, e.g. scheduling of test suites · CPC title
Authenticating web pages, e.g. with suspicious links · CPC title
involving long-term monitoring or reporting · CPC title
URL specific, e.g. using aliases, detecting broken or misspelled links · CPC title
for test design, e.g. generating new test cases · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.