Visual domain detection systems and methods

US10679088B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10679088-B1
Application numberUS-201715430158-A
CountryUS
Kind codeB1
Filing dateFeb 10, 2017
Priority dateFeb 10, 2017
Publication dateJun 9, 2020
Grant dateJun 9, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed is an effective domain name defense solution in which a domain name string may be provided to or obtained by a computer embodying a visual domain analyzer. The domain name string may be rendered or otherwise converted to an image. An optical character recognition function may be applied to the image to read out a text string which can then be compared with a protected domain name to determine whether the text string generated by the optical character recognition function from the image converted from the domain name string is similar to or matches the protected domain name. This visual domain analysis can be dynamically applied in an online process or proactively applied in an offline process to hundreds of millions of domain names.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for visual domain detection, comprising: obtaining, by a computer, a domain name string, the domain name string collected or generated by a data source over a network, the computer comprising at least one processor and at least one non-transitory computer-readable medium; converting, by the computer, the domain name string for visual domain detection processing, the converting comprising: determining whether to decode the domain name string based on a format of the domain name string; responsive to a determination to decode the domain name string, decoding the domain name string into binary characters; and rendering the binary characters as an image; applying, by the computer, an optical character recognition (OCR) function to the image, the applying the OCR function to the image generating a text string that is a domain name set of characters; comparing, by the computer, the text string generated by the OCR function with a protected domain name; determining, by the computer utilizing a string similarity measure, whether the text string generated by the OCR function from the image converted from the domain name string is similar to or matches the protected domain name, the string similarity measure comprising a bigram distance between the domain name string and the protected domain name; and communicating, by the computer, a result from the determining to a computing device over the network. 2. The method according to claim 1 , wherein the computer determines prior to the converting whether the domain name string is an internationalized domain name. 3. The method according to claim 2 , wherein the computer determines whether the domain name string is an internationalized domain name by parsing the domain name string and determining whether an ASCII representation consisting of a prefix is present in the domain name string. 4. The method according to claim 3 , wherein the prefix comprises “xn--”. 5. The method according to claim 2 , wherein the computer decodes the internationalized domain name into binary characters, and wherein the computer renders the binary characters into an image file utilizing a draw function. 6. The method according to claim 5 , wherein the binary characters are in UTF8 format. 7. The method according to claim 1 , further comprising: determining, based on one or more words in the text string, whether the domain name string belongs to same product or service vertical on the Internet as the protected domain name; and responsive to the domain name string and the protected domain name belonging to the same product or service vertical on the Internet, providing the domain name string to a rules engine for analysis. 8. A system for visual domain detection, comprising: a computer embodying a visual domain analyzer, the computer comprising at least one processor and at least one non-transitory computer-readable medium storing instructions translatable by the at least one processor to perform: obtaining a domain name string, the domain name string collected or generated by a data source over a network; converting the domain name string for visual domain detection processing, the converting comprising: determining whether to decode the domain name string based on a format of the domain name string; responsive to a determination to decode the domain name string, decoding the domain name string into binary characters; and rendering the binary characters as an image; applying an optical character recognition function (OCR) to the image, the applying the OCR function to the image generating a text string that is a domain name set of characters; comparing the text string generated by the OCR function with a protected domain name; determining, utilizing a string similarity measure, whether the text string generated by the OCR function from the image converted from the domain name string is similar to or matches the protected domain name, the string similarity measure comprising a bigram distance between the domain name string and the protected domain name; and communicating a result from the determining to a computing device over the network. 9. The system of claim 8 , wherein the instructions are further translatable by the at least one processor to determine prior to the converting whether the domain name string is an internationalized domain name. 10. The system of claim 9 , wherein the instructions are further translatable by the at least one processor to determine whether the domain name string is an internationalized domain name by parsing the domain name string and determining whether an ASCII representation consisting of a prefix is present in the domain name string. 11. The system of claim 10 , wherein the prefix comprises “xn--”. 12. The system of claim 9 , wherein the instructions are further translatable by the at least one processor to decode the internationalized domain name into binary characters, and wherein the instructions are further translatable by the at least one processor to render the binary characters into an image file utilizing a draw function. 13. The system of claim 12 , wherein the binary characters are in UTF8 format. 14. The system of claim 8 , wherein the instructions are further translatable by the at least one processor for: determining, based on one or more words in the text string, whether the domain name string belongs to same product or service vertical on the Internet as the protected domain name; and responsive to the domain name string and the protected domain name belonging to the same product or service vertical on the Internet, providing the domain name string to a rules engine for analysis. 15. A computer program product for visual domain detection, the computer program product comprising at least one non-transitory computer-readable medium storing instructions translatable by at least one processor to perform: obtaining a domain name string, the domain name string collected or generated by a data source over a network; converting the domain name string for visual domain detection processing, the converting comprising: determining whether to decode the domain name string based on a format of the domain name string; responsive to a determination to decode the domain name string, decoding the domain name string into binary characters; and rendering the binary characters as an image; applying an optical character recognition function (OCR) to the image, the applying the OCR function to the image generating a text string that is a domain name set of characters; comparing the text string generated by the OCR function with a protected domain name; determining, utilizing a string similarity measure, whether the text string generated by the OCR function from the image converted from the domain name string is similar to or matches the protected domain name, the string similarity measure comprising a bigram distance between the domain name string and the protected domain name; and communicating a result from the determining to a computing device over the network. 16. The computer program product of claim 15 , wherein the instructions are further translatable by the at least one processor to determine prior to the converting whether the domain name string is an internationalized domain name. 17. The computer program product of claim 16 , wherein the instructions are further translatable by the at least one processor to determine whether the domain name string is an internationalized domain name by parsing the domain name string and determining whether an ASCII re

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Electricity · mapped topic

  • for managing network security; network security policies in general (filtering policies H04L63/0227) · CPC title

  • G06K9/344Primary

    Physics · mapped topic

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10679088B1 cover?
Disclosed is an effective domain name defense solution in which a domain name string may be provided to or obtained by a computer embodying a visual domain analyzer. The domain name string may be rendered or otherwise converted to an image. An optical character recognition function may be applied to the image to read out a text string which can then be compared with a protected domain name to d…
Who is the assignee on this patent?
Proofpoint Inc
What technology area does this patent fall under?
Primary CPC classification G06K9/344. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 09 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).