Classifying locator generation kits

US2019141012A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2019141012-A1
Application numberUS-201916242030-A
CountryUS
Kind codeA1
Filing dateJan 8, 2019
Priority dateJul 1, 2016
Publication dateMay 9, 2019
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and systems for classifying malicious locators. A processor is trained on a set of known malicious locators using a non-supervised learning procedure. Once trained, the processor may classify new locators as being generated by a particular generation kit.

First claim

Opening claim text (preview).

1 . A method for classifying malicious locators accessible through a network, the method comprising: receiving, through an interface to a non-transitory computer-readable medium, at least one locator that comprises the location of a malicious network-accessible resource; extracting at least one feature associated with the at least one locator; assigning membership probabilities to the at least one locator based on the extracted at least one feature, wherein the membership probabilities each represent a probability the at least one locator belongs to a particular family of kits; and labeling the at least one locator as being generated by a kit with which the locator has the highest membership probability. 2 . The method of claim 1 , wherein the at least one locator is a uniform resource locator (URL). 3 . The method of claim 1 , wherein labeling the at least one locator as being generated by the kit includes labeling the at least one locator as being generated by a specific URL-generation kit. 4 . The method of claim 1 , further comprising: accessing a plurality of training locators that each comprise the location of a malicious network-accessible resource; extracting at least one feature associated with each of the plurality of training locators; labeling each of the plurality of training locators as being generated by a specific source based on the extracted features; providing the extracted features and the source label for each of the plurality of locators to a classification module to train the classification module. 5 . The method of claim 4 , wherein the label assigned to each of the plurality of training locators is based on a highest membership probability for each of the plurality of training locators. 6 . The method of claim 1 , wherein the at least one feature includes one or more of locator string length, character frequency distribution, domain levels, number of directories, number of words, number of words from a predetermined list of words, number of vowels, and number of consonants in the locator. 7 . The method of claim 1 , further comprising producing weights for each of the at least one feature to assist in determining the kit that generated the at least one locator. 8 . The method of claim 1 , further comprising issuing a message indicating the kit that was labeled as generating the at least one locator. 9 . The method of claim 1 , further comprising classifying the at least one locator as malicious or non-malicious. 10 . A system for classifying malicious locators accessible through a network, the system comprising: an interface to a non-transitory computer-readable medium configured to access at least one locator that comprises the location of a malicious network-accessible resource; a network interface; and a processor in communication with the medium interface and the network interface, the processor configured to: extract at least one feature associated with the at least one locator; assign membership probabilities to the at least one locator based on the extracted at least one feature, wherein the membership probabilities each represent a probability the at least one locator belongs to a particular family of kits; and label the at least one locator as being generated by a kit with which the locator has the highest membership probability. 11 . The system of claim 10 , wherein the at least one locator is a uniform resource locator (URL). 12 . The system of claim 10 , wherein the processor is configured to label the at least one locator as being generated by a specific URL-generation kit. 13 . The system of claim 10 , wherein the processor is configured to: access a plurality of training locators that each comprise the location of a malicious network-accessible resource; extract at least one feature associated with each of the plurality of training locators; label each of the plurality of training locators as being generated by a specific source based on the extracted features; provide the extracted features and the source label for each of the plurality of locators to a classification module to train the classification module. 14 . The system of claim 13 , wherein the label assigned to each of the plurality of training locators is based on a highest membership probability for each of the plurality of locators. 15 . The system of claim 10 , wherein the at least one feature includes one or more of locator string length, character frequency distribution, domain levels, number of directories, number of words, number of words from a predetermined list of words, number of vowels, and number of consonants in the locator. 16 . The system of claim 10 , wherein the processor is configured to produce weights for each of the at least one feature to assist in determining the kit that generated the at least one locator. 17 . The system of claim 10 , wherein the processor is configured to issue a message indicating the kit that was labeled as generating the at least one locator. 18 . The system of claim 10 , wherein the processor is configured to classify the at least one locator as malicious or non-malicious. 19 . The system of claim 13 , wherein the processor is configured to assign weights to each of the extracted at least one feature associated with each of the plurality of training locators. 20 . A non-transitory computer readable medium containing computer-executable instructions for performing a method for classifying malicious locators accessible through a network, the medium comprising: computer-executable instructions for receiving, through an interface to a non-transitory computer-readable medium, at least one locator that comprises the location of a malicious network-accessible resource; computer-executable instructions for extracting at least one feature associated with the at least one locator; computer-executable instructions for assigning membership probabilities to the at least one locator based on the extracted at least one feature, wherein the membership probabilities each represent a probability the at least one locator belongs to a particular family of kits; and computer-executable instructions for labeling the at least one locator as being generated by a kit with which the locator has the highest membership probability.

Assignees

Inventors

Classifications

  • Active attacks involving interception, injection, modification, spoofing of data unit addresses, e.g. hijacking, packet injection or TCP sequence number attacks · CPC title

  • Filtering by address, protocol, port number or service, e.g. IP-address or URL · CPC title

  • Traffic logging, e.g. anomaly detection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019141012A1 cover?
Methods and systems for classifying malicious locators. A processor is trained on a set of known malicious locators using a non-supervised learning procedure. Once trained, the processor may classify new locators as being generated by a particular generation kit.
Who is the assignee on this patent?
Rapid7 Inc
What technology area does this patent fall under?
Primary CPC classification H04L63/0236. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu May 09 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).