Systems and methods for intelligent phishing threat detection and phishing threat remediation in a cyber security threat detection and mitigation platform
US-2024414198-A1 · Dec 12, 2024 · US
US9282117B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9282117-B2 |
| Application number | US-201313949974-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 24, 2013 |
| Priority date | Jul 24, 2012 |
| Publication date | Mar 8, 2016 |
| Grant date | Mar 8, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A phishing classification model that detects a phishing website based on one or more feature vectors for the website is provided. The phishing classification model may operate on a server and may further select a website, generate a feature vector for a landing page of the website, create a feature vector for every iframe that is a descendent of the landing page, and derive a final feature vector from the feature vectors of the landing page and the descendent iframe pages. Further, machine learning techniques may be applied to generate, or train, a classification model based upon one or more known phishing websites. Based on the feature vector, the classification modeler may classify a website as either a phishing website or as a non-phishing website. Feedback in the form of human verification may further be incorporated.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method comprising: using a device, creating one or more feature vectors for a landing page of a website, wherein the one or more feature vectors for the landing page are derived from one or more landing page elements; creating one or more feature vectors for one or more child pages that are a descendant of the landing page; deriving a final feature vector from the one or more feature vectors of the landing page and the one or more feature vectors for the child pages; and providing the final feature vector to a model to determine whether the website is a phishing website. 2. The method of claim 1 , further comprising: inputting the final feature vector into a model, wherein the model outputs a score associated with a probability of being a phishing site given the input; and classifying the website as a phishing website based on the determined score. 3. The method of claim 2 , further comprising: classifying the website as a phishing website given the score and a threshold. 4. The method of claim 2 , wherein the final feature vector includes a concatenation of at least some of the following individual feature vectors: a uniform resource locator (URL) feature vector including at least some of a URL string character n-gram, an IP address character n-gram, and URL geo-location information; an average URL feature vector derived from links and hrefs on page; average URL feature vectors derived from links and hrefs on page in bins of similarity to the page URL feature vector; an html content feature vector; a classification service classification result feature vector; and a feature vector based on age of webpage. 5. The method of claim 2 , wherein the model utilizes active learning to compute a priority in which the feature vector should be labeled. 6. The method of claim 2 , wherein the model utilizes one or more labels to identify whether the website is a phishing website or not a phishing website. 7. The method of claim 2 , wherein the model utilizes transductive learning. 8. The method of claim 2 , further comprising: an output score indicating an entity that is targeted by the phishing website. 9. The method of claim 1 , wherein the feature vector is derived according to the following formula: p ⇀ = ( p ⇀ 00 , 1 n 1 ∑ k n 1 p ⇀ 1 k , 1 n 11 ∑ { k ❘ k ∈ bin 11 } n 11 p ⇀ 1 k , … , 1 n 1 m ∑ { k ❘ k ∈ bin 1 m n
service impersonation, e.g. phishing, pharming or web spoofing (detection of rogue wireless access points H04W12/12) · CPC title
Computer malware detection or handling, e.g. anti-virus arrangements · CPC title
Assessing vulnerabilities and evaluating computer system security · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.