phishingdetect - A phishing detect system with NLP/OCR/HTML features.

phishingdetect – A phishing detect system with NLP/OCR/HTML features.

PhishingDetect is A simple machine learning model to identify phishing pages by looking at:
+ HTML text
+ HTML structure
+ IMAGE text

phishingdetect

Dependencies:
+ Python 2.7.x
+ tesseract OCR
+ nltk data
+ libraries for machine learning: numpy, scikit-learn, matplotlib and scipy

Use and Download:

Source: https://github.com/ririhedou