Features from Fraudulent Websites Dataset (FFW-282)
The Features from Fraudulent Websites Dataset (FFW-282) comprises 181 legitimate and 101 fraudulent domains. Using Selenium WebDriver and Python 3, the study collected features from each domain, including HTML content, SSL certificate information, and e-commerce technologies. External services such as Trustpilot and WHOIS were used to gather additional information, which was stored in a JSON file for creating feature vectors. User reports from ScamAdviser were used to collect 197 domains, where a low confidence score indicates a suspicious website. Legitimate domains were obtained from the top online stores worldwide. Based on a manual analysis, a confidence threshold of 75% was set for legitimacy.
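As a rough illustration of how a per-domain JSON record could be turned into a feature vector, the sketch below flattens one record into a fixed-order list of numbers. The record layout, the field names, the sample values, and the `to_feature_vector` helper are all assumptions made for this example; they do not describe the actual schema or content of FFW-282.

```python
import json

# Hypothetical record: field names and values are made up for illustration
# and are NOT taken from the FFW-282 dataset.
SAMPLE_RECORD = """
{
  "domain": "shop.example",
  "html": {"num_links": 42, "has_contact_page": true},
  "ssl": {"valid": true},
  "whois": {"domain_age_days": 1200},
  "trustpilot": {"score": 4.1, "num_reviews": 87}
}
"""


def to_feature_vector(record: dict) -> list[float]:
    """Flatten one per-domain record into a fixed-order numeric vector.

    Missing fields fall back to 0 so that every domain yields a vector
    of the same length.
    """
    html = record.get("html", {})
    ssl_info = record.get("ssl", {})
    whois = record.get("whois", {})
    trustpilot = record.get("trustpilot", {})
    return [
        float(html.get("num_links", 0)),
        float(bool(html.get("has_contact_page", False))),
        float(bool(ssl_info.get("valid", False))),
        float(whois.get("domain_age_days", 0)),
        float(trustpilot.get("score", 0.0)),
        float(trustpilot.get("num_reviews", 0)),
    ]


record = json.loads(SAMPLE_RECORD)
vector = to_feature_vector(record)
print(vector)
```

Keeping the feature order fixed and defaulting missing fields to zero is one simple design choice; the real dataset may encode missing values differently.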
This dataset is available upon request. Please send an email to gvis@unileon.es with the following information:
Name of the institution you work at.
Brief description of the project in which the dataset will be used.
Objective of the specific research you want the dataset for.
Bear in mind that the request must be sent from an email account at your institution. If you are a student, the request must be made by your supervisor at that institution.