Saltar al contenido

Features from Fraudulent Websites Dataset (FFW-282)

ffw
Fraudulent Websites Dataset (FFW-282), comprises 181 legitimate and 101 fraudulent domains. Using Selenium Webdriver and Python3, the study collected features from each domain, including HTML content, SSL certificate information, and e-commerce technologies. External services like Trustpilot and WHOIS were used to gather additional information, stored in a JSON file for creating feature vectors. Finally, user reports from ScamAdviser were used to collect 197 domains, with a low confidence score indicating suspicious websites. Legitimate domains were obtained from top online stores globally. A manual analysis set a threshold of 75% confidence for legitimacy.
This dataset is available upon request. Please, send us an email to the address gvis@unileon.es with the following data:
  • Name of the Institution you are working on.
  • Brief description of the project in which the dataset will be used.
  • Objective of the specific research you want the dataset for.
Bear in mind that the request must be done from an email account from your institution. If you are a student, the request must be done by your supervisor in the mentioned Institution.