Saltar al contenido

PasteBin Dataset (PasteCC17K)

kisspng-computer-icons-binary-file-binary-code-desktop-wal-5ae1085f609c47.6613249115246971833957
PasteBin Dataset (PasteCC17K), a dataset of 17640 textual samples crawled from Pastebin, which are classified in 15 categories, being 6 of them suspicious to be related to illegal ones.
This dataset is available upon request. Please, send us an email to the address gvis@unileon.es with the following data:
  • Name of the Institution you are working on.
  • Brief description of the project in which the dataset will be used.
  • Objective of the specific research you want the dataset for.
Bear in mind that the request must be done from an email account from your institution. If you are a student, the request must be done by your supervisor in the mentioned Institution.