RDF:
TTC-3600: Benchmark dataset for Turkish text categorization
TTC-3600: Benchmark dataset for Turkish text categorization

The TTC-3600 data set is a collection of Turkish news and articles including categorized 3,600 documents from 6 well-known portals in Turkey. It has 4 different forms in ARFF Weka format.

The TTC-3600 data set is a collection of Turkish news and articles including categorized 3,600 documents from 6 well-known portals in Turkey. It has 4 different forms in ARFF Weka format.

Data and Resources

  • TTC-3600: Benchmark dataset for Turkish text ...JSON

  • TTC-3600 Turkish Text Classification Dataset.rarRAR

Additional Info

Field Value
Author MCI Machine Learning Repository
Last Updated September 17, 2021, 10:35 (CST)
Created September 17, 2021, 10:35 (CST)
Area "Computer"
Associated Tasks "Classification
Attribute Characteristics "Integer"
DPA_DateImported 212601019
DPA_former_id 7761b73f-06b3-4956-a107-b350e952fe9c
DPA_former_name ttc-3600-benchmark-dataset-for-turkish-text-categorization
DPA_former_owner_org aac07418-7ef8-49ab-b0b2-e4e75a5379fe
DPA_former_site https://scidm.nchc.org.tw
Data Set Characteristics "Text"
Date Donated "2017-02-08"
Missing Values "N/A"
Number of Instances "3600"
Number of Web Hits "5755"
Number_of_Attributes "4814"
AODP Economy Taiwan