13 datasets found

Contents in English are mainly machine translated, may not reflect the exact meaning.

Formats: PDF JSON

Filter Results
  • flag_taiwan

    Title:

    Title in English:

    This dataset has no description

  • flag_taiwan

    Title:

    Title in English:

    Description:

    To create the largest authorship attribution dataset, we extracted works of 50 well-known authors. To have a non-exhaustive learning, in training there are 45 authors whereas,...

    Description in English:

    To create the largest authorship attribution dataset, we extracted works of 50 well-known authors. To have a non-exhaustive learning, in training there are 45 authors whereas, in the testing, it's 50
  • flag_taiwan

    Title:

    Title in English:

    Description:

    The PAMAP2 Physical Activity Monitoring dataset contains data of 18 different physical activities, performed by 9 subjects wearing 3 inertial measurement units and a heart rate...

    Description in English:

    The PAMAP2 Physical Activity Monitoring dataset contains data of 18 different physical activities, performed by 9 subjects wearing 3 inertial measurement units and a heart rate monitor.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    This data set contains user reviews of cars and and hotels collected from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).

    Description in English:

    This data set contains user reviews of cars and and hotels collected from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).
  • flag_taiwan

    Title:

    Title in English:

    Description:

    MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The...

    Description in English:

    MADELON is an artificial dataset, which was part of the NIPS 2003 feature selection challenge. This is a two-class classification problem with continuous input variables. The difficulty is that the problem is multivariate and highly non-linear.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    GISETTE is a handwritten digit recognition problem. The problem is to separate the highly confusible digits '4' and '9'. This dataset is one of five datasets of the NIPS 2003...

    Description in English:

    GISETTE is a handwritten digit recognition problem. The problem is to separate the highly confusible digits '4' and '9'. This dataset is one of five datasets of the NIPS 2003 feature selection challenge.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    20 photos of leaves for each of 32 different species.

    Description in English:

    20 photos of leaves for each of 32 different species.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    DOROTHEA is a drug discovery dataset. Chemical compounds represented by structural molecular features must be classified as active (binding to thrombin) or inactive. This is one...

    Description in English:

    DOROTHEA is a drug discovery dataset. Chemical compounds represented by structural molecular features must be classified as active (binding to thrombin) or inactive. This is one of 5 datasets of the NIPS 2003 feature selection challenge.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    DEXTER is a text classification problem in a bag-of-word representation. This is a two-class classification problem with sparse continuous input variables. This dataset is one...

    Description in English:

    DEXTER is a text classification problem in a bag-of-word representation. This is a two-class classification problem with sparse continuous input variables. This dataset is one of five datasets of the NIPS 2003 feature selection challenge.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    Marine sponges of the Demospongiae class classification domain.

    Description in English:

    Marine sponges of the Demospongiae class classification domain.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of real data. Data can be generated in .csv, ARFF or C4.5...

    Description in English:

    AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of real data. Data can be generated in .csv, ARFF or C4.5 formats.
  • flag_taiwan

    Title:

    Title in English:

    Description:

    ARCENE's task is to distinguish cancer versus normal patterns from mass-spectrometric data. This is a two-class classification problem with continuous input variables. This...

    Description in English:

    ARCENE's task is to distinguish cancer versus normal patterns from mass-spectrometric data. This is a two-class classification problem with continuous input variables. This dataset is one of 5 datasets of the NIPS 2003 feature selection challenge.
  • flag_taiwan

    Title:

    Title in English:

    This dataset has no description