-
Title:
Title in English:
Description:
The goal is to model mutant p53 transcriptional activity (active vs inactive) based on data extracted from biophysical simulations.Description in English:
The goal is to model mutant p53 transcriptional activity (active vs inactive) based on data extracted from biophysical simulations. -
Title:
Title in English:
Description:
The OPPORTUNITY Dataset for Human Activity Recognition from Wearable, Object, and Ambient Sensors is a dataset devised to benchmark human activity recognition algorithms...Description in English:
The OPPORTUNITY Dataset for Human Activity Recognition from Wearable, Object, and Ambient Sensors is a dataset devised to benchmark human activity recognition algorithms (classification, automatic data segmentation, sensor fusion, feature extraction, etc). -
Title:
Title in English:
Description:
This data set contains user reviews of cars and and hotels collected from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).Description in English:
This data set contains user reviews of cars and and hotels collected from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews). -
Title:
Title in English:
Description:
Open University Learning Analytics Dataset contains data about courses, students and their interactions with Virtual Learning Environment for seven selected courses and more...Description in English:
Open University Learning Analytics Dataset contains data about courses, students and their interactions with Virtual Learning Environment for seven selected courses and more than 30000 students. -
Title:
Title in English:
Description:
The dataset contains a million randomly sampled video instances listing 10 fundamental video characteristics along with the YouTube video ID.Description in English:
The dataset contains a million randomly sampled video instances listing 10 fundamental video characteristics along with the YouTube video ID. -
Title:
Title in English:
Description:
This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social...Description in English:
This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social networks (popularity). -
Title:
Title in English:
Description:
Sixteen samples of leaf each of one-hundred plant species. For each sample, a shape descriptor, fine scale margin and texture histogram are given.Description in English:
Sixteen samples of leaf each of one-hundred plant species. For each sample, a shape descriptor, fine scale margin and texture histogram are given. -
Title:
Title in English:
Description:
Experimental data used for binary classification (room occupancy) from Temperature,Humidity,Light and CO2. Ground-truth occupancy was obtained from time stamped pictures that...Description in English:
Experimental data used for binary classification (room occupancy) from Temperature,Humidity,Light and CO2. Ground-truth occupancy was obtained from time stamped pictures that were taken every minute. -
Title:
Title in English:
Description:
NYSK (New York v. Strauss-Kahn) is a collection of English news articles about the case relating to allegations of sexual assault against the former IMF director Dominique...Description in English:
NYSK (New York v. Strauss-Kahn) is a collection of English news articles about the case relating to allegations of sexual assault against the former IMF director Dominique Strauss-Kahn (May 2011). -
Title:
Title in English:
Description:
This data set consists of (a) 129,000 abstracts describing NSF awards for basic research, (b) bag-of-word data files extracted from the abstracts, (c) a list of words used for...Description in English:
This data set consists of (a) 129,000 abstracts describing NSF awards for basic research, (b) bag-of-word data files extracted from the abstracts, (c) a list of words used for indexing the bag-of-word -
Title:
Title in English:
Description:
Northix is designed to be a schema matching benchmark problem for data integration of two entity relationship databases.Description in English:
Northix is designed to be a schema matching benchmark problem for data integration of two entity relationship databases. -
Title:
Title in English:
Description:
Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place. Instances in the...Description in English:
Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place. Instances in the dataset compare 2 spots. -
Title:
Title in English:
Description:
Corpus intended to do cleaning (or binarization) and enhancement of noisy grayscale printed text images using supervised learning methods. Noisy images and their corresponding...Description in English:
Corpus intended to do cleaning (or binarization) and enhancement of noisy grayscale printed text images using supervised learning methods. Noisy images and their corresponding ground truth provided. -
Title:
Title in English:
Description:
References to news pages collected from an web aggregator in the period from 10-March-2014 to 10-August-2014. The resources are grouped into clusters that represent pages...Description in English:
References to news pages collected from an web aggregator in the period from 10-March-2014 to 10-August-2014. The resources are grouped into clusters that represent pages discussing the same story. -
Title:
Title in English:
Description:
This dataset was collected by Shan-Hung Wu and DataLab members at NTHU, Taiwan. There're 325 user-perceived clusters from 100 users and their corresponding descriptions.Description in English:
This dataset was collected by Shan-Hung Wu and DataLab members at NTHU, Taiwan. There're 325 user-perceived clusters from 100 users and their corresponding descriptions. -
Title:
Title in English:
Description:
5 types of hand postures from 12 users were recorded using unlabeled markers on fingers of a glove in a motion capture environment. Due to resolution and occlusion, missing...Description in English:
5 types of hand postures from 12 users were recorded using unlabeled markers on fingers of a glove in a motion capture environment. Due to resolution and occlusion, missing values are common. -
Title:
Title in English:
Description:
A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data.Description in English:
A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. -
Title:
Title in English:
Description:
MicroblogPCU data is crawled from sina weibo microblog[[Web Link]]. This data can be used to study machine learning methods as well as do some social network research.Description in English:
MicroblogPCU data is crawled from sina weibo microblog[[Web Link]]. This data can be used to study machine learning methods as well as do some social network research. -
Title:
Title in English:
Description:
The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing.Description in English:
The MHEALTH (Mobile Health) dataset is devised to benchmark techniques dealing with human behavior analysis based on multimodal body sensing. -
Title:
Title in English:
Description:
The data here are the ZZAlpha® machine learning recommendations made for various US traded stock portfolios the morning of each day during the 3 year period Jan 1, 2012 - Dec...Description in English:
The data here are the ZZAlpha® machine learning recommendations made for various US traded stock portfolios the morning of each day during the 3 year period Jan 1, 2012 - Dec 31, 2014.