TWITA - Long-term Social Media Collection at the University of Turin

40wita

Collection of tweets about the COVID-19 emergency in Italy.

Go to dataset

ConRef-STANCE-ita

collection of tweets on the topic of the Referendum held in Italy on December 4, 2016, about a reform of the Italian Constitution.

Go to dataset

Felicittà

Corpus for the evaluation of a project on the development of a platform that aimed to estimate and interactively display the degree of happiness in Italian cities.

Go to dataset

HaSpeeDe

Dataset for the Hate Speech Detection task at EVALITA 2018.

Go to dataset

Italian Hate Speech Corpus

Corpus of hate speech on social media towards migrants and ethnic minorities.

Go to dataset

IronITA

Dataset for the irony detection task task at EVALITA 2018.

Go to dataset

PoSTWITA

Dataset for the SENTIment POLarity Classification task at EVALITA 2014 and 2016.

Go to dataset

Senti-TUT

A dataset of Italian tweets with a focus on politics and ironic content.

Go to dataset

SENTIPOLC

Dataset for the SENTIment POLarity Classification task at EVALITA 2014 and 2016.

Go to dataset

TW-BuonaScuola

Corpus of Italian tweets on the topic of the national educational and training systems.

Go to dataset

TW-SWELLFER

Corpus of Italian tweets on subjective well-being, in particular regarding the topics of fertility and parenthood.

Go to dataset

TWITTIRÒ

Dataset of Italian tweets a fine-grained annotation of irony is superimposed.

Go to dataset

TWITA Datasets: