
40wita

This dataset is part of "Computational Linguistics and the Covid-19 Outbreak", an effort by the Italian Association of Computational Linguistics to collect and track resources that may help alleviate the national and global crisis following the COVID-19 outbreak in 2020.

The dataset has been collected daily since March 1st, 2020, by filtering TWITA with the following keywords:

covid, covid19, covid-19, corona virus, coronavirus, quarantena, autoisolamento, auto-isolamento, iorestoacasa, stateacasa, COVID19Italia, redditodicittadinaza, eurobond, coronabond, restiamoacasa, preghiamoinsieme, NoMes, #milanononsiferma, bergamononsiferma, l’italianonsiferma, abbraccciauncinese, iononsonounvirus, iononmifermo, aperisera, covidunstria, italiazonarossa, bergamoisrunning, quarantena, chiudetetutto, apritetutto, CuraItalia, ciricordiamotutto, oggisciopero, chiudiamolefabbriche, iononrinuncioalletradizioni, andràtuttobene, INPSdown, percheQuando, cercareDi, ringraziarevoglio, 600euro, CineINPS, COVID19Pandemic
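The filtering step can be illustrated with a minimal sketch. The exact matching rule used to build the dataset is not documented here, so the case-insensitive substring match below is an assumption, as are the function names `matches_keywords` and `filter_tweets` and the sample tweets. Only a handful of the keywords listed above are included for brevity.

```python
# A small subset of the keywords listed above; the real filter uses the full list.
KEYWORDS = ["covid", "coronavirus", "quarantena", "iorestoacasa", "andràtuttobene"]


def matches_keywords(text, keywords=KEYWORDS):
    """Return True if the text contains any keyword.

    Assumption: matching is a case-insensitive substring test, so hashtags
    such as "#iorestoacasa" also match the bare keyword "iorestoacasa".
    """
    lowered = text.casefold()
    return any(keyword.casefold() in lowered for keyword in keywords)


def filter_tweets(tweets, keywords=KEYWORDS):
    """Keep only the tweets whose text matches at least one keyword."""
    return [tweet for tweet in tweets if matches_keywords(tweet, keywords)]


# Hypothetical sample texts, not taken from the dataset.
sample = [
    "Oggi #iorestoacasa e leggo un libro",
    "Che bella giornata a Torino",
    "Nuovi dati sul Coronavirus in Lombardia",
]
filtered = filter_tweets(sample)
```

A substring match is deliberately permissive: it catches hashtags and compound forms, at the cost of occasional false positives on unrelated words that happen to contain a keyword.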

The data are available here. Please write to Valerio Basile <valerio.basile@unito.it> for the password.

Authors: Valerio Basile, Tommaso Caselli

There is no publication describing this dataset yet. If you use 40wita, please cite it with the following information:

@misc{40twita:2020,
  author = {Basile, Valerio and Caselli, Tommaso},
  title = {40twita 1.0: A collection of Italian Tweets during the COVID-19 Pandemic},
  language = {Italian},
  doi = {},
  howpublished = {http://twita.di.unito.it/dataset/40wita}
}

and/or the following paper describing the parent resource:

@inproceedings{DBLP:conf/clic-it/BasileLS18,
  author    = {Valerio Basile and
               Mirko Lai and
               Manuela Sanguinetti},
  title     = {Long-term Social Media Data Collection at the University of Turin},
  booktitle = {Proceedings of the Fifth Italian Conference on Computational Linguistics
               (CLiC-it 2018), Torino, Italy, December 10-12, 2018.},
  year      = {2018},
  crossref  = {DBLP:conf/clic-it/2018},
  url       = {http://ceur-ws.org/Vol-2253/paper48.pdf},
  timestamp = {Mon, 17 Dec 2018 17:18:40 +0100},
  biburl    = {https://dblp.org/rec/bib/conf/clic-it/BasileLS18},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}
