MeTooMA (doi:10.7910/DVN/JN4EYU)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

(external link)

Document Description

Citation

Title:

MeTooMA

Identification Number:

doi:10.7910/DVN/JN4EYU

Distributor:

Harvard Dataverse

Date of Distribution:

2019-09-14

Version:

3

Bibliographic Citation:

Kumar Gautam, Akash, 2019, "MeTooMA", https://doi.org/10.7910/DVN/JN4EYU, Harvard Dataverse, V3, UNF:6:vCKHLJcc/RFRvyBxuc/SKw== [fileUNF]

Study Description

Citation

Title:

MeTooMA

Subtitle:

Multi-Aspect Annotations of Tweets Related to the MeToo movement

Identification Number:

doi:10.7910/DVN/JN4EYU

Authoring Entity:

Kumar Gautam, Akash (MIDAS, IIIT-Delhi)

Producer:

Akash Kumar Gautam

Date of Production:

2019-12-14

Distributor:

Harvard Dataverse

Access Authority:

Kumar Gautam, Akash

Depositor:

Kumar Gautam, Akash

Date of Deposit:

2019-09-14

Date of Distribution:

2020-06-8

Holdings Information:

https://doi.org/10.7910/DVN/JN4EYU

Study Scope

Keywords:

Arts and Humanities, Computer and Information Science, Engineering, Social Sciences, MeToo movement, Social Media, Twitter

Topic Classification:

Social Media, Sexual Harrasment, Sarcasm, Hate Speech

Abstract:

The dataset consists of tweets belonging to #MeToo movement on Twitter, labelled into different categories. Due to Twitter's development policies, we only provide the tweet ID's and corresponding labels, other data can be fetched via Twitter API. The data has been labelled by experts, with the majority taken into the account for deciding the final label. We provide these labels for each of the tweets. <li>Relevance</li> <li>Directed Hate</li> <li>Generalized Hate</li> <li>Sarcasm</li> <li>Allegation</li> <li>Justification</li> <li>Refutation</li> <li>Support</li> <li>Oppose</li> There could be more than one label applicable for a single tweet, for detailed annotation information and usage guidelines please refer to the accompanying paper <b>https://aaai.org/ojs/index.php/ICWSM/article/view/7292</b>

Kind of Data:

Tweet ID's and labels. Please refer to associated publication for more details.

Notes:

Please refer to these conditions for the usage and distribution of this dataset. <li> The dataset should be used for scientific or research purposes only. Any other use of the dataset is strictly prohibited. </li> <li> The dataset should not be redistributed or shared in any part with any third-party organization. Interested parties should be redirected to this website.</li> <li> The dataset collection completely follows the Twitter mandated guidelines for distribution and usage. </li> <li> The language all the tweets included in the dataset is English </li>

Methodology and Processing

Sources Statement

Data Access

Notes:

<a href="http://creativecommons.org/publicdomain/zero/1.0">CC0 1.0</a>

Other Study Description Materials

Related Publications

Citation

Title:

@inproceedings{gautam2020metooma, title={\# MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement}, author={Gautam, Akash and Mathur, Puneet and Gosangi, Rakesh and Mahata, Debanjan and Sawhney, Ramit and Shah, Rajiv Ratn}, booktitle={Proceedings of the International AAAI Conference on Web and Social Media}, volume={14}, pages={209--216}, year={2020} }

Bibliographic Citation:

@inproceedings{gautam2020metooma, title={\# MeTooMA: Multi-Aspect Annotations of Tweets Related to the MeToo Movement}, author={Gautam, Akash and Mathur, Puneet and Gosangi, Rakesh and Mahata, Debanjan and Sawhney, Ramit and Shah, Rajiv Ratn}, booktitle={Proceedings of the International AAAI Conference on Web and Social Media}, volume={14}, pages={209--216}, year={2020} }

File Description--f3582048

File: MeTooMA.tab

  • Number of cases: 9973

  • No. of variables per record: 11

  • Type of File: text/tab-separated-values

Notes:

UNF:6:vCKHLJcc/RFRvyBxuc/SKw==

Variable Description

List of Variables:

Variables

TweetId

f3582048 Location:

Summary Statistics: Max. 1.07464969088340378E18; Mean 1.05277898622540672E18; Min. 1.04849821762791834E18; StDev 2.6063064672513445E15; Valid 9973.0

Variable Format: numeric

Notes: UNF:6:F2yKUBU08VxXM92HNcFtCw==

Country

f3582048 Location:

Variable Format: character

Notes: UNF:6:RxH9Kttk4lvVMuBZ3s2XuQ==

Relevance

f3582048 Location:

Summary Statistics: Max. 1.0; StDev 0.4455932024824297; Min. 0.0; Mean 0.7268625288278346; Valid 9973.0

Variable Format: numeric

Notes: UNF:6:xTlb72hABQBLMOZBpQ5IHg==

Directed_Hate

f3582048 Location:

Summary Statistics: Max. 1.0; StDev 0.20062986712345815; Mean 0.0420134362779504; Valid 9973.0; Min. 0.0

Variable Format: numeric

Notes: UNF:6:UETnpHQ3z5/tpmcidcdO3w==

Generalized_Hate

f3582048 Location:

Summary Statistics: Max. 1.0; Min. 0.0; Valid 9973.0; Mean 0.02817607540358988; StDev 0.16548392696996922;

Variable Format: numeric

Notes: UNF:6:w1j3FST5PC2y76OmfRlubw==

Sarcasm

f3582048 Location:

Summary Statistics: Min. 0.0; Valid 9973.0; Max. 1.0; Mean 0.02205956081419817; StDev 0.1468846484215756;

Variable Format: numeric

Notes: UNF:6:xSQtnCOtGtYQkH2xo0WwAg==

Allegation

f3582048 Location:

Summary Statistics: Valid 9973.0; Min. 0.0; Mean 0.057956482502757714; StDev 0.2336728561937994; Max. 1.0;

Variable Format: numeric

Notes: UNF:6:sGHp4awxtc0IQTVBbEV9+w==

Justification

f3582048 Location:

Summary Statistics: StDev 0.16859608724167258; Min. 0.0; Mean 0.029279053444300263; Valid 9973.0; Max. 1.0

Variable Format: numeric

Notes: UNF:6:lO/qWytVJJmdtfLe/Rw8mQ==

Refutation

f3582048 Location:

Summary Statistics: Valid 9973.0; Max. 1.0; Mean 0.021658477890303705; StDev 0.14557305078307928; Min. 0.0

Variable Format: numeric

Notes: UNF:6:XOx3FMRaPSbi7O67MaKI5g==

Support

f3582048 Location:

Summary Statistics: Mean 0.30823222701293473; StDev 0.4617862098703001; Valid 9973.0; Max. 1.0; Min. 0.0

Variable Format: numeric

Notes: UNF:6:IlPehoBIQe4WX1D7xlChSg==

Oppose

f3582048 Location:

Summary Statistics: Max. 1.0; StDev 0.26259787838386883; Mean 0.07450115311340598; Valid 9973.0; Min. 0.0

Variable Format: numeric

Notes: UNF:6:xUZ7ShvjMZ09jYqDpjDwoA==

Other Study-Related Materials

Label:

README.rtf

Notes:

text/rtf