Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.13091/690
Title: T-HSAB: A Tunisian Hate Speech and Abusive Dataset
Authors: Haddad, Hatem
Mulki, Hala
Oueslati, Asma
Keywords: Tunisian dialect
Abusive speech
Hate speech
Publisher: Springer International Publishing AG
Abstract: Since the Jasmine Revolution in 2011, Tunisia has entered a new era of ultimate freedom of expression with full access to social media. This has been accompanied by an unrestricted spread of toxic content such as abusive and hate speech. Considering the psychological harm, let alone the potential hate crimes, that such content might cause, automatic abusive and hate speech detection systems have become mandatory. This creates a need for Tunisian benchmark datasets with which to evaluate abusive and hate speech detection models. Because Tunisian is an underrepresented dialect, no abusive or hate speech datasets have previously been provided for it. In this paper, we introduce the first publicly available Tunisian Hate and Abusive speech (T-HSAB) dataset, intended as a benchmark for the automatic detection of toxic online Tunisian content. We provide a detailed review of the data collection steps and of how we designed the annotation guidelines so that reliable dataset annotation is guaranteed. This was later confirmed through a comprehensive evaluation of the annotations: the agreement metrics Cohen's kappa (κ) and Krippendorff's alpha (α) indicated that the annotations are consistent.
Description: 7th International Conference on Arabic Language Processing (ICALP), October 16-17, 2019, Nancy, France
URI: https://doi.org/10.1007/978-3-030-32959-4_18
https://hdl.handle.net/20.500.13091/690
ISBN: 978-3-030-32959-4; 978-3-030-32958-7
ISSN: 1865-0929
1865-0937
Appears in Collections: Faculty of Engineering and Natural Sciences Collection
Scopus Indexed Publications Collection
WoS Indexed Publications Collection

Scopus citations: 12 (checked on Apr 27, 2024)
Web of Science citations: 22 (checked on Apr 27, 2024)
Page views: 244 (checked on Apr 29, 2024)


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.