Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.13091/690
Full metadata record
DC FieldValueLanguage
dc.contributor.authorHaddad, Hatem-
dc.contributor.authorMulki, Hala-
dc.contributor.authorOueslati, Asma-
dc.date.accessioned2021-12-13T10:29:49Z-
dc.date.available2021-12-13T10:29:49Z-
dc.date.issued2019-
dc.identifier.isbn978-3-030-32959-4; 978-3-030-32958-7-
dc.identifier.issn1865-0929-
dc.identifier.issn1865-0937-
dc.identifier.urihttps://doi.org/10.1007/978-3-030-32959-4_18-
dc.identifier.urihttps://hdl.handle.net/20.500.13091/690-
dc.description7th International Conference on Arabic Language Processing (ICALP) -- OCT 16-17, 2019 -- Nancy, FRANCEen_US
dc.description.abstractSince the Jasmine Revolution at 2011, Tunisia has entered a new era of ultimate freedom of expression with a full access into social media. This has been associated with an unrestricted spread of toxic contents such as Abusive and Hate speech. Considering the psychological harm, let alone the potential hate crimes that might be caused by these toxic contents, automatic Abusive and Hate speech detection systems become a mandatory. This evokes the need for Tunisian benchmark datasets required to evaluate Abusive and Hate speech detection models. Being an underrepresented dialect, no previous Abusive or Hate speech datasets were provided for the Tunisian dialect. In this paper, we introduce the first publicly-available Tunisian Hate and Abusive speech (T-HSAB) dataset with the objective to be a benchmark dataset for automatic detection of online Tunisian toxic contents. We provide a detailed review of the data collection steps and how we design the annotation guidelines such that a reliable dataset annotation is guaranteed. This was later emphasized through the comprehensive evaluation of the annotations as the annotation agreement metrics of Cohen's Kappa (k) and Krippendorff's alpha (alpha) indicated the consistency of the annotations.en_US
dc.description.sponsorshipGoogle, Univ Lorraine, Lab Lorrain Rech Informatique Applicat, European Language Resources Assoc, Special Interest Grp Under Resourced Languages, Inst Sci Digitales, Open Language & Knowledge Citizens, Arabic Language Engn Soc Morocco, Springer, Investir Avenir, Impact Olki, CCIS, Lorraine Univ Excellenceen_US
dc.language.isoenen_US
dc.publisherSPRINGER INTERNATIONAL PUBLISHING AGen_US
dc.relation.ispartofARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019en_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectTunisian dialecten_US
dc.subjectAbusive speechen_US
dc.subjectHate speechen_US
dc.titleT-HSAB: A Tunisian Hate Speech and Abusive Dataseten_US
dc.typeConference Objecten_US
dc.identifier.doi10.1007/978-3-030-32959-4_18-
dc.identifier.scopus2-s2.0-85075563504en_US
dc.departmentFakülteler, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.authoridhaddad, hatem/0000-0003-3599-7229-
dc.authorwosidhaddad, hatem/ABD-1530-2021-
dc.identifier.volume1108en_US
dc.identifier.startpage251en_US
dc.identifier.endpage263en_US
dc.identifier.wosWOS:000569685400018en_US
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.authorscopusid22734490100-
dc.authorscopusid57200388232-
dc.authorscopusid57211977409-
dc.identifier.scopusqualityQ3-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.fulltextNo Fulltext-
item.languageiso639-1en-
item.openairetypeConference Object-
item.grantfulltextnone-
item.cerifentitytypePublications-
Appears in Collections:Mühendislik ve Doğa Bilimleri Fakültesi Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collections
WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collections
Show simple item record



CORE Recommender

SCOPUSTM   
Citations

12
checked on May 25, 2024

WEB OF SCIENCETM
Citations

22
checked on May 25, 2024

Page view(s)

248
checked on May 20, 2024

Google ScholarTM

Check




Altmetric


Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.