Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.13091/1014
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Mulki, Hala | - |
dc.contributor.author | Haddad, Hatem | - |
dc.contributor.author | Ali, Chedi Bechikh | - |
dc.contributor.author | Alshabani, Halima | - |
dc.date.accessioned | 2021-12-13T10:34:35Z | - |
dc.date.available | 2021-12-13T10:34:35Z | - |
dc.date.issued | 2019 | - |
dc.identifier.isbn | 978-1-950737-43-7 | - |
dc.identifier.uri | https://hdl.handle.net/20.500.13091/1014 | - |
dc.description | 3rd Workshop on Abusive Language Online -- AUG 01, 2019 -- Florence, ITALY | en_US |
dc.description.abstract | Hate speech and abusive language have become a common phenomenon on Arabic social media. Automatic hate speech and abusive detection systems can facilitate the prohibition of toxic textual contents. The complexity, informality and ambiguity of the Arabic dialects hindered the provision of the needed resources for Arabic abusive/hate speech detection research. In this paper, we introduce the first publicly-available Levantine Hate Speech and Abusive (L-HSAB) Twitter dataset with the objective to be a benchmark dataset for automatic detection of online Levantine toxic contents. We, further, provide a detailed review of the data collection steps and how we design the annotation guidelines such that a reliable dataset annotation is guaranteed. This has been later emphasized through the comprehensive evaluation of the annotations as the annotation agreement metrics of Cohen's Kappa (k) and Krippendorff's alpha (alpha) indicated the consistency of the annotations. | en_US |
dc.description.sponsorship | UCLA, Google, Facebook, Element AI, Aylien | en_US |
dc.language.iso | en | en_US |
dc.publisher | ASSOC COMPUTATIONAL LINGUISTICS-ACL | en_US |
dc.relation.ispartof | THIRD WORKSHOP ON ABUSIVE LANGUAGE ONLINE | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | AGREEMENT | en_US |
dc.title | L-HSAB: A Levantine Twitter Dataset for Hate Speech and Abusive Language | en_US |
dc.type | Conference Object | en_US |
dc.department | Fakülteler, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümü | en_US |
dc.authorid | haddad, hatem/0000-0003-3599-7229 | - |
dc.authorwosid | haddad, hatem/ABD-1530-2021 | - |
dc.identifier.startpage | 111 | en_US |
dc.identifier.endpage | 118 | en_US |
dc.identifier.wos | WOS:000538480400012 | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
item.openairetype | Conference Object | - |
item.languageiso639-1 | en | - |
item.cerifentitytype | Publications | - |
item.grantfulltext | embargo_20300101 | - |
item.fulltext | With Fulltext | - |
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | - |
Appears in Collections: | Mühendislik ve Doğa Bilimleri Fakültesi Koleksiyonu WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collections |
Files in This Item:
File | Size | Format | |
---|---|---|---|
W19-3512.pdf Until 2030-01-01 | 1.49 MB | Adobe PDF | View/Open Request a copy |
CORE Recommender
WEB OF SCIENCETM
Citations
61
checked on Mar 23, 2024
Page view(s)
112
checked on Mar 25, 2024
Google ScholarTM
Check
Altmetric
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.