Convolutional neural networks for toxic comment classification

Georgakopoulos, Spiros; Tasoulis, Sotirios; Vrahatis, Aristidis; Plagianakos, Vassilis

Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://hdl.handle.net/123456789/317

Τύπος:	Πρακτικά συνεδρίου
Τίτλος:	Convolutional neural networks for toxic comment classification
Συγγραφέας:	[EL] Γεωργακόπουλος, Σπυρίδων[EN] Georgakopoulos, Spiros [EL] Τασουλής, Σωτήριος[EN] Tasoulis, Sotirios [EL] Βραχάτης, Αριστείδης[EN] Vrahatis, Aristidis [EL] Πλαγιανάκος, Βασίλειος[EN] Plagianakos, Vassilis
Ημερομηνία:	Ιου-2018
Περίληψη:	Flood of information is produced in a daily basis through the global internet usage arising from the online interactive communications among users. While this situation contributes significantly to the quality of human life, unfortunately it involves enormous dangers, since online texts with high toxicity can cause personal attacks, online harassment and bullying behaviors. This has triggered both industrial and research community in the last few years while there are several attempts to identify an efficient model for online toxic comment prediction. However, these steps are still in their infancy and new approaches and frameworks are required. On parallel, the data explosion that appears constantly, makes the construction of new machine learning computational tools for managing this information, an imperative need. Thankfully advances in hardware, cloud computing and big data management allow the development of Deep Learning approaches appearing very promising performance so far. For text classification in particular the use of Convolutional Neural Networks (CNN) have recently been proposed approaching text analytics in a modern manner emphasizing in the structure of words in a document. In this work, we employ this approach to discover toxic comments in a large pool of documents provided by a current Kaggle’s competition regarding Wikipedia’s talk page edits. To justify this decision we choose to compare CNNs against the traditional bag-of-words approach for text analysis combined with a selection of algorithms proven to be very effective in text classification. The reported results provide enough evidence that CNN enhance toxic comment classification reinforcing research interest towards this direction.
Γλώσσα:	Αγγλικά
Σελίδες:	6
DOI:	10.1145/3200947.3208069
ISBN:	978-1-4503-6433-1
Θεματική κατηγορία:	[EL] Τεχνητή νοημοσύνη[EN] Artificial Intelligence
Λέξεις-κλειδιά:	convolutional neural networks; Toxic Text Classification; word2vec; CNN for Text Mining; Text mining
Κάτοχος πνευματικών δικαιωμάτων:	© Association for Computing Machinery
Ηλεκτρονική διεύθυνση του τεκμηρίου στον εκδότη:	https://dl.acm.org/doi/10.1145/3200947.3208069
Ηλεκτρονική διεύθυνση περιοδικού:	https://dl.acm.org/doi/proceedings/10.1145/3200947
Σελίδες τεκμηρίου (στην πηγή):	Article no 35
Τοποθεσία εκδήλωσης:	Patras Greece
Ημ/νία έναρξης εκδήλωσης:	09/07/2018
Ημ/νία λήξης εκδήλωσης:	11/07/2018
Εμφανίζεται στις συλλογές:	Ερευνητικές ομάδες

Αρχεία σε αυτό το τεκμήριο:

Το πλήρες κείμενο αυτού του τεκμηρίου δεν διατίθεται προς το παρόν από το αποθετήριο