Abstract
Early intervention for suicide risks with social media data has increasingly received great attention. Using a suicide dictionary created by mental health experts is one of the effective ways to detect suicidal ideation. However, little attention has been paid to validate whether and how the existing dictionaries for other languages (i.e., English and Chinese) can be used for predicting suicidal ideation for a low-resource language (i.e., Korean) where a knowledge-based suicide dictionary has not yet been developed. To this end, we propose a cross-lingual suicidal ideation detection model that can identify whether a given social media post includes suicidal ideation or not. To utilize the existing suicide dictionaries developed for other languages (i.e., English and Chinese) in word embedding, our model translates a post written in the target language (i.e., Korean) into English and Chinese, and then uses the separate suicidal-oriented word embeddings developed for English and Chinese, respectively. By applying an ensemble approach for different languages, the model achieves high accuracy, over 87%. We believe our model is useful in accessing suicidal ideation using social media data for preventing potential suicide risk in an early stage.
| Original language | English |
|---|---|
| Title of host publication | Findings of the Association for Computational Linguistics Findings of ACL |
| Subtitle of host publication | EMNLP 2020 |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 2208-2217 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781952148903 |
| State | Published - 2020 |
| Event | Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020 - Virtual, Online Duration: 16 Nov 2020 → 20 Nov 2020 |
Publication series
| Name | Findings of the Association for Computational Linguistics Findings of ACL: EMNLP 2020 |
|---|
Conference
| Conference | Findings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020 |
|---|---|
| City | Virtual, Online |
| Period | 16/11/20 → 20/11/20 |
Bibliographical note
Publisher Copyright:©2020 Association for Computational Linguistics