TY - JOUR
T1 - Robust Natural Language Processing
T2 - Recent Advances, Challenges, and Future Directions
AU - Omar, Marwan
AU - Choi, Soohyeon
AU - Nyang, Daehun
AU - Mohaisen, David
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2022
Y1 - 2022
N2 - Recent natural language processing (NLP) techniques have accomplished high performance on benchmark data sets, primarily due to the significant improvement in the performance of deep learning. The advances in the research community have led to great enhancements in state-of-the-art production systems for NLP tasks, such as virtual assistants, speech recognition, and sentiment analysis. However, such NLP systems still often fail when tested with adversarial attacks. The initial lack of robustness exposed troubling gaps in current models' language understanding capabilities, creating problems when NLP systems are deployed in real life. In this paper, we present a structured overview of NLP robustness research by summarizing the literature in a systemic way across various dimensions. We then take a deep-dive into the various dimensions of robustness, across techniques, metrics, embedding, and benchmarks.
AB - Recent natural language processing (NLP) techniques have accomplished high performance on benchmark data sets, primarily due to the significant improvement in the performance of deep learning. The advances in the research community have led to great enhancements in state-of-the-art production systems for NLP tasks, such as virtual assistants, speech recognition, and sentiment analysis. However, such NLP systems still often fail when tested with adversarial attacks. The initial lack of robustness exposed troubling gaps in current models' language understanding capabilities, creating problems when NLP systems are deployed in real life. In this paper, we present a structured overview of NLP robustness research by summarizing the literature in a systemic way across various dimensions. We then take a deep-dive into the various dimensions of robustness, across techniques, metrics, embedding, and benchmarks.
KW - Adversarial attacks
KW - Natural language processing
KW - Robustness
UR - http://www.scopus.com/inward/record.url?scp=85136154112&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2022.3197769
DO - 10.1109/ACCESS.2022.3197769
M3 - Article
AN - SCOPUS:85136154112
SN - 2169-3536
VL - 10
SP - 86038
EP - 86056
JO - IEEE Access
JF - IEEE Access
ER -