TY - JOUR
T1 - USEVis
T2 - Visual analytics of attention-based neural embedding in information retrieval
AU - Ji, Xiaonan
AU - Tu, Yamei
AU - He, Wenbin
AU - Wang, Junpeng
AU - Shen, Han Wei
AU - Yen, Po Yin
N1 - Publisher Copyright:
© 2021 The Author(s)
PY - 2021/6
Y1 - 2021/6
N2 - Neural attention-based encoders, which effectively attend sentence tokens to their associated context without being restricted by long-term distance or dependency, have demonstrated outstanding performance in embedding sentences into meaningful representations (embeddings). The Universal Sentence Encoder (USE) is one of the most well-recognized deep neural network (DNN) based solutions, which is facilitated with an attention-driven transformer architecture and has been pre-trained on a large number of sentences from the Internet. Besides the fact that USE has been widely used in many downstream applications, including information retrieval (IR), interpreting its complicated internal working mechanism remains challenging. In this work, we present a visual analytics solution towards addressing this challenge. Specifically, focused on semantics and syntactics (concepts and relations) that are critical to domain clinical IR, we designed and developed a visual analytics system, i.e., USEVis. The system investigates the power of USE in effectively extracting sentences’ semantics and syntactics through exploring and interpreting how linguistic properties are captured by attentions. Furthermore, by thoroughly examining and comparing the inherent patterns of these attentions, we are able to exploit attentions to retrieve sentences/documents that have similar semantics or are closely related to a given clinical problem in IR. By collaborating with domain experts, we demonstrate use cases with inspiring findings to validate the contribution of our work and the effectiveness of our system.
AB - Neural attention-based encoders, which effectively attend sentence tokens to their associated context without being restricted by long-term distance or dependency, have demonstrated outstanding performance in embedding sentences into meaningful representations (embeddings). The Universal Sentence Encoder (USE) is one of the most well-recognized deep neural network (DNN) based solutions, which is facilitated with an attention-driven transformer architecture and has been pre-trained on a large number of sentences from the Internet. Besides the fact that USE has been widely used in many downstream applications, including information retrieval (IR), interpreting its complicated internal working mechanism remains challenging. In this work, we present a visual analytics solution towards addressing this challenge. Specifically, focused on semantics and syntactics (concepts and relations) that are critical to domain clinical IR, we designed and developed a visual analytics system, i.e., USEVis. The system investigates the power of USE in effectively extracting sentences’ semantics and syntactics through exploring and interpreting how linguistic properties are captured by attentions. Furthermore, by thoroughly examining and comparing the inherent patterns of these attentions, we are able to exploit attentions to retrieve sentences/documents that have similar semantics or are closely related to a given clinical problem in IR. By collaborating with domain experts, we demonstrate use cases with inspiring findings to validate the contribution of our work and the effectiveness of our system.
KW - Attention mechanism
KW - Clinical decision-making
KW - Document understanding
KW - Information retrieval
KW - Interactive visual system
KW - Neural embedding
UR - http://www.scopus.com/inward/record.url?scp=85106539762&partnerID=8YFLogxK
U2 - 10.1016/j.visinf.2021.03.003
DO - 10.1016/j.visinf.2021.03.003
M3 - Article
AN - SCOPUS:85106539762
SN - 2543-2656
VL - 5
SP - 1
EP - 12
JO - Visual Informatics
JF - Visual Informatics
IS - 2
ER -