TY - JOUR
T1 - Orthognathic surgical planning using graph CNN with dual embedding module
T2 - External validations with multi-hospital datasets
AU - Kim, In Hwan
AU - Kim, Jun Sik
AU - Jeong, Jiheon
AU - Park, Jae Woo
AU - Park, Kanggil
AU - Cho, Jin Hyoung
AU - Hong, Mihee
AU - Kang, Kyung Hwa
AU - Kim, Minji
AU - Kim, Su Jung
AU - Kim, Yoon Ji
AU - Sung, Sang Jin
AU - Kim, Young Ho
AU - Lim, Sung Hoon
AU - Baek, Seung Hak
AU - Kim, Namkug
N1 - Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/12
Y1 - 2023/12
N2 - Background and objective: Despite recent development of AI, prediction of the surgical movement in the maxilla and mandible by OGS might be more difficult than that of tooth movement by orthodontic treatment. To evaluate the prediction accuracy of the surgical movement using pairs of pre-(T0) and post-surgical (T1) lateral cephalograms (lat-ceph) of orthognathic surgery (OGS) patients and dual embedding module-graph convolution neural network (DEM-GCNN) model. Methods: 599 pairs from 3 institutions were used as training, internal validation, and internal test sets and 201 pairs from other 6 institutions were used as external test set. DEM-GCNN model (IEM, learning the lat-ceph images; LTEM, learning the landmarks) was developed to predict the amount and direction of surgical movement of ANS and PNS in the maxilla and B-point and Md1crown in the mandible. The distance between T1 landmark coordinates actually moved by OGS (ground truth) and predicted by DEM-GCNN model and pre-existed CNN-based Model-C (learning the lat-ceph images) was compared. Results: In both internal and external tests, DEM-GCNN did not exhibit significant difference from ground truth in all landmarks (ANS, PNS, B-point, Md1crown, all P > 0.05). When the accumulated successful detection rate for each landmark was compared, DEM-GCNN showed higher values than Model-C in both the internal and external tests. In violin plots exhibiting the error distribution of the prediction results, both internal and external tests showed that DEM-GCNN had significant performance improvement in PNS, ANS, B-point, Md1crown than Model-C. DEM-GCNN showed significantly lower prediction error values than Model-C (one-jaw surgery, B-point, Md1crown, all P < 0.005; two-jaw surgery, PNS, ANS, all P < 0.05; B point, Md1crown, all P < 0.005). Conclusion: We developed a robust OGS planning model with maximized generalizability despite diverse qualities of lat-cephs from 9 institutions.
AB - Background and objective: Despite recent development of AI, prediction of the surgical movement in the maxilla and mandible by OGS might be more difficult than that of tooth movement by orthodontic treatment. To evaluate the prediction accuracy of the surgical movement using pairs of pre-(T0) and post-surgical (T1) lateral cephalograms (lat-ceph) of orthognathic surgery (OGS) patients and dual embedding module-graph convolution neural network (DEM-GCNN) model. Methods: 599 pairs from 3 institutions were used as training, internal validation, and internal test sets and 201 pairs from other 6 institutions were used as external test set. DEM-GCNN model (IEM, learning the lat-ceph images; LTEM, learning the landmarks) was developed to predict the amount and direction of surgical movement of ANS and PNS in the maxilla and B-point and Md1crown in the mandible. The distance between T1 landmark coordinates actually moved by OGS (ground truth) and predicted by DEM-GCNN model and pre-existed CNN-based Model-C (learning the lat-ceph images) was compared. Results: In both internal and external tests, DEM-GCNN did not exhibit significant difference from ground truth in all landmarks (ANS, PNS, B-point, Md1crown, all P > 0.05). When the accumulated successful detection rate for each landmark was compared, DEM-GCNN showed higher values than Model-C in both the internal and external tests. In violin plots exhibiting the error distribution of the prediction results, both internal and external tests showed that DEM-GCNN had significant performance improvement in PNS, ANS, B-point, Md1crown than Model-C. DEM-GCNN showed significantly lower prediction error values than Model-C (one-jaw surgery, B-point, Md1crown, all P < 0.005; two-jaw surgery, PNS, ANS, all P < 0.05; B point, Md1crown, all P < 0.005). Conclusion: We developed a robust OGS planning model with maximized generalizability despite diverse qualities of lat-cephs from 9 institutions.
KW - Cephalometry
KW - Deep learning
KW - Dual embedding module
KW - Graph convolution neural network
KW - Maxillofacial surgery
KW - Multicenter study
KW - Orthognathic surgery
KW - Surgical prediction
UR - http://www.scopus.com/inward/record.url?scp=85176138359&partnerID=8YFLogxK
U2 - 10.1016/j.cmpb.2023.107853
DO - 10.1016/j.cmpb.2023.107853
M3 - Article
C2 - 37857025
AN - SCOPUS:85176138359
SN - 0169-2607
VL - 242
JO - Computer Methods and Programs in Biomedicine
JF - Computer Methods and Programs in Biomedicine
M1 - 107853
ER -