TY - JOUR
T1 - Integrating Preprocessing Operations into Deep Learning Model
T2 - Case Study of Posttreatment Visual Acuity Prediction
AU - Otsuki, Ryo
AU - Sugiyama, Osamu
AU - Mori, Yuki
AU - Miyake, Masahiro
AU - Hiragi, Shusuke
AU - Yamamoto, Goshiro
AU - Santos, Luciano
AU - Nakanishi, Yuta
AU - Hosoda, Yoshikatsu
AU - Tamura, Hiroshi
AU - Matsumoto, Shigemi
AU - Tsujikawa, Akitaka
AU - Kuroda, Tomohiro
N1 - Publisher Copyright:
© 2022 The Author(s).
PY - 2022
Y1 - 2022
N2 - Designing a deep neural network model that integrates clinical images with other electronic medical records entails various preprocessing operations. Preprocessing of clinical images often requires trimming of parts of the lesions shown in the images, whereas preprocessing of other electronic medical records requires vectorization of these records; for example, patient age is often converted into a categorical vector of 10-year intervals. Although these preprocessing operations are critical to the performance of the classification model, there is no guarantee that the preprocessing step chosen is appropriate for model training. The ability to integrate these preprocessing operations into a deep neural network model and to train the model, including the preprocessing operations, can help design a multi-modal medical classification model. This study proposes integration layers of preprocessing, both for clinical images and electronic medical records, in deep neural network models. Preprocessing of clinical images is realized by a vision transformer layer that selectively adopts the parts of the images requiring attention. The preprocessing of other medical electrical records is performed by adopting full-connection layers and normalizing these layers. These proposed preprocessing-integrated layers were verified using a posttreatment visual acuity prediction task in ophthalmology as a case study. This prediction task requires clinical images as well as patient profile data corresponding to each patient’s posttreatment logMAR visual acuity. The performance of a heuristically designed prediction model was compared with the performance of the prediction model that includes the proposed preprocessing integration layers. The mean square errors between predicted and correct results were 0.051 for the heuristic model and 0.054 for the proposed model. Experimental results showed that the proposed model utilizing preprocessing integration layers achieved nearly the same performance as the heuristically designed model.
AB - Designing a deep neural network model that integrates clinical images with other electronic medical records entails various preprocessing operations. Preprocessing of clinical images often requires trimming of parts of the lesions shown in the images, whereas preprocessing of other electronic medical records requires vectorization of these records; for example, patient age is often converted into a categorical vector of 10-year intervals. Although these preprocessing operations are critical to the performance of the classification model, there is no guarantee that the preprocessing step chosen is appropriate for model training. The ability to integrate these preprocessing operations into a deep neural network model and to train the model, including the preprocessing operations, can help design a multi-modal medical classification model. This study proposes integration layers of preprocessing, both for clinical images and electronic medical records, in deep neural network models. Preprocessing of clinical images is realized by a vision transformer layer that selectively adopts the parts of the images requiring attention. The preprocessing of other medical electrical records is performed by adopting full-connection layers and normalizing these layers. These proposed preprocessing-integrated layers were verified using a posttreatment visual acuity prediction task in ophthalmology as a case study. This prediction task requires clinical images as well as patient profile data corresponding to each patient’s posttreatment logMAR visual acuity. The performance of a heuristically designed prediction model was compared with the performance of the prediction model that includes the proposed preprocessing integration layers. The mean square errors between predicted and correct results were 0.051 for the heuristic model and 0.054 for the proposed model. Experimental results showed that the proposed model utilizing preprocessing integration layers achieved nearly the same performance as the heuristically designed model.
KW - Automation of preprocessing
KW - Deep learning
KW - Visual acuity prediction
UR - http://www.scopus.com/inward/record.url?scp=85124352167&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85124352167&partnerID=8YFLogxK
U2 - 10.14326/abe.11.16
DO - 10.14326/abe.11.16
M3 - Article
AN - SCOPUS:85124352167
SN - 2187-5219
VL - 11
SP - 16
EP - 24
JO - Advanced Biomedical Engineering
JF - Advanced Biomedical Engineering
ER -