Can feature structure improve model’s precision? A novel prediction method using artificial image and image identification

Research output: Contribution to journalArticlepeer-review

Abstract

Objectives: This study aimed to develop an approach to enhance the model precision by artificial images. Materials and Methods: Given an epidemiological study designed to predict 1 response using f features with M samples, each feature was converted into a pixel with certain value. Permutated these pixels into F orders, resulting in F distinct artificial image sample sets. Based on the experience of image recognition techniques, appropriate training images results in higher precision model. In the preliminary experiment, a binary response was predicted by 76 features, the sample set included 223 patients and 1776 healthy controls. Results: We randomly selected 10 000 artificial sample sets to train the model. Models’ performance (area under the receiver operating characteristic curve values) depicted a bell-shaped distribution. Conclusion: The model construction strategy developed in the research has potential to capture feature order related information and enhance model predictability. Lay Summary We aimed to demonstrate a novel method to investigate the effect of feature structure on model predictability with epidemiological data. The concept was inspired from image identification. Pixels in digital images are used as features when training the identification model. The quality of a given digital image will be damaged when pixels’ position and their values changed arbitrarily, which obstructs the model training and model’s precision. We assume the structure-related relationship exists in epidemiological data. Given a certain dataset, features are transformed to pixel values for generating artificial images. To explore the effect of feature structure, orders of pixels are randomly permutated and the model is trained using pixel-permutated artificial image sample sets. In the preliminary experiment, one binary response was designed to be predicted by 76 features. We randomly selected 10 000 artificial image sample sets to train the model. Models’ performance (area under the receiver operating characteristic curve values) depicted a bell-shaped distribution. Namely, the performance of each model’s predictability was studied and the feature structure information had a strong impact on model performance. Our novel model construction strategy has potential to capture feature order related information and enhance model predictability.

Original languageEnglish
Article number7
JournalJAMIA Open
Volume7
Issue number1
DOIs
Publication statusPublished - 01-04-2024

All Science Journal Classification (ASJC) codes

  • Health Informatics

Fingerprint

Dive into the research topics of 'Can feature structure improve model’s precision? A novel prediction method using artificial image and image identification'. Together they form a unique fingerprint.

Cite this