Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2
Published in The International Journal of Biostatistics, 2020
Download here
Published in Frontiers in Oncology, 2020
Download here
Published in Journal of Experimental & Clinical Cancer Research, 2021
Download here
Published in Proceedings of the International Conference on Medical and Health Informatics (Full Paper Track), 2021
Download here
Published in Journal of Biomedical Informatics, 2021
Download here
Published in R Package, 2021
Download here
Published in Journal of Biomedical Informatics, 2021
Download here
Published in Journal of Biomedical Informatics, 2022
Download here
Published in STAR Protocols, 2023
Download here
Published in Proceedings of the International Conference on Learning Representations (Tiny Paper Track), 2023
Download here
Published in Proceedings of the International Conference on Learning Representations (Tiny Paper Track), 2023
Download here
Published in Proceedings of the International Conference on Learning Representations (Tiny Paper Track), 2023
Download here
Published in Artificial Intelligence in Medicine, 2023
Download here
Published in Proceedings of the Medical Imaging with Deep Learning (Short Paper Track), 2023
Download here
Published in Journal of Biomedical Informatics, 2023
Download here
Published:
Early prediction of adverse drug reaction (ADR) is crucial in clinical research. The development of electronic medical record (EMR) provides an excellent resource for retrospective studies to extract samples and establish models that can be used for prediction of clinical deterioration. However, classical statistical models like multivariate logistic regression (LR) may result in unreliable predictions when handling unbalanced datasets. To develop a trustworthy model on unbalanced ADR data, we first transformed the EMR including medical notes into numeric variables. Then we introduced support vector machine (SVM), random forest (RF), AdaBoost, XGBoost, and artificial neural network (ANN) to deal with the challenge of high dimensionality. Furthermore, we utilized the ensembling approach to tackle data imbalance. Finally, we analyzed potential model mechanisms to provide interpretability and compared methods from the perspective of procedure elapsed time. The results showed ensembling contributed considerable improvement in prediction ability of various machine intelligence models. Compared with the baseline, RF, AdaBoost and XGBoost presented superiority, and ANN without fine-tuning showed similar competence. The results of this study demonstrated the great potential of machine learning models in medical domain.
Primary school course, Yongji Primary School, Tianjin, 2015