Paper: Machine learning strategies to predict late adverse effects in childhood acute lymphoblastic leukemia survivors

Date

2022-11-30

Authors

Nicolas Raymond¹
Maxime Caru²,³
Hakima Laribi¹
Mehdi Mitiche¹
Valérie Marcil⁴
Maja Krajinovic⁵
Daniel Curnier⁶
Daniel Sinnett⁴
Martin Vallières¹

¹ Department of Computer Science, Université de Sherbrooke, Sherbrooke, Canada

² Division of Hematology and Oncology, Department of Pediatrics, Penn State College of Medicine, Hershey, PA, USA

³ Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA, USA

⁴ Research Center, Sainte Justine University Health Center, Department of Nutrition, Université de Montréal, Montreal, Canada

⁵ Research Center, Sainte Justine University Health Center, Department of Pediatrics, Université de Montréal, Montreal, Canada

⁶ Research Center, Sainte Justine University Health Center, School of Kinesiology and Physical Activity Sciences, Faculty of Medicine, Université de Montréal, Montreal, Canada

Abstract

Acute lymphoblastic leukemia (ALL) is the most frequent pediatric cancer. Approximately two thirds of survivors develop one or more health complications, known as late adverse effects, following their treatments. The existing measures offered to patients during their follow-up visits to the hospital are largely standardized for all childhood cancer survivors and not necessarily personalized for childhood ALL survivors. As a result, late adverse effects may be underdiagnosed and, in most cases, are only taken care of after they appear. Thus, it is necessary to predict these treatment-related conditions earlier in order to prevent them and improve the survivors’ health. Multiple studies have investigated the development of late adverse effects prediction tools to offer better personalized follow-up methods. However, no solution to date has integrated neural networks. In this work, we developed graph-based, parameter-efficient neural networks and promoted their interpretability with multiple post-hoc analyses. We first proposed a new disease-specific VO₂ peak prediction model that does not require patients to participate in a physical function test (e.g., 6-minute walk test), and further created an obesity prediction model using clinical variables that are available from the end of childhood ALL treatment as well as genomic variables. Our solutions achieved better performance than linear and tree-based models on small cohorts (≤ 223 patients) for both tasks.

Last updated on 2024-12-18