This title appears in the Scientific Report :
2023
Please use the identifier:
http://dx.doi.org/10.3390/diagnostics13030391 in citations.
Please use the identifier: http://hdl.handle.net/2128/33768 in citations.
Analysis of Chest X-ray for COVID-19 Diagnosis as a Use Case for an HPC-Enabled Data Analysis and Machine Learning Platform for Medical Diagnosis Support
Analysis of Chest X-ray for COVID-19 Diagnosis as a Use Case for an HPC-Enabled Data Analysis and Machine Learning Platform for Medical Diagnosis Support
The COVID-19 pandemic shed light on the need for quick diagnosis tools in healthcare, leading to the development of several algorithmic models for disease detection. Though these models are relatively easy to build, their training requires a lot of data, storage, and resources, which may not be avai...
Saved in:
Personal Name(s): | Barakat, Chadi (Corresponding author) |
---|---|
Aach, Marcel / Schuppert, Andreas / Brynjólfsson, Sigurður / Fritsch, Sebastian / Riedel, Morris | |
Contributing Institute: |
Jülich Supercomputing Center; JSC |
Published in: | Diagnostics, 13 (2023) 3, S. 391 |
Imprint: |
Basel
MDPI
2023
|
DOI: |
10.3390/diagnostics13030391 |
Document Type: |
Journal Article |
Research Program: |
SMITH - Medizininformatik-Konsortium - Beitrag Forschungszentrum Jülich National Competence Centres in the framework of EuroHPC Research on AI- and Simulation-Based Engineering at Exascale Cross-Domain Algorithms, Tools, Methods Labs (ATMLs) and Research Groups |
Link: |
OpenAccess |
Publikationsportal JuSER |
Please use the identifier: http://hdl.handle.net/2128/33768 in citations.
The COVID-19 pandemic shed light on the need for quick diagnosis tools in healthcare, leading to the development of several algorithmic models for disease detection. Though these models are relatively easy to build, their training requires a lot of data, storage, and resources, which may not be available for use by medical institutions or could be beyond the skillset of the people who most need these tools. This paper describes a data analysis and machine learning platform that takes advantage of high-performance computing infrastructure for medical diagnosis support applications. This platform is validated by re-training a previously published deep learning model (COVID-Net) on new data, where it is shown that the performance of the model is improved through large-scale hyperparameter optimisation that uncovered optimal training parameter combinations. The per-class accuracy of the model, especially for COVID-19 and pneumonia, is higher when using the tuned hyperparameters (healthy: 96.5%; pneumonia: 61.5%; COVID-19: 78.9%) as opposed to parameters chosen through traditional methods (healthy: 93.6%; pneumonia: 46.1%; COVID-19: 76.3%). Furthermore, training speed-up analysis shows a major decrease in training time as resources increase, from 207 min using 1 node to 54 min when distributed over 32 nodes, but highlights the presence of a cut-off point where the communication overhead begins to affect performance. The developed platform is intended to provide the medical field with a technical environment for developing novel portable artificial-intelligence-based tools for diagnosis support. |