RT info:eu-repo/semantics/article
T1 Validation of deep learning techniques for quality augmentation in diffusion MRI for clinical studies
A1 Aja Fernández, Santiago
A1 Martín Martín, Carmen
A1 Planchuelo Gómez, Álvaro
A1 Faiyaz, Abrar
A1 Uddin, Md Nasir
A1 Schifitto, Giovanni
A1 Tiwari, Abhishek
A1 Shigwan, Saurabh J.
A1 Kumar Singh, Rajeev
A1 Zheng, Tianshu
A1 Cao, Zuozhen
A1 Wu, Dan
A1 Blumberg, Stefano B.
A1 Sen, Snigdha
A1 Goodwin-Allcock, Tobias
A1 Slator, Paddy J.
A1 Yigit Avci, Mehmet
A1 Li, Zihan
A1 Bilgic, Berkin
A1 Tian, Qiyuan
A1 Wang, Xinyi
A1 Tang, Zihao
A1 Cabezas, Mariano
A1 Rauland, Amelie
A1 Merhof, Dorit
A1 Manzano María, Renata
A1 Campos, Vinícius Paraníba
A1 Santini, Tales
A1 da Costa Vieira, Marcelo Andrade
A1 HashemizadehKolowri, SeyyedKazem
A1 DiBella, Edward
A1 Peng, Chenxu
A1 Shen, Zhimin
A1 Chen, Zan
A1 Ullah, Irfan
A1 Mani, Merry
A1 Abdolmotallby, Hesam
A1 Eckstrom, Samuel
A1 Baete, Steven H.
A1 Filipiak, Patryk
A1 Dong, Tanxin
A1 Fan, Qiuyun
A1 Luis García, Rodrigo de
A1 Tristán Vega, Antonio
A1 Pieciak, Tomász
K1 Deep learning
K1 Machine learning
K1 Artificial intelligence
K1 Diffusion MRI
K1 Angular resolution
K1 Diffusion tensor
AB The objective of this study is to evaluate the efficacy of deep learning (DL) techniques in improving the quality of diffusion MRI (dMRI) data in clinical applications. The study aims to determine whether the use of artificial intelligence (AI) methods in medical images may result in the loss of critical clinical information and/or the appearance of false information. To assess this, the focus was on the angular resolution of dMRI and a clinical trial was conducted on migraine, specifically between episodic and chronic migraine patients. The number of gradient directions had an impact on white matter analysis results, with statistically significant differences between groups being drastically reduced when using 21 gradient directions instead of the original 61. Fourteen teams from different institutions were tasked to use DL to enhance three diffusion metrics (FA, AD and MD) calculated from data acquired with 21 gradient directions and a b-value of 1000 s/mm2. The goal was to produce results that were comparable to those calculated from 61 gradient directions. The results were evaluated using both standard image quality metrics and Tract-Based Spatial Statistics (TBSS) to compare episodic and chronic migraine patients. The study results suggest that while most DL techniques improved the ability to detect statistical differences between groups, they also led to an increase in false positive. The results showed that there was a constant growth rate of false positives linearly proportional to the new true positives, which highlights the risk of generalization of AI-based tasks when assessing diverse clinical cohorts and training using data from a single group. The methods also showed divergent performance when replicating the original distribution of the data and some exhibited significant bias. In conclusion, extreme caution should be exercised when using AI methods for harmonization or synthesis in clinical studies when processing heterogeneous data in clinical studies, as important information may be altered, even when global metrics such as structural similarity or peak signal-to-noise ratio appear to suggest otherwise.
PB Elsevier
SN 2213-1582
YR 2023
FD 2023
LK https://uvadoc.uva.es/handle/10324/70628
UL https://uvadoc.uva.es/handle/10324/70628
LA eng
NO NeuroImage: Clinical, 2023, vol. 39, p. 103483
DS UVaDOC
RD 11-ago-2025