Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer. / Kovacs, David G.; Ladefoged, Claes N.; Andersen, Kim F.; Brittain, Jane M.; Christensen, Charlotte B.; Dejanovic, Danijela; Hansen, Naja L.; Loft, Annika; Petersen, Jørgen H.; Reichkendler, Michala; Andersen, Flemming L.; Fischer, Barbara M.

I: Journal of Nuclear Medicine, Bind 65, Nr. 4, 2024, s. 623-629.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Kovacs, DG, Ladefoged, CN, Andersen, KF, Brittain, JM, Christensen, CB, Dejanovic, D, Hansen, NL, Loft, A, Petersen, JH, Reichkendler, M, Andersen, FL & Fischer, BM 2024, 'Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer', Journal of Nuclear Medicine, bind 65, nr. 4, s. 623-629. https://doi.org/10.2967/jnumed.123.266574

APA

Kovacs, D. G., Ladefoged, C. N., Andersen, K. F., Brittain, J. M., Christensen, C. B., Dejanovic, D., Hansen, N. L., Loft, A., Petersen, J. H., Reichkendler, M., Andersen, F. L., & Fischer, B. M. (2024). Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer. Journal of Nuclear Medicine, 65(4), 623-629. https://doi.org/10.2967/jnumed.123.266574

Vancouver

Kovacs DG, Ladefoged CN, Andersen KF, Brittain JM, Christensen CB, Dejanovic D o.a. Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer. Journal of Nuclear Medicine. 2024;65(4):623-629. https://doi.org/10.2967/jnumed.123.266574

Author

Kovacs, David G. ; Ladefoged, Claes N. ; Andersen, Kim F. ; Brittain, Jane M. ; Christensen, Charlotte B. ; Dejanovic, Danijela ; Hansen, Naja L. ; Loft, Annika ; Petersen, Jørgen H. ; Reichkendler, Michala ; Andersen, Flemming L. ; Fischer, Barbara M. / Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer. I: Journal of Nuclear Medicine. 2024 ; Bind 65, Nr. 4. s. 623-629.

Bibtex

@article{1f307ded7e5c4d1a95624beb1d772815,
title = "Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer",
abstract = "Artificial intelligence (AI) may decrease 18F-FDG PET/CT-based gross tumor volume (GTV) delineation variability and automate tumorvolume- derived image biomarker extraction. Hence, we aimed to identify and evaluate promising state-of-the-art deep learning methods for head and neck cancer (HNC) PET GTV delineation. Methods: We trained and evaluated deep learning methods using retrospectively included scans of HNC patients referred for radiotherapy between January 2014 and December 2019 (ISRCTN16907234). We used 3 test datasets: an internal set to compare methods, another internal set to compare AI-to-expert variability and expert interobserver variability (IOV), and an external set to compare internal and external AI-to-expert variability. Expert PET GTVs were used as the reference standard. Our benchmark IOV was measured using the PET GTV of 6 experts. The primary outcome was the Dice similarity coefficient (DSC). ANOVA was used to compare methods, a paired t test was used to compare AI-to-expert variability and expert IOV, an unpaired t test was used to compare internal and external AI-toexpert variability, and post hoc Bland-Altman analysis was used to evaluate biomarker agreement. Results: In total, 1,220 18F-FDG PET/CT scans of 1,190 patients (mean age 6 SD, 63 6 10 y; 858 men) were included, and 5 deep learning methods were trained using 5-fold cross-validation (n = 805). The nnU-Net method achieved the highest similarity (DSC, 0.80 [95% CI, 0.77-0.86]; n = 196). We found no evidence of a difference between expert IOV and AI-to-expert variability (DSC, 0.78 for AI vs. 0.82 for experts; mean difference of 0.04 [95% CI, 20.01 to 0.09]; P = 0.12; n = 64). We found no evidence of a difference between the internal and external AI-to-expert variability (DSC, 0.80 internally vs. 0.81 externally; mean difference of 0.004 [95% CI, 20.05 to 0.04]; P = 0.87; n = 125). PET GTV-derived biomarkers of AI were in good agreement with experts. Conclusion: Deep learning can be used to automate 18F-FDG PET/CT tumorvolume- derived imaging biomarkers, and the deep-learning-based volumes have the potential to assist clinical tumor volume delineation in radiation oncology.",
keywords = "18F-FDG PET/CT, deep learning, head and neck cancer, imaging biomarkers, tumor volume delineation",
author = "Kovacs, {David G.} and Ladefoged, {Claes N.} and Andersen, {Kim F.} and Brittain, {Jane M.} and Christensen, {Charlotte B.} and Danijela Dejanovic and Hansen, {Naja L.} and Annika Loft and Petersen, {J{\o}rgen H.} and Michala Reichkendler and Andersen, {Flemming L.} and Fischer, {Barbara M.}",
note = "Publisher Copyright: {\textcopyright} 2024 by the Society of Nuclear Medicine andMolecular Imaging.",
year = "2024",
doi = "10.2967/jnumed.123.266574",
language = "English",
volume = "65",
pages = "623--629",
journal = "The Journal of Nuclear Medicine",
issn = "0161-5505",
publisher = "Society of Nuclear Medicine",
number = "4",

}

RIS

TY - JOUR

T1 - Clinical Evaluation of Deep Learning for Tumor Delineation on 18F-FDG PET/CT of Head and Neck Cancer

AU - Kovacs, David G.

AU - Ladefoged, Claes N.

AU - Andersen, Kim F.

AU - Brittain, Jane M.

AU - Christensen, Charlotte B.

AU - Dejanovic, Danijela

AU - Hansen, Naja L.

AU - Loft, Annika

AU - Petersen, Jørgen H.

AU - Reichkendler, Michala

AU - Andersen, Flemming L.

AU - Fischer, Barbara M.

N1 - Publisher Copyright: © 2024 by the Society of Nuclear Medicine andMolecular Imaging.

PY - 2024

Y1 - 2024

N2 - Artificial intelligence (AI) may decrease 18F-FDG PET/CT-based gross tumor volume (GTV) delineation variability and automate tumorvolume- derived image biomarker extraction. Hence, we aimed to identify and evaluate promising state-of-the-art deep learning methods for head and neck cancer (HNC) PET GTV delineation. Methods: We trained and evaluated deep learning methods using retrospectively included scans of HNC patients referred for radiotherapy between January 2014 and December 2019 (ISRCTN16907234). We used 3 test datasets: an internal set to compare methods, another internal set to compare AI-to-expert variability and expert interobserver variability (IOV), and an external set to compare internal and external AI-to-expert variability. Expert PET GTVs were used as the reference standard. Our benchmark IOV was measured using the PET GTV of 6 experts. The primary outcome was the Dice similarity coefficient (DSC). ANOVA was used to compare methods, a paired t test was used to compare AI-to-expert variability and expert IOV, an unpaired t test was used to compare internal and external AI-toexpert variability, and post hoc Bland-Altman analysis was used to evaluate biomarker agreement. Results: In total, 1,220 18F-FDG PET/CT scans of 1,190 patients (mean age 6 SD, 63 6 10 y; 858 men) were included, and 5 deep learning methods were trained using 5-fold cross-validation (n = 805). The nnU-Net method achieved the highest similarity (DSC, 0.80 [95% CI, 0.77-0.86]; n = 196). We found no evidence of a difference between expert IOV and AI-to-expert variability (DSC, 0.78 for AI vs. 0.82 for experts; mean difference of 0.04 [95% CI, 20.01 to 0.09]; P = 0.12; n = 64). We found no evidence of a difference between the internal and external AI-to-expert variability (DSC, 0.80 internally vs. 0.81 externally; mean difference of 0.004 [95% CI, 20.05 to 0.04]; P = 0.87; n = 125). PET GTV-derived biomarkers of AI were in good agreement with experts. Conclusion: Deep learning can be used to automate 18F-FDG PET/CT tumorvolume- derived imaging biomarkers, and the deep-learning-based volumes have the potential to assist clinical tumor volume delineation in radiation oncology.

AB - Artificial intelligence (AI) may decrease 18F-FDG PET/CT-based gross tumor volume (GTV) delineation variability and automate tumorvolume- derived image biomarker extraction. Hence, we aimed to identify and evaluate promising state-of-the-art deep learning methods for head and neck cancer (HNC) PET GTV delineation. Methods: We trained and evaluated deep learning methods using retrospectively included scans of HNC patients referred for radiotherapy between January 2014 and December 2019 (ISRCTN16907234). We used 3 test datasets: an internal set to compare methods, another internal set to compare AI-to-expert variability and expert interobserver variability (IOV), and an external set to compare internal and external AI-to-expert variability. Expert PET GTVs were used as the reference standard. Our benchmark IOV was measured using the PET GTV of 6 experts. The primary outcome was the Dice similarity coefficient (DSC). ANOVA was used to compare methods, a paired t test was used to compare AI-to-expert variability and expert IOV, an unpaired t test was used to compare internal and external AI-toexpert variability, and post hoc Bland-Altman analysis was used to evaluate biomarker agreement. Results: In total, 1,220 18F-FDG PET/CT scans of 1,190 patients (mean age 6 SD, 63 6 10 y; 858 men) were included, and 5 deep learning methods were trained using 5-fold cross-validation (n = 805). The nnU-Net method achieved the highest similarity (DSC, 0.80 [95% CI, 0.77-0.86]; n = 196). We found no evidence of a difference between expert IOV and AI-to-expert variability (DSC, 0.78 for AI vs. 0.82 for experts; mean difference of 0.04 [95% CI, 20.01 to 0.09]; P = 0.12; n = 64). We found no evidence of a difference between the internal and external AI-to-expert variability (DSC, 0.80 internally vs. 0.81 externally; mean difference of 0.004 [95% CI, 20.05 to 0.04]; P = 0.87; n = 125). PET GTV-derived biomarkers of AI were in good agreement with experts. Conclusion: Deep learning can be used to automate 18F-FDG PET/CT tumorvolume- derived imaging biomarkers, and the deep-learning-based volumes have the potential to assist clinical tumor volume delineation in radiation oncology.

KW - 18F-FDG PET/CT

KW - deep learning

KW - head and neck cancer

KW - imaging biomarkers

KW - tumor volume delineation

U2 - 10.2967/jnumed.123.266574

DO - 10.2967/jnumed.123.266574

M3 - Journal article

C2 - 38388516

AN - SCOPUS:85189198814

VL - 65

SP - 623

EP - 629

JO - The Journal of Nuclear Medicine

JF - The Journal of Nuclear Medicine

SN - 0161-5505

IS - 4

ER -

ID: 388021210