EULAR Abstract Archive

Bookmarked

OP0304 (2025)

AN OMERACT STUDY FOR THE DEVELOPMENT OF AN ALGORITHM FOR AUTOMATIC IDENTIFICATION OF CALCIUM PYROPHOSPHATE DEPOSITION BY ULTRASOUND: THE CRYSTAL ARTIFICIAL INTELLIGENCE MONITORING (CLAIM) STUDY

Keywords: Artificial Intelligence, Ultrasound, Imaging

G. Filippou^2,9, D. Cirillo¹, T. Bassani¹², S. Sirotti⁹, S. Gitto^3,32, A. Adinolfi⁴, E. Cipolletta^5,14, L. Coronel¹⁶, M. Diaz¹⁰, A. Di Matteo²⁸, E. Filippucci⁵, H. B. Hammer^11,30, D. MacCarter¹³, I. Möller²⁴, E. Naredo^17,18, F. Porta⁶, O. M. Olivas Vergara¹⁷, G. Sakellariou^7,29, W. Schmidt¹⁵, O. Aitisha Tabesh^25,31, G. Tamborrini^26,27, P. Todorov²², M. Arese¹, R. Fabbri¹, A. Lucia¹, A. Varvaro³, P. Sarzi-Puttini², M. A. D’Agostino⁸, P. Mandl²¹, C. Pineda²⁰, H. Keen²³, L. M. Sconfienza^3,32, L. Terslev¹⁹

¹Università Degli Studi di Milano, Department of Clinical Sciences and Community Health, Milan, Italy
²Università Degli Studi di Milano, Department of Biomedical and Clinical Sciences, Milano, Italy
³Università Degli Dtudi di Milano, Department of Biomedical Sciences for Health, Milan, Italy
⁴ASST Grande Ospedale Metropolitano Niguarda, UO Reumatologia, Milano, Italy
⁵Polytechnic University of Marche, Department of Clinical and Molecular Sciences, Ancona, Italy
⁶Santa Maria Maddalena Hospital Occhiobello, Interdisciplinary Pain Medicine Unit, Occhiobello, Italy
⁷University of Pavia, Department of Internal Medicine and Therapeutic, Pavia, Italy
⁸Università Cattolica del Sacro Cuore, Department of Rheumatology, Rome, Italy
⁹IRCCS Ospedale Galeazzi – Sant’Ambrogio, Rheumatology Department, Milan, Italy
¹⁰University Hospital Fundación Santa Fe de Bogota, Rheumatology Unit, Bogota, Colombia
¹¹Diakonhjemmet Hospital, Center for Treatment of Rheumatic and Musculoskeletal Diseases (REMEDY), Oslo, Norway
¹²IRCCS Ospedale Galeazzi - Sant’ambrogio, Milan, Italy
¹³Logan Health Whitefish MT, Department of Rheumatology, Whitefish, United States of America
¹⁴University of Nottingham, Academic Rheumatology, Nottingham, United Kingdom
¹⁵Krankenhaus Waldfriede, Rheumatology, Berlin, Germany
¹⁶Vall d’Hebron University Hospital, Ultrasound Unit, Barcelona, Spain
¹⁷Fundación Jiménez Díaz University Hospital and Health Research Institute FJD-UAM, Department of Rheumatology and Joint and Bone Research Unit, Madrid, Spain
¹⁸Autonomous University of Madrid, Madrid, Spain
¹⁹Rigshospitalet, Center for Rheumatology and Spine Diseases, Copenhagen, Denmark
²⁰Instituto Nacional de Rehabilitacion, Division of Musculoskeletal and Rheumatic Disorders, Mexico City, Mexico
²¹Medical University Vienna, Wien, Department of Internal Medicine III, Division of Rheumatology, Wien, Austria
²²Medical University of Plovdiv, Department of Internal Disease Propaedeutics and Rheumatology, Plovdiv, Bulgaria
²³University of Western Australia, School of Medicine, Perth, Australia
²⁴University of Barcelona, Instituto Poal de Reumatología, Barcelona, Spain
²⁵Lebanese University-Geitaoui, Department of Rheumatology, Beirut, Lebanon
²⁶University Hospital of Basel, Clinic for Rheumatology, Basel, Switzerland
²⁷Institute of Rheumatology, Swiss Ultrasound Center, Basel, Switzerland
²⁸Leeds Institute of Rheumatic and Muscoloskeletal Medicine, Leeds, United Kingdom
²⁹Istituti Clinici Scientifici Maugeri IRCCS Pavia, Pavia, Italy
³⁰University of Oslo, Faculty of Medicine, Oslo, Norway
³¹Lebanese American University-Rizk Hospital, Department of Rheumatology, Beirut, Lebanon
³²IRCCS Ospedale Galeazzi - Sant’Ambrogio, Department of Diagnostic and Interventional Radiology, Milano, Italy

Background: Calcium pyrophosphate deposition (CPPD) in joints is one of the most common causes of chronic arthropathy in individuals over 60 years of age. Historically, diagnosis was based on clinical presentation, evidence of crystals in synovial fluid analysis or typical radiographic findings. In recent years, ultrasound (US) has gained a central role in the diagnostic process. The OMERACT group has defined the elementary US lesions of CPPD and developed a scoring system for the extent of the deposition in the knees and wrists. Both elementary lesions and the scoring system have been thoroughly tested and validated for use in research and clinical practice. The 2023 ACR/EULAR Classification Criteria for CPPD Disease and the 2023 EULAR recommendations for the use of imaging in crystal-induced arthritis have endorsed the role of US in the diagnostic process of CPPD. Despite this progress, US remains an operator-dependent technique, and its effective application in CPPD diagnosis requires specialized training and expertise to minimize misinterpretations and diagnostic errors.

Objectives: The aim of this study is to develop an Artificial Intelligence (AI) tool for the identification and scoring of CPPD in the knee menisci.

Methods: The CLAIM study is an international OMERACT initiative involving the US CPPD and gout subtask groups, aiming to develop an AI algorithm for the identification and scoring of the elementary US lesions in gout and CPPD. Briefly, members of both OMERACT groups were asked to contribute with 108 high quality US images of CPPD (at triangular fibrocartilage of the wrist, hyaline cartilage and menisci of the knee) and 72 images of gout (double contour sign and tophi at any site), divided equally by degree (0-3) for the development of the scoring algorithm. In this abstract we present the preliminary results of the first deep learning approach for the development of the algorithm in CPPD patients at the menisci level. US images of the menisci, with any degree of deposition (from 0 to 3) as assigned by the participants and validated also by the steering committee, were included in this analysis. The region of interest, traced manually with dedicated software (3DSlicer software v5.6), included only the meniscus. For this first attempt, a transfer learning approach was employed using convolutional neural network (CNN) models pretrained on the ImageNet dataset. The models were fine-tuned with the training and validation data. Hyperparameters were systematically tested to identify the optimal CNN architecture and settings. To increase variability, the following random augmentations were applied during training: rotation (maximum ±20°), height shift (10%), width shift (10%), zoom range (10%), and horizontal flipping. Sample weighting correction was applied during training to address class imbalance. Each model was trained for a maximum of 100 epochs, with early stopping triggered if the validation loss did not improve after 10 consecutive epochs. Model performance was compared based on accuracy in classifying the validation set. Additional metrics - precision (correct classifications per predicted class), recall (correct classifications per actual class), and F1-score (harmonic mean of precision and recall) - were calculated for each grading class for the best-performing model. All metrics ranged from 0 to 1. All routines for image preprocessing and model implementation were conducted in Python v3.9, utilizing the OpenCV library v4.8 and the TensorFlow Keras framework v2.14.

Results: 19 rheumatologists from 10 countries (3 continents, Europe, Oceania and America) contributed with images in this first step. The final dataset consisted of 446 images with an unbalanced distribution among grades: 39 images (9%) for grade 0, 80 (18%) for grade 1, 215 (48%) for grade 2, and 112 (25%) for grade 3. The dataset was randomly split into training and validation sets (80% and 20%, respectively), yielding 356 and 90 samples while preserving the grading group distribution. The highest classification accuracy (CPPD yes/no) achieved on the validation set was 0.69. The overall performance for grading classes in the validation dataset is presented in Table 1. The precision of the deep learning model in correctly identifying true cases for each grading type was comparable for grades 0, 2, and 3 (ranging from 0.71 to 0.75) but lower for grade 1 (0.53). The confusion matrix displays the number of cases classified into each grading category within the validation dataset (Figure 1).

Table 1.

	precision	recall	f1-score
grade 0	0.71	0.63	0.67
grade 1	0.53	0.63	0.57
grade 2	0.73	0.74	0.74
grade 3	0.75	0.65	0.70

Figure 1.

Conclusion: In conclusion, as the overall accuracy of the model falls slightly below the ‘good’ threshold level (0.69), the performance is insufficient for practical clinical application. This limitation may be partially attributed to the small dataset size, which is likely inadequate for training deep learning models effectively, even with fine-tuning of pretrained network architectures. Future work will prioritize increasing the dataset size, ensuring data balance, accounting for cross-validation procedure and exploring alternative approaches such as semantic segmentation of the cropped images.

REFERENCES: NIL.

Acknowledgements: NIL.

Disclosure of Interests: Georgios Filippou: None declared, Daniele Cirillo: None declared, Tito Bassani: None declared, Silvia Sirotti: None declared, Salvatore Gitto: None declared, Antonella Adinolfi Janssen, Janssen, Edoardo Cipolletta IBSA, Novartis, Horizion Therapeutics, Luis Coronel: None declared, Mario Diaz: None declared, Andrea Di Matteo Janssen, Emilio Filippucci: None declared, Hilde Berner Hammer Abbvie, UCB, Novartis, Lilly, Daryl MacCarter: None declared, Ingrid Möller Bristol-Myers Squibb, Pfizer, Johnson & Johnson, AbbVie, Gebro, Esperanza Naredo: None declared, Francesco Porta Laborest, IBSA, Amgen, Otto Martin Olivas Vergara: None declared, Garifallia Sakellariou: None declared, Wolfgang Schmidt: None declared, Ouidade Aitisha Tabesh: None declared, Giorgio Tamborrini: None declared, Plamen Todorov: None declared, Marta Arese: None declared, Rodolfo Fabbri: None declared, Alessandro Lucia: None declared, Antonio Varvaro: None declared, Piercarlo Sarzi-Puttini: None declared, Maria-Antonietta D’Agostino Amgen, MSD, BMS, Astrazeneca, GSK, Galapagos, Sanofi, Novartis, J&J, Abbvie, UCB, Pfizer, Alfasigma, Eli Lilly, Amgen, MSD, BMS, Astrazeneca, GSK, Galapagos, Sanofi, Novartis, J&J, Abbvie, UCB, Pfizer, Alfasigma, Eli Lilly, Peter Mandl AbbVie, Novartis, Janssen Sobi, AbbVie, Alfasigma, Novartis, Sobi, Carlos Pineda: None declared, Helen Keen: None declared, Luca Maria Sconfienza Esaote SPA, Samsung Medison, IBSA, Fidia Farmaceutici, Esaote SPA, Lene Terslev Novartis, UCB, Janssen, GE Healthcare.

© The Authors 2025. This abstract is an open access article published in Annals of Rheumatic Diseases under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ ). Neither EULAR nor the publisher make any representation as to the accuracy of the content. The authors are solely responsible for the content in their abstract including accuracy of the facts, statements, results, conclusion, citing resources etc.

DOI: annrheumdis-2025-eular.B2032

Keywords: Artificial Intelligence, Ultrasound, Imaging

Citation: , volume 84, supplement 1, year 2025, page 242

Session: Clinical Abstract Sessions: Gout - Ready to improve outcomes? (Oral Presentations)

version:	1.02