Bone & Joint Open
Vol. 4, Issue 8 | Pages 584 - 593
15 Aug 2023
Sainio H Rämö L Reito A Silvasti-Lundell M Lindahl J

Aims. Several previously identified patient-, injury-, and treatment-related factors are associated with the development of nonunion in distal femur fractures. However, the predictive value of these factors is not well defined. We aimed to assess the predictive ability of previously identified risk factors in the development of nonunion leading to secondary surgery in distal femur fractures. Methods. We conducted a retrospective cohort study of adult patients with traumatic distal femur fracture treated with lateral locking plate between 2009 and 2018. The patients who underwent secondary surgery due to fracture healing problem or plate failure were considered having nonunion. Background knowledge of risk factors of distal femur fracture nonunion based on previous literature was used to form an initial set of variables. A logistic regression model was used with previously identified patient- and injury-related variables (age, sex, BMI, diabetes, smoking, periprosthetic fracture, open fracture, trauma energy, fracture zone length, fracture comminution, medial side comminution) in the first analysis and with treatment-related variables (different surgeon-controlled factors, e.g. plate length, screw placement, and proximal fixation) in the second analysis to predict the nonunion leading to secondary surgery in distal femur fractures. Results. We were able to include 299 fractures in 291 patients. Altogether, 31/299 fractures (10%) developed nonunion. In the first analysis, pseudo-R² was 0.27 and area under the receiver operating characteristic curve (AUC) was 0.81. BMI was the most important variable in the prediction. In the second analysis, pseudo-R² was 0.06 and AUC was 0.67. Plate length was the most important variable in the prediction. Conclusion. The model including patient- and injury-related factors had moderate fit and predictive ability in the prediction of distal femur fracture nonunion leading to secondary surgery.
BMI was the most important variable in prediction of nonunion. Surgeon-controlled factors had a minor role in prediction of nonunion. Cite this article: Bone Jt Open 2023;4(8):584–593


Bone & Joint Open
Vol. 5, Issue 11 | Pages 962 - 970
4 Nov 2024
Suter C Mattila H Ibounig T Sumrein BO Launonen A Järvinen TLN Lähdeoja T Rämö L

Aims

Though most humeral shaft fractures heal nonoperatively, up to one-third may lead to nonunion with inferior outcomes. The Radiographic Union Score for HUmeral Fractures (RUSHU) was created to identify high-risk patients for nonunion. Our study evaluated the RUSHU’s prognostic performance at six and 12 weeks in discriminating nonunion within a significantly larger cohort than before.

Methods

Our study included 226 nonoperatively treated humeral shaft fractures. We evaluated the interobserver reliability and intraobserver reproducibility of RUSHU scoring using intraclass correlation coefficients (ICCs). Additionally, we determined the optimal cut-off thresholds for predicting nonunion using the receiver operating characteristic (ROC) method.
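
As an illustration of the ROC-based cut-off selection mentioned in the Methods, the following is a minimal sketch only. The abstract does not say which criterion was used to pick the optimal threshold; the Youden index (sensitivity + specificity - 1) is assumed here. The function name, the toy scores and outcomes, and the convention that a lower score means a higher risk of nonunion are all hypothetical and are not taken from the study.

```python
def youden_cutoff(scores, nonunion):
    """Return (cutoff, sensitivity, specificity) maximising Youden's J.

    Assumption: a fracture is flagged as high risk when score <= cutoff.
    """
    positives = sum(nonunion)
    negatives = len(nonunion) - positives
    best = None
    for cutoff in sorted(set(scores)):
        # True positives: nonunions that were flagged as high risk.
        tp = sum(1 for s, y in zip(scores, nonunion) if s <= cutoff and y)
        # True negatives: united fractures that were not flagged.
        tn = sum(1 for s, y in zip(scores, nonunion) if s > cutoff and not y)
        sensitivity = tp / positives
        specificity = tn / negatives
        j = sensitivity + specificity - 1
        if best is None or j > best[0]:
            best = (j, cutoff, sensitivity, specificity)
    return best[1], best[2], best[3]


# Hypothetical toy data: radiographic scores and nonunion outcomes (1 = nonunion).
toy_scores = [5, 6, 7, 7, 8, 9, 10, 11, 12, 12]
toy_nonunion = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

if __name__ == "__main__":
    print(youden_cutoff(toy_scores, toy_nonunion))
```

Other criteria (for example, fixing a minimum sensitivity) would give different thresholds, so the choice of criterion should be reported alongside any cut-off.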


Bone & Joint Research
Vol. 5, Issue 6 | Pages 232 - 238
1 Jun 2016
Tanaka A Yoshimura Y Aoki K Kito M Okamoto M Suzuki S Momose T Kato H

Objectives. Our objective was to predict the knee extension strength and post-operative function in quadriceps resection for soft-tissue sarcoma of the thigh. Methods. A total of 18 patients (14 men, four women) underwent total or partial quadriceps resection for soft-tissue sarcoma of the thigh between 2002 and 2014. The number of resected quadriceps was surveyed, knee extension strength was measured with the Biodex isokinetic dynamometer system (affected side/unaffected side) and relationships between these were examined. The Musculoskeletal Tumor Society (MSTS) score, Toronto Extremity Salvage Score (TESS), European Quality of Life-5 Dimensions (EQ-5D) score and the Short Form 8 were used to evaluate post-operative function and examine correlations with extension strength. The cutoff value for extension strength to expect good post-operative function was also calculated using a receiver operating characteristic (ROC) curve and Fisher’s exact test. Results. Extension strength decreased when the number of resected quadriceps increased (p < 0.001), and was associated with lower MSTS score, TESS and EQ-5D (p = 0.004, p = 0.005, p = 0.006, respectively). Based on the functional evaluation scales, the cutoff value of extension strength was 56.2%, the equivalent to muscle strength with resection of up to two muscles. Conclusion. Good post-operative results can be expected if at least two quadriceps muscles are preserved. Cite this article: A. Tanaka, Y. Yoshimura, K. Aoki, M. Kito, M. Okamoto, S. Suzuki, T. Momose, H. Kato. Knee extension strength and post-operative functional prediction in quadriceps resection for soft-tissue sarcoma of the thigh. Bone Joint Res 2016;5:232–238. DOI: 10.1302/2046-3758.56.2000631


Bone & Joint Open
Vol. 4, Issue 10 | Pages 750 - 757
10 Oct 2023
Brenneis M Thewes N Holder J Stief F Braun S

Aims. Accurate skeletal age and final adult height prediction methods in paediatric orthopaedics are crucial for determining optimal timing of growth-guiding interventions and minimizing complications in treatments of various conditions. This study aimed to evaluate the accuracy of final adult height predictions using the central peak height (CPH) method with long leg X-rays and four different multiplier tables. Methods. This study included 31 patients who underwent temporary hemiepiphysiodesis for varus or valgus deformity of the leg between 2014 and 2020. The skeletal age at surgical intervention was evaluated using the CPH method with long leg radiographs. The true final adult height (FH_TRUE) was determined when the growth plates were closed. The final height prediction accuracy of four different multiplier tables (1. Bayley and Pinneau; 2. Paley et al; 3. Sanders – Greulich and Pyle (SGP); and 4. Sanders – peak height velocity (PHV)) was then compared using either skeletal age or chronological age. Results. All final adult height predictions overestimated the FH_TRUE, with the SGP multiplier table having the lowest overestimation and lowest absolute deviation when using both chronological age and skeletal age. There were no significant differences in final height prediction accuracy between using skeletal age and chronological age with PHV (p = 0.652) or SGP multiplier tables (p = 0.969). Adult height predictions with chronological age and SGP (r = 0.769; p ≤ 0.001), as well as chronological age and PHV (r = 0.822; p ≤ 0.001), showed higher correlations with FH_TRUE than predictions with skeletal age and SGP (r = 0.657; p ≤ 0.001) or skeletal age and PHV (r = 0.707; p ≤ 0.001). Conclusion. There was no significant improvement in adult height prediction accuracy when using the CPH method compared to chronological age alone.
The study concludes that there is no advantage in routinely using the CPH method for skeletal age determination over the simple use of chronological age. The findings highlight the need for more accurate methods to predict final adult height in contemporary patient populations. Cite this article: Bone Jt Open 2023;4(10):750–757


The Bone & Joint Journal
Vol. 104-B, Issue 9 | Pages 1011 - 1016
1 Sep 2022
Acem I van de Sande MAJ

Prediction tools are instruments which are commonly used to estimate the prognosis in oncology and facilitate clinical decision-making in a more personalized manner. Their popularity is shown by the increasing numbers of prediction tools, which have been described in the medical literature. Many of these tools have been shown to be useful in the field of soft-tissue sarcoma of the extremities (eSTS). In this annotation, we aim to provide an overview of the available prediction tools for eSTS, provide an approach for clinicians to evaluate the performance and usefulness of the available tools for their own patients, and discuss their possible applications in the management of patients with an eSTS. Cite this article: Bone Joint J 2022;104-B(9):1011–1016


Bone & Joint Open
Vol. 5, Issue 3 | Pages 243 - 251
25 Mar 2024
Wan HS Wong DLL To CS Meng N Zhang T Cheung JPY

Aims. This systematic review aims to identify 3D predictors derived from biplanar reconstruction, and to describe current methods for improving curve prediction in patients with mild adolescent idiopathic scoliosis. Methods. A comprehensive search was conducted by three independent investigators on MEDLINE, PubMed, Web of Science, and Cochrane Library. Search terms included “adolescent idiopathic scoliosis”, “3D”, and “progression”. The inclusion and exclusion criteria were carefully defined to include clinical studies. Risk of bias was assessed with the Quality in Prognostic Studies tool (QUIPS) and Appraisal tool for Cross-Sectional Studies (AXIS), and level of evidence for each predictor was rated with the Grading of Recommendations, Assessment, Development, and Evaluations (GRADE) approach. In all, 915 publications were identified, with 377 articles subjected to full-text screening; overall, 31 articles were included. Results. Torsion index (TI) and apical vertebral rotation (AVR) were identified as accurate predictors of curve progression in early visits. Initial TI > 3.7° and AVR > 5.8° were predictive of curve progression. Thoracic hypokyphosis was inconsistently observed in progressive curves with weak evidence. While sagittal wedging was observed in mild curves, there is insufficient evidence for its correlation with curve progression. In curves with initial Cobb angle < 25°, Cobb angle was a poor predictor for future curve progression. Prediction accuracy was improved by incorporating serial reconstructions in stepwise layers. However, a lack of post-hoc analysis was identified in studies involving geometrical models. Conclusion. For patients with mild curves, TI and AVR were identified as predictors of curve progression, with TI > 3.7° and AVR > 5.8° found to be important thresholds. Cobb angle acts as a poor predictor in mild curves, and more investigations are required to assess thoracic kyphosis and wedging as predictors.
Cumulative reconstruction of radiographs improves prediction accuracy. Comprehensive analysis between progressive and non-progressive curves is recommended to extract meaningful thresholds for clinical prognostication. Cite this article: Bone Jt Open 2024;5(3):243–251


Bone & Joint Open
Vol. 4, Issue 3 | Pages 168 - 181
14 Mar 2023
Dijkstra H Oosterhoff JHF van de Kuit A IJpma FFA Schwab JH Poolman RW Sprague S Bzovsky S Bhandari M Swiontkowski M Schemitsch EH Doornberg JN Hendrickx LAM

Aims. To develop prediction models using machine-learning (ML) algorithms for 90-day and one-year mortality prediction in femoral neck fracture (FNF) patients aged 50 years or older based on the Hip fracture Evaluation with Alternatives of Total Hip arthroplasty versus Hemiarthroplasty (HEALTH) and Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trials. Methods. This study included 2,388 patients from the HEALTH and FAITH trials, with 90-day and one-year mortality proportions of 3.0% (71/2,388) and 6.4% (153/2,388), respectively. The mean age was 75.9 years (SD 10.8) and 65.9% of patients (1,574/2,388) were female. The algorithms included patient and injury characteristics. Six algorithms were developed, internally validated and evaluated across discrimination (c-statistic; discriminative ability between those with risk of mortality and those without), calibration (observed outcome compared to the predicted probability), and the Brier score (composite of discrimination and calibration). Results. The developed algorithms distinguished between patients at high and low risk for 90-day and one-year mortality. The penalized logistic regression algorithm had the best performance metrics for both 90-day (c-statistic 0.80, calibration slope 0.95, calibration intercept -0.06, and Brier score 0.039) and one-year (c-statistic 0.76, calibration slope 0.86, calibration intercept -0.20, and Brier score 0.074) mortality prediction in the hold-out set. Conclusion. Using high-quality data, the ML-based prediction models accurately predicted 90-day and one-year mortality in patients aged 50 years or older with a FNF. The final models must be externally validated to assess generalizability to other populations, and prospectively evaluated in the process of shared decision-making. Cite this article: Bone Jt Open 2023;4(3):168–181
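
The performance measures named in this abstract can be made concrete with a small sketch. It assumes the standard definitions: the Brier score as the mean squared difference between predicted probability and observed outcome, and the c-statistic as the proportion of (event, non-event) pairs in which the event received the higher predicted risk, with ties counted as one half. The function names and the toy predictions are hypothetical and are not drawn from the HEALTH or FAITH data.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between predicted risk and observed outcome (0 or 1)."""
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def c_statistic(probs, outcomes):
    """Share of (event, non-event) pairs ranked correctly; ties count as 0.5."""
    events = [p for p, y in zip(probs, outcomes) if y == 1]
    non_events = [p for p, y in zip(probs, outcomes) if y == 0]
    concordant = 0.0
    for p_event in events:
        for p_non_event in non_events:
            if p_event > p_non_event:
                concordant += 1.0
            elif p_event == p_non_event:
                concordant += 0.5
    return concordant / (len(events) * len(non_events))


# Hypothetical toy data: predicted mortality risk and observed outcome (1 = died).
toy_probs = [0.02, 0.05, 0.10, 0.30, 0.40, 0.70]
toy_died = [0, 0, 1, 0, 1, 1]

if __name__ == "__main__":
    print(brier_score(toy_probs, toy_died))
    print(c_statistic(toy_probs, toy_died))
```

A lower Brier score and a higher c-statistic both indicate better performance. When events are rare, as with the mortality proportions reported here, a model that predicts a low risk for everyone already achieves a small Brier score, so the value is best read against that baseline.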


The Bone & Joint Journal
Vol. 106-B, Issue 1 | Pages 19 - 27
1 Jan 2024
Tang H Guo S Ma Z Wang S Zhou Y

Aims. The aim of this study was to evaluate the reliability and validity of a patient-specific algorithm which we developed for predicting changes in sagittal pelvic tilt after total hip arthroplasty (THA). Methods. This retrospective study included 143 patients who underwent 171 THAs between April 2019 and October 2020 and had full-body lateral radiographs preoperatively and at one year postoperatively. We measured the pelvic incidence (PI), the sagittal vertical axis (SVA), pelvic tilt, sacral slope (SS), lumbar lordosis (LL), and thoracic kyphosis to classify patients into types A, B1, B2, B3, and C. The change of pelvic tilt was predicted according to the normal range of SVA (0 mm to 50 mm) for types A, B1, B2, and B3, and based on the absolute value of one-third of the PI-LL mismatch for type C patients. The reliability of the classification of the patients and the prediction of the change of pelvic tilt were assessed using kappa values and intraclass correlation coefficients (ICCs), respectively. Validity was assessed using the overall mean error and mean absolute error (MAE) for the prediction of the change of pelvic tilt. Results. The kappa values were 0.927 (95% confidence interval (CI) 0.861 to 0.992) and 0.945 (95% CI 0.903 to 0.988) for the inter- and intraobserver reliabilities, respectively, and the ICCs ranged from 0.919 to 0.997. The overall mean error and MAE for the prediction of the change of pelvic tilt were -0.3° (SD 3.6°) and 2.8° (SD 2.4°), respectively. The overall absolute change of pelvic tilt was 5.0° (SD 4.1°). Pre- and postoperative values and changes in pelvic tilt, SVA, SS, and LL varied significantly among the five types of patient. Conclusion. We found that the proposed algorithm was reliable and valid for predicting the standing pelvic tilt after THA. Cite this article: Bone Joint J 2024;106-B(1):19–27
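
The two validity measures reported here answer different questions, which a short sketch can show: the mean error captures systematic bias (errors of opposite sign cancel), while the mean absolute error (MAE) captures typical error size. The sign convention (predicted minus observed), the function names, and the toy values in degrees are assumptions for illustration; the abstract does not state the convention used.

```python
def mean_error(predicted, observed):
    """Average signed error; values near zero suggest little systematic bias."""
    errors = [p - o for p, o in zip(predicted, observed)]
    return sum(errors) / len(errors)


def mean_absolute_error(predicted, observed):
    """Average error magnitude, ignoring direction."""
    errors = [abs(p - o) for p, o in zip(predicted, observed)]
    return sum(errors) / len(errors)


# Hypothetical toy data: predicted and observed change in pelvic tilt (degrees).
toy_predicted = [4.0, -2.0, 6.0, 1.0]
toy_observed = [5.0, -4.0, 3.0, 2.0]

if __name__ == "__main__":
    print(mean_error(toy_predicted, toy_observed))
    print(mean_absolute_error(toy_predicted, toy_observed))
```

This is why a mean error close to zero, such as the -0.3° reported above, can coexist with a larger MAE of 2.8°.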


Bone & Joint Research
Vol. 13, Issue 4 | Pages 184 - 192
18 Apr 2024
Morita A Iida Y Inaba Y Tezuka T Kobayashi N Choe H Ike H Kawakami E

Aims. This study was designed to develop a model for predicting bone mineral density (BMD) loss of the femur after total hip arthroplasty (THA) using artificial intelligence (AI), and to identify factors that influence the prediction. Additionally, we virtually examined the efficacy of administration of bisphosphonate for cases with severe BMD loss based on the predictive model. Methods. The study included 538 joints that underwent primary THA. The patients were divided into groups using unsupervised time series clustering for five-year BMD loss of Gruen zone 7 postoperatively, and a machine-learning model to predict the BMD loss was developed. Additionally, the predictor for BMD loss was extracted using SHapley Additive exPlanations (SHAP). The patient-specific efficacy of bisphosphonate, which is the most important categorical predictor for BMD loss, was examined by calculating the change in predictive probability when hypothetically switching between the inclusion and exclusion of bisphosphonate. Results. Time series clustering allowed us to divide the patients into two groups, and the predictive factors were identified including patient- and operation-related factors. The area under the receiver operating characteristic (ROC) curve (AUC) for the BMD loss prediction averaged 0.734. Virtual administration of bisphosphonate showed on average 14% efficacy in preventing BMD loss of zone 7. Additionally, stem types and preoperative triglyceride (TG), creatinine (Cr), estimated glomerular filtration rate (eGFR), and creatine kinase (CK) showed significant association with the estimated patient-specific efficacy of bisphosphonate. Conclusion. Periprosthetic BMD loss after THA is predictable based on patient- and operation-related factors, and optimal prescription of bisphosphonate based on the prediction may prevent BMD loss. Cite this article: Bone Joint Res 2024;13(4):184–192


Bone & Joint Open
Vol. 4, Issue 6 | Pages 399 - 407
1 Jun 2023
Yeramosu T Ahmad W Satpathy J Farrar JM Golladay GJ Patel NK

Aims

To identify variables independently associated with same-day discharge (SDD) of patients following revision total knee arthroplasty (rTKA) and to develop machine learning algorithms to predict suitable candidates for outpatient rTKA.

Methods

Data were obtained from the American College of Surgeons National Surgical Quality Improvement Program (ACS-NSQIP) database from the years 2018 to 2020. Patients with elective, unilateral rTKA procedures and a total hospital length of stay between zero and four days were included. Demographic, preoperative, and intraoperative variables were analyzed. A multivariable logistic regression (MLR) model and various machine learning techniques were compared using area under the curve (AUC), calibration, and decision curve analysis. Important and significant variables were identified from the models.


Bone & Joint Research
Vol. 8, Issue 11 | Pages 563 - 569
1 Nov 2019
Koh Y Lee J Lee H Kim H Kang K

Objectives

Unicompartmental knee arthroplasty (UKA) is an alternative to total knee arthroplasty for patients with isolated medial or lateral compartment osteoarthritis. However, polyethylene wear can significantly reduce the lifespan of UKA. Different bearing designs and materials for UKA have been developed to change the rate of polyethylene wear. Therefore, the objective of this study is to investigate the effect of insert conformity and material on the predicted wear in mobile-bearing UKA using a previously developed computational wear method.

Methods

Two different designs were tested with the same femoral component under identical kinematic input: anatomy mimetic design (AMD) and conforming design inserts with different conformity levels. The insert materials were standard or crosslinked ultra-high-molecular-weight polyethylene (UHMWPE). We evaluated the contact pressure, contact area, wear rate, wear depth, and volumetric wear under gait cycle loading conditions.


Bone & Joint Open
Vol. 5, Issue 1 | Pages 9 - 19
16 Jan 2024
Dijkstra H van de Kuit A de Groot TM Canta O Groot OQ Oosterhoff JH Doornberg JN

Aims. Machine-learning (ML) prediction models in orthopaedic trauma hold great promise in assisting clinicians in various tasks, such as personalized risk stratification. However, an overview of current applications and critical appraisal to peer-reviewed guidelines is lacking. The objectives of this study are to 1) provide an overview of current ML prediction models in orthopaedic trauma; 2) evaluate the completeness of reporting following the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement; and 3) assess the risk of bias following the Prediction model Risk Of Bias Assessment Tool (PROBAST) tool. Methods. A systematic search screening 3,252 studies identified 45 ML-based prediction models in orthopaedic trauma up to January 2023. The TRIPOD statement assessed transparent reporting and the PROBAST tool the risk of bias. Results. A total of 40 studies reported on training and internal validation; four studies performed both development and external validation, and one study performed only external validation. The most commonly reported outcomes were mortality (33%, 15/45) and length of hospital stay (9%, 4/45), and the majority of prediction models were developed in the hip fracture population (60%, 27/45). The overall median completeness for the TRIPOD statement was 62% (interquartile range 30 to 81%). The overall risk of bias in the PROBAST tool was low in 24% (11/45), high in 69% (31/45), and unclear in 7% (3/45) of the studies. High risk of bias was mainly due to analysis domain concerns including small datasets with low number of outcomes, complete-case analysis in case of missing data, and no reporting of performance measures. Conclusion. The results of this study showed that despite a myriad of potential clinically useful applications, a substantial part of ML studies in orthopaedic trauma lack transparent reporting, and are at high risk of bias. 
These problems must be resolved by following established guidelines to instil confidence in ML models among patients and clinicians. Otherwise, there will remain a sizeable gap between the development of ML prediction models and their clinical application in our day-to-day orthopaedic trauma practice. Cite this article: Bone Jt Open 2024;5(1):9–19


Bone & Joint Open
Vol. 5, Issue 8 | Pages 671 - 680
14 Aug 2024
Fontalis A Zhao B Putzeys P Mancino F Zhang S Vanspauwen T Glod F Plastow R Mazomenos E Haddad FS

Aims. Precise implant positioning, tailored to individual spinopelvic biomechanics and phenotype, is paramount for stability in total hip arthroplasty (THA). Despite a few studies on instability prediction, there is a notable gap in research utilizing artificial intelligence (AI). The objective of our pilot study was to evaluate the feasibility of developing an AI algorithm tailored to individual spinopelvic mechanics and patient phenotype for predicting impingement. Methods. This international, multicentre prospective cohort study across two centres encompassed 157 adults undergoing primary robotic arm-assisted THA. Impingement during specific flexion and extension stances was identified using the virtual range of motion (ROM) tool of the robotic software. The primary AI model, the Light Gradient-Boosting Machine (LGBM), used tabular data to predict impingement presence, direction (flexion or extension), and type. A secondary model integrating tabular data with plain anteroposterior pelvis radiographs was evaluated to assess for any potential enhancement in prediction accuracy. Results. We identified nine predictors from an analysis of baseline spinopelvic characteristics and surgical planning parameters. Using fivefold cross-validation, the LGBM achieved 70.2% impingement prediction accuracy. With impingement data, the LGBM estimated direction with 85% accuracy, while the support vector machine (SVM) determined impingement type with 72.9% accuracy. After integrating imaging data with a multilayer perceptron (tabular) and a convolutional neural network (radiograph), the LGBM’s prediction was 68.1%. Both combined and LGBM-only had similar impingement direction prediction rates (around 84.5%). Conclusion. This study is a pioneering effort in leveraging AI for impingement prediction in THA, utilizing a comprehensive, real-world clinical dataset. Our machine-learning algorithm demonstrated promising accuracy in predicting impingement, its type, and direction. 
While the addition of imaging data to our deep-learning algorithm did not boost accuracy, the potential for refined annotations, such as landmark markings, offers avenues for future enhancement. Prior to clinical integration, external validation and larger-scale testing of this algorithm are essential. Cite this article: Bone Jt Open 2024;5(8):671–680


Bone & Joint Open
Vol. 3, Issue 5 | Pages 383 - 389
1 May 2022
Motesharei A Batailler C De Massari D Vincent G Chen AF Lustig S

Aims. No predictive model has been published to forecast operating time for total knee arthroplasty (TKA). The aims of this study were to design and validate a predictive model to estimate operating time for robotic-assisted TKA based on demographic data, and evaluate the added predictive power of CT scan-based predictors and their impact on the accuracy of the predictive model. Methods. A retrospective study was conducted on 1,061 TKAs performed from January 2016 to December 2019 with an image-based robotic-assisted system. Demographic data included age, sex, height, and weight. The femoral and tibial mechanical axis and the osteophyte volume were calculated from CT scans. These inputs were used to develop a predictive model aimed to predict operating time based on demographic data only, and demographic and 3D patient anatomy data. Results. The key factors for predicting operating time were the surgeon and patient weight, followed by 12 anatomical parameters derived from CT scans. The predictive model based only on demographic data showed that 90% of predictions were within 15 minutes of actual operating time, with 73% within ten minutes. The predictive model including demographic data and CT scans showed that 94% of predictions were within 15 minutes of actual operating time and 88% within ten minutes. Conclusion. The primary factors for predicting robotic-assisted TKA operating time were surgeon, patient weight, and osteophyte volume. This study demonstrates that incorporating 3D patient-specific data can improve operating time predictions models, which may lead to improved operating room planning and efficiency. Cite this article: Bone Jt Open 2022;3(5):383–389
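
The accuracy figures in this abstract are proportions of predictions falling within a time tolerance of the actual operating time. A minimal sketch of that calculation follows; the function name, the toy times in minutes, and the choice to treat "within" as inclusive of the boundary are assumptions, not details taken from the study.

```python
def fraction_within(predicted, actual, tolerance):
    """Proportion of predictions whose absolute error is at most `tolerance`."""
    hits = sum(1 for p, a in zip(predicted, actual) if abs(p - a) <= tolerance)
    return hits / len(predicted)


# Hypothetical toy data: predicted and actual operating times (minutes).
toy_predicted_min = [62, 70, 55, 90, 80]
toy_actual_min = [60, 85, 50, 72, 88]

if __name__ == "__main__":
    print(fraction_within(toy_predicted_min, toy_actual_min, 10))
    print(fraction_within(toy_predicted_min, toy_actual_min, 15))
```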


The Bone & Joint Journal
Vol. 104-B, Issue 4 | Pages 486 - 494
4 Apr 2022
Liu W Sun Z Xiong H Liu J Lu J Cai B Wang W Fan C

Aims. The aim of this study was to develop and internally validate a prognostic nomogram to predict the probability of gaining a functional range of motion (ROM ≥ 120°) after open arthrolysis of the elbow in patients with post-traumatic stiffness of the elbow. Methods. We developed the Shanghai Prediction Model for Elbow Stiffness Surgical Outcome (SPESSO) based on a dataset of 551 patients who underwent open arthrolysis of the elbow in four institutions. Demographic and clinical characteristics were collected from medical records. The least absolute shrinkage and selection operator regression model was used to optimize the selection of relevant features. Multivariable logistic regression analysis was used to build the SPESSO. Its prediction performance was evaluated using the concordance index (C-index) and a calibration graph. Internal validation was conducted using bootstrapping validation. Results. BMI, the duration of stiffness, the preoperative ROM, the preoperative intensity of pain, and grade of post-traumatic osteoarthritis of the elbow were identified as predictors of outcome and incorporated to construct the nomogram. SPESSO displayed good discrimination with a C-index of 0.73 (95% confidence interval 0.64 to 0.81). A high C-index value of 0.70 could still be reached in the internal validation. The calibration graph showed good agreement between the nomogram prediction and the outcome. Conclusion. The newly developed SPESSO is a valid and convenient model which can be used to predict the outcome of open arthrolysis of the elbow. It could assist clinicians in counselling patients regarding the choice and expectations of treatment. Cite this article: Bone Joint J 2022;104-B(4):486–494


The Bone & Joint Journal
Vol. 106-B, Issue 11 | Pages 1216 - 1222
1 Nov 2024
Castagno S Gompels B Strangmark E Robertson-Waters E Birch M van der Schaar M McCaskie AW

Aims. Machine learning (ML), a branch of artificial intelligence that uses algorithms to learn from data and make predictions, offers a pathway towards more personalized and tailored surgical treatments. This approach is particularly relevant to prevalent joint diseases such as osteoarthritis (OA). In contrast to end-stage disease, where joint arthroplasty provides excellent results, early stages of OA currently lack effective therapies to halt or reverse progression. Accurate prediction of OA progression is crucial if timely interventions are to be developed, to enhance patient care and optimize the design of clinical trials. Methods. A systematic review was conducted in accordance with PRISMA guidelines. We searched MEDLINE and Embase on 5 May 2024 for studies utilizing ML to predict OA progression. Titles and abstracts were independently screened, followed by full-text reviews for studies that met the eligibility criteria. Key information was extracted and synthesized for analysis, including types of data (such as clinical, radiological, or biochemical), definitions of OA progression, ML algorithms, validation methods, and outcome measures. Results. Out of 1,160 studies initially identified, 39 were included. Most studies (85%) were published between 2020 and 2024, with 82% using publicly available datasets, primarily the Osteoarthritis Initiative. ML methods were predominantly supervised, with significant variability in the definitions of OA progression: most studies focused on structural changes (59%), while fewer addressed pain progression or both. Deep learning was used in 44% of studies, while automated ML was used in 5%. There was a lack of standardization in evaluation metrics and limited external validation. Interpretability was explored in 54% of studies, primarily using SHapley Additive exPlanations. Conclusion. 
Our systematic review demonstrates the feasibility of ML models in predicting OA progression, but also uncovers critical limitations that currently restrict their clinical applicability. Future priorities should include diversifying data sources, standardizing outcome measures, enforcing rigorous validation, and integrating more sophisticated algorithms. This paradigm shift from predictive modelling to actionable clinical tools has the potential to transform patient care and disease management in orthopaedic practice. Cite this article: Bone Joint J 2024;106-B(11):1216–1222


The Bone & Joint Journal
Vol. 106-B, Issue 11 | Pages 1333 - 1341
1 Nov 2024
Cheung PWH Leung JHM Lee VWY Cheung JPY

Aims. Developmental cervical spinal stenosis (DcSS) is a well-known predisposing factor for degenerative cervical myelopathy (DCM) but there is a lack of consensus on its definition. This study aims to define DcSS based on MRI, and its multilevel characteristics, to assess the prevalence of DcSS in the general population, and to evaluate the presence of DcSS in the prediction of developing DCM. Methods. This cross-sectional study analyzed MRI spine morphological parameters at C3 to C7 (including anteroposterior (AP) diameter of spinal canal, spinal cord, and vertebral body) from DCM patients (n = 95) and individuals recruited from the general population (n = 2,019). Level-specific median AP spinal canal diameter from DCM patients was used to screen for stenotic levels in the population-based cohort. An individual with multilevel (≥ 3 vertebral levels) AP canal diameter smaller than the DCM median values was considered as having DcSS. The optimal cut-off canal diameter per level for DcSS was determined by receiver operating characteristic analyses, and multivariable logistic regression was performed for the prediction of developing DCM that required surgery. Results. A total of 2,114 individuals aged 64.6 years (SD 11.9) who underwent surgery from March 2009 to December 2016 were studied. The optimal cut-off canal diameters for DcSS are: C3 < 12.9 mm, C4 < 11.8 mm, C5 < 11.9 mm, C6 < 12.3 mm, and C7 < 13.3 mm. Overall, 13.0% (262 of 2,019) of the population-based cohort had multilevel DcSS. Multilevel DcSS (odds ratio (OR) 6.12 (95% CI 3.97 to 9.42); p < 0.001) and male sex (OR 4.06 (95% CI 2.55 to 6.45); p < 0.001) were predictors of developing DCM. Conclusion. This is the first MRI-based study for defining DcSS with multilevel canal narrowing. Level-specific cut-off canal diameters for DcSS can be used for early identification of individuals at risk of developing DCM. Individuals with DcSS at ≥ three levels and male sex are recommended for close monitoring or early intervention to avoid traumatic spinal cord injuries from stenosis. Cite this article: Bone Joint J 2024;106-B(11):1333–1341
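The screening rule described in this abstract (a canal diameter below a level-specific cut-off at three or more levels) can be illustrated with a minimal Python sketch. The cut-offs are the ones reported above; the function names, the dictionary layout, and the example measurements are hypothetical, and the sketch assumes that the reported cut-offs are applied with the same "≥ 3 levels" rule, which the abstract does not spell out.

```python
# Level-specific cut-offs in mm, as reported in the abstract above.
CUTOFFS_MM = {"C3": 12.9, "C4": 11.8, "C5": 11.9, "C6": 12.3, "C7": 13.3}


def stenotic_levels(ap_diameters_mm):
    """Return the levels whose AP canal diameter is strictly below the cut-off.

    ap_diameters_mm is a hypothetical mapping such as {"C3": 12.5, ...};
    levels without a measurement are ignored.
    """
    return [
        level
        for level, cutoff in CUTOFFS_MM.items()
        if level in ap_diameters_mm and ap_diameters_mm[level] < cutoff
    ]


def has_multilevel_dcss(ap_diameters_mm, min_levels=3):
    """Assumed rule: flag DcSS when at least min_levels levels are stenotic."""
    return len(stenotic_levels(ap_diameters_mm)) >= min_levels


# Hypothetical measurements for one individual (not data from the study).
example = {"C3": 12.5, "C4": 11.5, "C5": 12.0, "C6": 12.1, "C7": 13.5}
print(stenotic_levels(example))      # ['C3', 'C4', 'C6']
print(has_multilevel_dcss(example))  # True
```

Keeping the cut-offs in one mapping makes it easy to swap in other thresholds, for example the DCM median values used for the initial screening, which the abstract does not report.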


Bone & Joint Open
Vol. 3, Issue 7 | Pages 573 - 581
1 Jul 2022
Clement ND Afzal I Peacock CJH MacDonald D Macpherson GJ Patton JT Asopa V Sochart DH Kader DF

Aims. The aims of this study were to assess mapping models to predict the three-level version of EuroQoL five-dimension utility index (EQ-5D-3L) from the Oxford Knee Score (OKS) and validate these before and after total knee arthroplasty (TKA). Methods. A retrospective cohort of 5,857 patients was used to create the prediction models, and a second cohort of 721 patients from a different centre was used to validate the models, all of whom underwent TKA. Patient characteristics, BMI, OKS, and EQ-5D-3L were collected preoperatively and one year postoperatively. Generalized linear regression was used to formulate the prediction models. Results. There were significant correlations between the OKS and EQ-5D-3L preoperatively (r = 0.68; p < 0.001) and postoperatively (r = 0.77; p < 0.001) and for the change in the scores (r = 0.61; p < 0.001). Three different models (preoperative, postoperative, and change) were created. There were no significant differences between the actual and predicted mean EQ-5D-3L utilities at any timepoint or for change in the scores (p > 0.090) in the validation cohort. There was a significant correlation between the actual and predicted EQ-5D-3L utilities preoperatively (r = 0.63; p < 0.001) and postoperatively (r = 0.77; p < 0.001) and for the change in the scores (r = 0.56; p < 0.001). Bland-Altman plots demonstrated that lower utilities were overestimated and higher utilities were underestimated. The proportion of individual predicted EQ-5D-3L utilities that were within ± 0.05 and ± 0.010 (minimal clinically important difference (MCID)) of the actual EQ-5D-3L varied from 13% to 35% and from 26% to 64%, respectively, according to timepoint assessed and change in the scores, but was not significantly different between the modelling and validation cohorts (p ≥ 0.148). Conclusion. The OKS can be used to estimate EQ-5D-3L. Predicted individual patient utility error beyond the MCID varied from one-third to two-thirds depending on timepoint assessed, but the mean for a cohort did not differ and could be employed for this purpose. Cite this article: Bone Jt Open 2022;3(7):573–581
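The contrast this abstract draws between individual-level and cohort-level accuracy can be illustrated with a small Python sketch: the share of predictions falling within a tolerance of the actual utility, alongside the mean error for the whole group. The function names and all utility values are hypothetical, and the tolerances passed in are illustrative rather than the study's thresholds.

```python
def fraction_within(actual, predicted, tolerance):
    """Share of patients whose predicted utility is within +/- tolerance of the actual one."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("actual and predicted must be non-empty and of equal length")
    hits = sum(1 for a, p in zip(actual, predicted) if abs(p - a) <= tolerance)
    return hits / len(actual)


def mean_error(actual, predicted):
    """Mean signed error (predicted minus actual) across the cohort."""
    return sum(p - a for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical utilities: low values are over-predicted and high values
# under-predicted, mimicking the pattern described for the Bland-Altman plots.
actual = [0.20, 0.50, 0.70, 0.90]
predicted = [0.28, 0.52, 0.68, 0.75]

print(fraction_within(actual, predicted, 0.05))  # 0.5
print(fraction_within(actual, predicted, 0.10))  # 0.75
print(mean_error(actual, predicted))             # close to -0.0175
```

In this toy data, individual errors are large for half of the patients at the tighter tolerance while the mean error stays small, which is the pattern behind the abstract's conclusion that cohort means are more dependable than individual predictions.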


Bone & Joint Research
Vol. 12, Issue 9 | Pages 512 - 521
1 Sep 2023
Langenberger B Schrednitzki D Halder AM Busse R Pross CM

Aims. A substantial fraction of patients undergoing knee arthroplasty (KA) or hip arthroplasty (HA) do not achieve an improvement as high as the minimal clinically important difference (MCID), i.e. do not achieve a meaningful improvement. Using three patient-reported outcome measures (PROMs), our aim was: 1) to assess machine learning (ML), the simple pre-surgery PROM score, and logistic-regression (LR)-derived performance in their prediction of whether patients undergoing HA or KA achieve an improvement as high as or higher than a calculated MCID; and 2) to test whether ML is able to outperform LR or pre-surgery PROM scores in predictive performance. Methods. MCIDs were derived using the change difference method in a sample of 1,843 HA and 1,546 KA patients. An artificial neural network, a gradient boosting machine, least absolute shrinkage and selection operator (LASSO) regression, ridge regression, elastic net, random forest, LR, and pre-surgery PROM scores were applied to predict MCID for the following PROMs: EuroQol five-dimension, five-level questionnaire (EQ-5D-5L), EQ visual analogue scale (EQ-VAS), Hip disability and Osteoarthritis Outcome Score-Physical Function Short-form (HOOS-PS), and Knee injury and Osteoarthritis Outcome Score-Physical Function Short-form (KOOS-PS). Results. Predictive performance of the best models per outcome ranged from 0.71 for HOOS-PS to 0.84 for EQ-VAS (HA sample). ML statistically significantly outperformed LR and pre-surgery PROM scores in two out of six cases. Conclusion. MCIDs can be predicted with reasonable performance. ML was able to outperform traditional methods, although only in a minority of cases. Cite this article: Bone Joint Res 2023;12(9):512–521


Bone & Joint Research
Vol. 12, Issue 4 | Pages 245 - 255
3 Apr 2023
Ryu S So J Ha Y Kuh S Chin D Kim K Cho Y Kim K

Aims. To determine the major risk factors for unplanned reoperations (UROs) following corrective surgery for adult spinal deformity (ASD) and their interactions, using machine learning-based prediction algorithms and game theory. Methods. Patients who underwent surgery for ASD, with a minimum of two-year follow-up, were retrospectively reviewed. In total, 210 patients were included and randomly allocated into training (70% of the sample size) and test (the remaining 30%) sets to develop the machine learning algorithm. Risk factors were included in the analysis, along with clinical characteristics and parameters acquired through diagnostic radiology. Results. Overall, 152 patients without and 58 with a history of surgical revision following surgery for ASD were observed; the mean age was 68.9 years (SD 8.7) and 66.9 years (SD 6.6), respectively. On implementing a random forest model, the classification of URO events resulted in a balanced accuracy of 86.8%. Among machine learning-extracted risk factors, URO, proximal junction failure (PJF), and postoperative distance from the posterosuperior corner of C7 and the vertical axis from the centroid of C2 (SVA) were significant upon Kaplan-Meier survival analysis. Conclusion. The major risk factors for URO following surgery for ASD, i.e. postoperative SVA and PJF, and their interactions were identified using a machine learning algorithm and game theory. Clinical benefits will depend on patient risk profiles. Cite this article: Bone Joint Res 2023;12(4):245–255
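Two steps named in this abstract, the random 70%/30% allocation into training and test sets and the balanced accuracy metric, can be sketched in plain Python. This is an illustration only: the function names, the seed, and the toy labels are hypothetical, and the study's own software and random forest model are not reproduced. Balanced accuracy is taken here as the mean of sensitivity and specificity.

```python
import random


def split_indices(n, train_fraction=0.7, seed=0):
    """Randomly allocate n sample indices into training and test sets."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)  # seeded for reproducibility
    n_train = round(n * train_fraction)
    return indices[:n_train], indices[n_train:]


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = URO, 0 = no URO)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return (sensitivity + specificity) / 2


# 210 patients, as in the abstract: 147 for training and 63 for testing.
train_idx, test_idx = split_indices(210)
print(len(train_idx), len(test_idx))  # 147 63

# Hypothetical labels and predictions, not data from the study.
y_true = [1, 1, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 1]
print(balanced_accuracy(y_true, y_pred))  # 0.625
```

Balanced accuracy is a sensible choice when classes are unequal, as with 58 reoperated versus 152 non-reoperated patients here, because plain accuracy would reward a model that simply predicts the majority class.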