Aims. Machine learning (ML), a branch of artificial intelligence that uses algorithms to learn from data and make predictions, offers a pathway towards more personalized and tailored surgical treatments. This approach is particularly relevant to prevalent joint diseases such as osteoarthritis (OA). In contrast to end-stage disease, where joint arthroplasty provides excellent results, early stages of OA currently lack effective therapies to halt or reverse progression. Accurate prediction of OA progression is crucial if timely interventions are to be developed, to enhance patient care and optimize the design of clinical trials. Methods. A systematic review was conducted in accordance with PRISMA guidelines. We searched MEDLINE and Embase on 5 May 2024 for studies utilizing ML to predict OA progression. Titles and abstracts were independently screened, followed by full-text reviews for studies that met the eligibility criteria. Key information was extracted and synthesized for analysis, including types of data (such as clinical, radiological, or biochemical), definitions of OA progression, ML algorithms, validation methods, and outcome measures. Results. Out of 1,160 studies initially identified, 39 were included. Most studies (85%) were published between 2020 and 2024, with 82% using publicly available datasets, primarily the Osteoarthritis Initiative. ML methods were predominantly supervised, with significant variability in the definitions of OA progression: most studies focused on structural changes (59%), while fewer addressed pain progression or both. Deep learning was used in 44% of studies, while automated ML was used in 5%. There was a lack of standardization in evaluation metrics and limited
Despite the vast quantities of published artificial intelligence (AI) algorithms that target trauma and orthopaedic applications, very few progress to inform clinical practice. One key reason for this is the lack of a clear pathway from development to deployment. In order to assist with this process, we have developed the Clinical Practice Integration of Artificial Intelligence (CPI-AI) framework – a five-stage approach to the clinical practice adoption of AI in the setting of trauma and orthopaedics, based on the IDEAL principles ( Cite this article:
Rotator cuff tear (RCT) is the leading cause of shoulder pain, primarily associated with age-related tendon degeneration. This study aimed to elucidate the potential differential gene expressions in tendons across different age groups, and to investigate their roles in tendon degeneration. Linear regression and differential expression (DE) analyses were performed on two transcriptome profiling datasets of torn supraspinatus tendons to identify age-related genes. Subsequent functional analyses were conducted on these candidate genes to explore their potential roles in tendon ageing. Additionally, a secondary DE analysis was performed on candidate genes by comparing their expressions between lesioned and normal tendons to explore their correlations with RCTs.Aims
Methods
Aims. Precise implant positioning, tailored to individual spinopelvic biomechanics and phenotype, is paramount for stability in total hip arthroplasty (THA). Despite a few studies on instability prediction, there is a notable gap in research utilizing artificial intelligence (AI). The objective of our pilot study was to evaluate the feasibility of developing an AI algorithm tailored to individual spinopelvic mechanics and patient phenotype for predicting impingement. Methods. This international, multicentre prospective cohort study across two centres encompassed 157 adults undergoing primary robotic arm-assisted THA. Impingement during specific flexion and extension stances was identified using the virtual range of motion (ROM) tool of the robotic software. The primary AI model, the Light Gradient-Boosting Machine (LGBM), used tabular data to predict impingement presence, direction (flexion or extension), and type. A secondary model integrating tabular data with plain anteroposterior pelvis radiographs was evaluated to assess for any potential enhancement in prediction accuracy. Results. We identified nine predictors from an analysis of baseline spinopelvic characteristics and surgical planning parameters. Using fivefold cross-validation, the LGBM achieved 70.2% impingement prediction accuracy. With impingement data, the LGBM estimated direction with 85% accuracy, while the support vector machine (SVM) determined impingement type with 72.9% accuracy. After integrating imaging data with a multilayer perceptron (tabular) and a convolutional neural network (radiograph), the LGBM’s prediction was 68.1%. Both combined and LGBM-only had similar impingement direction prediction rates (around 84.5%). Conclusion. This study is a pioneering effort in leveraging AI for impingement prediction in THA, utilizing a comprehensive, real-world clinical dataset. Our machine-learning algorithm demonstrated promising accuracy in predicting impingement, its type, and direction. While the addition of imaging data to our deep-learning algorithm did not boost accuracy, the potential for refined annotations, such as landmark markings, offers avenues for future enhancement. Prior to clinical integration,
Aims. This study aimed to explore the biological and clinical importance of dysregulated key genes in osteoarthritis (OA) patients at the cartilage level to find potential biomarkers and targets for diagnosing and treating OA. Methods. Six sets of gene expression profiles were obtained from the Gene Expression Omnibus database. Differential expression analysis, weighted gene coexpression network analysis (WGCNA), and multiple machine-learning algorithms were used to screen crucial genes in osteoarthritic cartilage, and genome enrichment and functional annotation analyses were used to decipher the related categories of gene function. Single-sample gene set enrichment analysis was performed to analyze immune cell infiltration. Correlation analysis was used to explore the relationship among the hub genes and immune cells, as well as markers related to articular cartilage degradation and bone mineralization. Results. A total of 46 genes were obtained from the intersection of significantly upregulated genes in osteoarthritic cartilage and the key module genes screened by WGCNA. Functional annotation analysis revealed that these genes were closely related to pathological responses associated with OA, such as inflammation and immunity. Four key dysregulated genes (cartilage acidic protein 1 (CRTAC1), iodothyronine deiodinase 2 (DIO2), angiopoietin-related protein 2 (ANGPTL2), and MAGE family member D1 (MAGED1)) were identified after using machine-learning algorithms. These genes had high diagnostic value in both the training cohort and
Aims. Machine-learning (ML) prediction models in orthopaedic trauma hold great promise in assisting clinicians in various tasks, such as personalized risk stratification. However, an overview of current applications and critical appraisal to peer-reviewed guidelines is lacking. The objectives of this study are to 1) provide an overview of current ML prediction models in orthopaedic trauma; 2) evaluate the completeness of reporting following the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement; and 3) assess the risk of bias following the Prediction model Risk Of Bias Assessment Tool (PROBAST) tool. Methods. A systematic search screening 3,252 studies identified 45 ML-based prediction models in orthopaedic trauma up to January 2023. The TRIPOD statement assessed transparent reporting and the PROBAST tool the risk of bias. Results. A total of 40 studies reported on training and internal validation; four studies performed both development and
To map the Oxford Knee Score (OKS) and High Activity Arthroplasty Score (HAAS) items to a common scale, and to investigate the psychometric properties of this new scale for the measurement of knee health. Patient-reported outcome measure (PROM) data measuring knee health were obtained from the NHS PROMs dataset and Total or Partial Knee Arthroplasty Trial (TOPKAT). Assumptions for common scale modelling were tested. A graded response model (fitted to OKS item responses in the NHS PROMs dataset) was used as an anchor to calibrate paired HAAS items from the TOPKAT dataset. Information curves for the combined OKS-HAAS model were plotted. Bland-Altman analysis was used to compare common scale scores derived from OKS and HAAS items. A conversion table was developed to map between HAAS, OKS, and the common scale.Aims
Methods
This study aimed to develop and validate a fully automated system that quantifies proximal femoral bone mineral density (BMD) from CT images. The study analyzed 978 pairs of hip CT and dual-energy X-ray absorptiometry (DXA) measurements of the proximal femur (DXA-BMD) collected from three institutions. From the CT images, the femur and a calibration phantom were automatically segmented using previously trained deep-learning models. The Hounsfield units of each voxel were converted into density (mg/cm3). Then, a deep-learning model trained by manual landmark selection of 315 cases was developed to select the landmarks at the proximal femur to rotate the CT volume to the neutral position. Finally, the CT volume of the femur was projected onto the coronal plane, and the areal BMD of the proximal femur (CT-aBMD) was quantified. CT-aBMD correlated to DXA-BMD, and a receiver operating characteristic (ROC) analysis quantified the accuracy in diagnosing osteoporosis.Aims
Methods
The use of artificial intelligence (AI) is rapidly growing across many domains, of which the medical field is no exception. AI is an umbrella term defining the practical application of algorithms to generate useful output, without the need of human cognition. Owing to the expanding volume of patient information collected, known as ‘big data’, AI is showing promise as a useful tool in healthcare research and across all aspects of patient care pathways. Practical applications in orthopaedic surgery include: diagnostics, such as fracture recognition and tumour detection; predictive models of clinical and patient-reported outcome measures, such as calculating mortality rates and length of hospital stay; and real-time rehabilitation monitoring and surgical training. However, clinicians should remain cognizant of AI’s limitations, as the development of robust reporting and validation frameworks is of paramount importance to prevent avoidable errors and biases. The aim of this review article is to provide a comprehensive understanding of AI and its subfields, as well as to delineate its existing clinical applications in trauma and orthopaedic surgery. Furthermore, this narrative review expands upon the limitations of AI and future direction. Cite this article:
To identify variables independently associated with same-day discharge (SDD) of patients following revision total knee arthroplasty (rTKA) and to develop machine learning algorithms to predict suitable candidates for outpatient rTKA. Data were obtained from the American College of Surgeons National Quality Improvement Programme (ACS-NSQIP) database from the years 2018 to 2020. Patients with elective, unilateral rTKA procedures and a total hospital length of stay between zero and four days were included. Demographic, preoperative, and intraoperative variables were analyzed. A multivariable logistic regression (MLR) model and various machine learning techniques were compared using area under the curve (AUC), calibration, and decision curve analysis. Important and significant variables were identified from the models.Aims
Methods
To determine the major risk factors for unplanned reoperations (UROs) following corrective surgery for adult spinal deformity (ASD) and their interactions, using machine learning-based prediction algorithms and game theory. Patients who underwent surgery for ASD, with a minimum of two-year follow-up, were retrospectively reviewed. In total, 210 patients were included and randomly allocated into training (70% of the sample size) and test (the remaining 30%) sets to develop the machine learning algorithm. Risk factors were included in the analysis, along with clinical characteristics and parameters acquired through diagnostic radiology.Aims
Methods
To develop prediction models using machine-learning (ML) algorithms for 90-day and one-year mortality prediction in femoral neck fracture (FNF) patients aged 50 years or older based on the Hip fracture Evaluation with Alternatives of Total Hip arthroplasty versus Hemiarthroplasty (HEALTH) and Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trials. This study included 2,388 patients from the HEALTH and FAITH trials, with 90-day and one-year mortality proportions of 3.0% (71/2,388) and 6.4% (153/2,388), respectively. The mean age was 75.9 years (SD 10.8) and 65.9% of patients (1,574/2,388) were female. The algorithms included patient and injury characteristics. Six algorithms were developed, internally validated and evaluated across discrimination (c-statistic; discriminative ability between those with risk of mortality and those without), calibration (observed outcome compared to the predicted probability), and the Brier score (composite of discrimination and calibration).Aims
Methods
Within healthcare, several measures are used to quantify and compare the severity of health conditions. Two common measures are disability weight (DW), a context-independent value representing severity of a health state, and utility weight (UW), a context-dependent measure of health-related quality of life. Neither of these measures have previously been determined for developmental dysplasia of the hip (DDH). The aim of this study is to determine the DW and country-specific UWs for DDH. A survey was created using three different methods to estimate the DW: a preference ranking exercise, time trade-off exercise, and visual analogue scale (VAS). Participants were fully licensed orthopaedic surgeons who were contacted through national and international orthopaedic organizations. A global DW was calculated using a random effects model through an inverse-variance approach. A UW was calculated for each country as one minus the country-specific DW composed of the time trade-off exercise and VAS.Aims
Methods
Hip dysplasia (HD) leads to premature osteoarthritis. Timely detection and correction of HD has been shown to improve pain, functional status, and hip longevity. Several time-consuming radiological measurements are currently used to confirm HD. An artificial intelligence (AI) software named HIPPO automatically locates anatomical landmarks on anteroposterior pelvis radiographs and performs the needed measurements. The primary aim of this study was to assess the reliability of this tool as compared to multi-reader evaluation in clinically proven cases of adult HD. The secondary aims were to assess the time savings achieved and evaluate inter-reader assessment. A consecutive preoperative sample of 130 HD patients (256 hips) was used. This cohort included 82.3% females (n = 107) and 17.7% males (n = 23) with median patient age of 28.6 years (interquartile range (IQR) 22.5 to 37.2). Three trained readers’ measurements were compared to AI outputs of lateral centre-edge angle (LCEA), caput-collum-diaphyseal (CCD) angle, pelvic obliquity, Tönnis angle, Sharp’s angle, and femoral head coverage. Intraclass correlation coefficients (ICC) and Bland-Altman analyses were obtained.Aims
Methods
Accurate identification of the ankle joint centre is critical for estimating tibial coronal alignment in total knee arthroplasty (TKA). The purpose of the current study was to leverage artificial intelligence (AI) to determine the accuracy and effect of using different radiological anatomical landmarks to quantify mechanical alignment in relation to a traditionally defined radiological ankle centre. Patients with full-limb radiographs from the Osteoarthritis Initiative were included. A sub-cohort of 250 radiographs were annotated for landmarks relevant to knee alignment and used to train a deep learning (U-Net) workflow for angle calculation on the entire database. The radiological ankle centre was defined as the midpoint of the superior talus edge/tibial plafond. Knee alignment (hip-knee-ankle angle) was compared against 1) midpoint of the most prominent malleoli points, 2) midpoint of the soft-tissue overlying malleoli, and 3) midpoint of the soft-tissue sulcus above the malleoli.Aims
Methods
Prediction tools are instruments which are commonly used to estimate the prognosis in oncology and facilitate clinical decision-making in a more personalized manner. Their popularity is shown by the increasing numbers of prediction tools, which have been described in the medical literature. Many of these tools have been shown to be useful in the field of soft-tissue sarcoma of the extremities (eSTS). In this annotation, we aim to provide an overview of the available prediction tools for eSTS, provide an approach for clinicians to evaluate the performance and usefulness of the available tools for their own patients, and discuss their possible applications in the management of patients with an eSTS. Cite this article:
The primary aim was to estimate the cost-effectiveness of routine operative fixation for all patients with humeral shaft fractures. The secondary aim was to estimate the health economic implications of using a Radiographic Union Score for HUmeral fractures (RUSHU) of < 8 to facilitate selective fixation for patients at risk of nonunion. From 2008 to 2017, 215 patients (mean age 57 yrs (17 to 18), 61% female (n = 130/215)) with a nonoperatively managed humeral diaphyseal fracture were retrospectively identified. Union was achieved in 77% (n = 165/215) after initial nonoperative management, with 23% (n = 50/215) uniting after surgery for nonunion. The EuroQol five-dimension three-level health index (EQ-5D-3L) was obtained via postal survey. Multiple regression was used to determine the independent influence of patient, injury, and management factors upon the EQ-5D-3L. An incremental cost-effectiveness ratio (ICER) of < £20,000 per quality-adjusted life-year (QALY) gained was considered cost-effective.Aims
Methods
The aims of this study were to assess mapping models to predict the three-level version of EuroQoL five-dimension utility index (EQ-5D-3L) from the Oxford Knee Score (OKS) and validate these before and after total knee arthroplasty (TKA). A retrospective cohort of 5,857 patients was used to create the prediction models, and a second cohort of 721 patients from a different centre was used to validate the models, all of whom underwent TKA. Patient characteristics, BMI, OKS, and EQ-5D-3L were collected preoperatively and one year postoperatively. Generalized linear regression was used to formulate the prediction models.Aims
Methods
The aim of this study was to develop and internally validate a prognostic nomogram to predict the probability of gaining a functional range of motion (ROM ≥ 120°) after open arthrolysis of the elbow in patients with post-traumatic stiffness of the elbow. We developed the Shanghai Prediction Model for Elbow Stiffness Surgical Outcome (SPESSO) based on a dataset of 551 patients who underwent open arthrolysis of the elbow in four institutions. Demographic and clinical characteristics were collected from medical records. The least absolute shrinkage and selection operator regression model was used to optimize the selection of relevant features. Multivariable logistic regression analysis was used to build the SPESSO. Its prediction performance was evaluated using the concordance index (C-index) and a calibration graph. Internal validation was conducted using bootstrapping validation.Aims
Methods
The aim of this study was to review the current evidence surrounding curve type and morphology on curve progression risk in adolescent idiopathic scoliosis (AIS). A comprehensive search was conducted by two independent reviewers on PubMed, Embase, Medline, and Web of Science to obtain all published information on morphological predictors of AIS progression. Search items included ‘adolescent idiopathic scoliosis’, ‘progression’, and ‘imaging’. The inclusion and exclusion criteria were carefully defined. Risk of bias of studies was assessed with the Quality in Prognostic Studies tool, and level of evidence for each predictor was rated with the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) approach. In all, 6,286 publications were identified with 3,598 being subjected to secondary scrutiny. Ultimately, 26 publications (25 datasets) were included in this review.Aims
Methods