Aims. Our aim was to develop and validate nomograms that would predict the cumulative incidence of sarcoma-specific death (CISSD) and disease progression (CIDP) in patients with localized high-grade primary central and dedifferentiated chondrosarcoma. Methods. The study population consisted of 391 patients from two international sarcoma centres (development cohort) who had undergone definitive surgery for a localized high-grade (histological grade II or III) conventional primary central chondrosarcoma or dedifferentiated chondrosarcoma. Disease progression captured the first event of either metastasis or local recurrence. An independent cohort of 221 patients from three additional hospitals was used for
Aims. The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing.
Aims. To develop and externally validate a parsimonious statistical prediction model of 90-day mortality after elective total hip arthroplasty (THA), and to provide a web calculator for clinical usage. Methods. We included 53,099 patients with cemented THA due to osteoarthritis from the Swedish Hip Arthroplasty Registry for model derivation and internal validation, as well as 125,428 patients from England and Wales recorded in the National Joint Register for England, Wales, Northern Ireland, the Isle of Man, and the States of Guernsey (NJR) for
Aims. The purpose of this study was to develop a convolutional neural network (CNN) for fracture detection, classification, and identification of greater tuberosity displacement ≥ 1 cm, neck-shaft angle (NSA) ≤ 100°, shaft translation, and articular fracture involvement, on plain radiographs. Methods. The CNN was trained and tested on radiographs sourced from 11 hospitals in Australia and externally validated on radiographs from the Netherlands. Each radiograph was paired with corresponding CT scans to serve as the reference standard based on dual independent evaluation by trained researchers and attending orthopaedic surgeons. Presence of a fracture, classification (non- to minimally displaced; two-part, multipart, and glenohumeral dislocation), and four characteristics were determined on 2D and 3D CT scans and subsequently allocated to each series of radiographs. Fracture characteristics included greater tuberosity displacement ≥ 1 cm, NSA ≤ 100°, shaft translation (0% to < 75%, 75% to 95%, > 95%), and the extent of articular involvement (0% to < 15%, 15% to 35%, or > 35%).
Aims. To examine whether natural language processing (NLP) using a clinically based large language model (LLM) could be used to predict patient selection for total hip or total knee arthroplasty (THA/TKA) from routinely available free-text radiology reports. Methods. Data pre-processing and analyses were conducted according to the Artificial intelligence to Revolutionize the patient Care pathway in Hip and knEe aRthroplastY (ARCHERY) project protocol. This included use of de-identified Scottish regional clinical data of patients referred for consideration of THA/TKA, held in a secure data environment designed for artificial intelligence (AI) inference. Only preoperative radiology reports were included. NLP algorithms were based on the freely available GatorTron model, an LLM trained on over 82 billion words of de-identified clinical text. Two inference tasks were performed: assessment after model fine-tuning (50 epochs and three cycles of k-fold cross-validation), and
Aims. Machine-learning (ML) prediction models in orthopaedic trauma hold great promise in assisting clinicians in various tasks, such as personalized risk stratification. However, an overview of current applications and a critical appraisal against peer-reviewed guidelines are lacking. The objectives of this study are to 1) provide an overview of current ML prediction models in orthopaedic trauma; 2) evaluate the completeness of reporting following the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement; and 3) assess the risk of bias following the Prediction model Risk Of Bias Assessment Tool (PROBAST). Methods. A systematic search screening 3,252 studies identified 45 ML-based prediction models in orthopaedic trauma up to January 2023. Transparent reporting was assessed using the TRIPOD statement, and risk of bias using the PROBAST tool. Results. A total of 40 studies reported on training and internal validation; four studies performed both development and
Aims. The aim of this study was to identify factors associated with five-year cancer-related mortality in patients with limb and trunk soft-tissue sarcoma (STS) and develop and validate machine learning algorithms in order to predict five-year cancer-related mortality in these patients. Methods. Demographic, clinicopathological, and treatment variables of limb and trunk STS patients in the Surveillance, Epidemiology, and End Results Program (SEER) database from 2004 to 2017 were analyzed. Multivariable logistic regression was used to determine factors significantly associated with five-year cancer-related mortality. Various machine learning models were developed and compared using area under the curve (AUC), calibration, and decision curve analysis. The model that performed best on the SEER testing data was further assessed to determine the variables most important in its predictive capacity. This model was externally validated using our institutional dataset. Results. A total of 13,646 patients with STS from the SEER database were included, of whom 35.9% experienced five-year cancer-related mortality. The random forest model performed the best overall and identified tumour size as the most important variable when predicting mortality in patients with STS, followed by M stage, histological subtype, age, and surgical excision. Each variable was significant in logistic regression.
Aims. This study aimed to explore the biological and clinical importance of dysregulated key genes in osteoarthritis (OA) patients at the cartilage level to find potential biomarkers and targets for diagnosing and treating OA. Methods. Six sets of gene expression profiles were obtained from the Gene Expression Omnibus database. Differential expression analysis, weighted gene coexpression network analysis (WGCNA), and multiple machine-learning algorithms were used to screen crucial genes in osteoarthritic cartilage, and genome enrichment and functional annotation analyses were used to decipher the related categories of gene function. Single-sample gene set enrichment analysis was performed to analyze immune cell infiltration. Correlation analysis was used to explore the relationship among the hub genes and immune cells, as well as markers related to articular cartilage degradation and bone mineralization. Results. A total of 46 genes were obtained from the intersection of significantly upregulated genes in osteoarthritic cartilage and the key module genes screened by WGCNA. Functional annotation analysis revealed that these genes were closely related to pathological responses associated with OA, such as inflammation and immunity. Four key dysregulated genes (cartilage acidic protein 1 (CRTAC1), iodothyronine deiodinase 2 (DIO2), angiopoietin-related protein 2 (ANGPTL2), and MAGE family member D1 (MAGED1)) were identified after using machine-learning algorithms. These genes had high diagnostic value in both the training cohort and
Aims. Machine learning (ML), a branch of artificial intelligence that uses algorithms to learn from data and make predictions, offers a pathway towards more personalized and tailored surgical treatments. This approach is particularly relevant to prevalent joint diseases such as osteoarthritis (OA). In contrast to end-stage disease, where joint arthroplasty provides excellent results, early stages of OA currently lack effective therapies to halt or reverse progression. Accurate prediction of OA progression is crucial if timely interventions are to be developed, to enhance patient care and optimize the design of clinical trials. Methods. A systematic review was conducted in accordance with PRISMA guidelines. We searched MEDLINE and Embase on 5 May 2024 for studies utilizing ML to predict OA progression. Titles and abstracts were independently screened, followed by full-text reviews for studies that met the eligibility criteria. Key information was extracted and synthesized for analysis, including types of data (such as clinical, radiological, or biochemical), definitions of OA progression, ML algorithms, validation methods, and outcome measures. Results. Out of 1,160 studies initially identified, 39 were included. Most studies (85%) were published between 2020 and 2024, with 82% using publicly available datasets, primarily the Osteoarthritis Initiative. ML methods were predominantly supervised, with significant variability in the definitions of OA progression: most studies focused on structural changes (59%), while fewer addressed pain progression or both. Deep learning was used in 44% of studies, while automated ML was used in 5%. There was a lack of standardization in evaluation metrics and limited
Aims. Precise implant positioning, tailored to individual spinopelvic biomechanics and phenotype, is paramount for stability in total hip arthroplasty (THA). Despite a few studies on instability prediction, there is a notable gap in research utilizing artificial intelligence (AI). The objective of our pilot study was to evaluate the feasibility of developing an AI algorithm tailored to individual spinopelvic mechanics and patient phenotype for predicting impingement. Methods. This international, multicentre prospective cohort study across two centres encompassed 157 adults undergoing primary robotic arm-assisted THA. Impingement during specific flexion and extension stances was identified using the virtual range of motion (ROM) tool of the robotic software. The primary AI model, the Light Gradient-Boosting Machine (LGBM), used tabular data to predict impingement presence, direction (flexion or extension), and type. A secondary model integrating tabular data with plain anteroposterior pelvis radiographs was evaluated to assess for any potential enhancement in prediction accuracy. Results. We identified nine predictors from an analysis of baseline spinopelvic characteristics and surgical planning parameters. Using fivefold cross-validation, the LGBM achieved 70.2% impingement prediction accuracy. With impingement data, the LGBM estimated direction with 85% accuracy, while the support vector machine (SVM) determined impingement type with 72.9% accuracy. After integrating imaging data with a multilayer perceptron (tabular) and a convolutional neural network (radiograph), the LGBM’s prediction was 68.1%. Both combined and LGBM-only had similar impingement direction prediction rates (around 84.5%). Conclusion. This study is a pioneering effort in leveraging AI for impingement prediction in THA, utilizing a comprehensive, real-world clinical dataset. Our machine-learning algorithm demonstrated promising accuracy in predicting impingement, its type, and direction. While the addition of imaging data to our deep-learning algorithm did not boost accuracy, the potential for refined annotations, such as landmark markings, offers avenues for future enhancement. Prior to clinical integration,
Aims. The primary aim of this study was to develop a reliable, effective radiological score to assess the healing of humeral shaft fractures, the Radiographic Union Score for HUmeral fractures (RUSHU). The secondary aim was to assess whether the six-week RUSHU was predictive of nonunion at six months after the injury. Patients and Methods. Initially, 20 patients with radiographs six weeks following a humeral shaft fracture were selected at random from a trauma database and scored by three observers, based on the Radiographic Union Scale for Tibial fractures system. After refinement of the RUSHU criteria, a second group of 60 patients with radiographs six weeks after injury, 40 with fractures that united and 20 with fractures that developed nonunion, were scored by two blinded observers. Results. After refinement, the interobserver intraclass correlation coefficient (ICC) was 0.79 (95% confidence interval (CI) 0.67 to 0.87), indicating substantial agreement. At six weeks after injury, patients whose fractures united had a significantly higher median score than those who developed nonunion (10 vs 7; p < 0.001). A receiver operating characteristic curve determined that a RUSHU cut-off of < 8 was predictive of nonunion (area under the curve = 0.84, 95% CI 0.74 to 0.94). The sensitivity was 75% and specificity 80% with a positive predictive value (PPV) of 65% and a negative predictive value of 86%. Patients with a RUSHU < 8 (n = 23) were more likely to develop nonunion than those with a RUSHU ≥ 8 (n = 37, odds ratio 12.0, 95% CI 3.4 to 42.9). Based on a PPV of 65%, if all patients with a RUSHU < 8 underwent fixation, the number of procedures needed to avoid one nonunion would be 1.5. Conclusion. The RUSHU is reliable and effective in identifying patients at risk of nonunion of a humeral shaft fracture at six weeks after injury. This tool requires
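The accuracy figures reported for the RUSHU cut-off can be cross-checked against a single 2×2 table. The sketch below is illustrative only: the four cell counts are not given in the abstract and are reconstructed here from the reported totals (60 patients, 20 nonunions, 23 patients with RUSHU < 8, sensitivity 75%), and the function name `diagnostic_metrics` is hypothetical. The last step treats the reciprocal of the PPV as the number of procedures per nonunion avoided, which assumes that fixation prevents every nonunion it is applied to.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return standard accuracy measures for a 2x2 diagnostic table."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "odds_ratio": (tp * tn) / (fp * fn),
    }

# Assumed cell counts, reconstructed from the reported totals:
# 20 nonunions at 75% sensitivity -> 15 with RUSHU < 8, 5 with RUSHU >= 8;
# 23 patients with RUSHU < 8 -> 8 of them united;
# 40 united fractures -> 32 with RUSHU >= 8.
m = diagnostic_metrics(tp=15, fp=8, fn=5, tn=32)

# Reciprocal of the PPV: procedures needed to avoid one nonunion,
# assuming fixation always prevents nonunion.
procedures_per_nonunion_avoided = 1 / m["ppv"]

for name, value in m.items():
    print(f"{name}: {value:.2f}")
print(f"procedures per nonunion avoided: {procedures_per_nonunion_avoided:.2f}")
```

Under these assumed counts the sketch reproduces the reported sensitivity (75%), specificity (80%), PPV (65%), NPV (86%), and odds ratio (12.0).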
Despite the vast quantities of published artificial intelligence (AI) algorithms that target trauma and orthopaedic applications, very few progress to inform clinical practice. One key reason for this is the lack of a clear pathway from development to deployment. In order to assist with this process, we have developed the Clinical Practice Integration of Artificial Intelligence (CPI-AI) framework – a five-stage approach to the clinical practice adoption of AI in the setting of trauma and orthopaedics, based on the IDEAL principles.
Aims. This study aimed to develop and validate a fully automated system that quantifies proximal femoral bone mineral density (BMD) from CT images. Methods. The study analyzed 978 pairs of hip CT and dual-energy X-ray absorptiometry (DXA) measurements of the proximal femur (DXA-BMD) collected from three institutions. From the CT images, the femur and a calibration phantom were automatically segmented using previously trained deep-learning models. The Hounsfield units of each voxel were converted into density (mg/cm³). Then, a deep-learning model trained by manual landmark selection of 315 cases was developed to select the landmarks at the proximal femur to rotate the CT volume to the neutral position. Finally, the CT volume of the femur was projected onto the coronal plane, and the areal BMD of the proximal femur (CT-aBMD) was quantified. CT-aBMD correlated to DXA-BMD, and a receiver operating characteristic (ROC) analysis quantified the accuracy in diagnosing osteoporosis.
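The two numerical steps in this pipeline, converting Hounsfield units to density and projecting the volume to an areal value, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a linear phantom calibration fitted by least squares, a femur mask supplied as nested lists, and projection by summing density along the anteroposterior axis. The function names, the `[row][col][depth]` layout, and the toy numbers are all hypothetical.

```python
def fit_linear_calibration(phantom_hu, phantom_density):
    """Least-squares line: density (mg/cm^3) = slope * HU + intercept."""
    n = len(phantom_hu)
    mean_x = sum(phantom_hu) / n
    mean_y = sum(phantom_density) / n
    sxx = sum((x - mean_x) ** 2 for x in phantom_hu)
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(phantom_hu, phantom_density))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def areal_density(volume_hu, mask, slope, intercept, voxel_depth_cm):
    """Project a [row][col][depth] volume along depth.

    Returns the mean areal density (mg/cm^2) over projected pixels
    that contain at least one masked (bone) voxel.
    """
    pixel_values = []
    for hu_rows, mask_rows in zip(volume_hu, mask):
        for hu_ray, mask_ray in zip(hu_rows, mask_rows):
            if not any(mask_ray):
                continue  # no bone along this ray
            # mg/cm^3 times cm of path length gives mg/cm^2
            total = sum((slope * hu + intercept) * voxel_depth_cm
                        for hu, inside in zip(hu_ray, mask_ray) if inside)
            pixel_values.append(total)
    return sum(pixel_values) / len(pixel_values)


# Hypothetical phantom rods: measured HU and known densities.
slope, intercept = fit_linear_calibration([0, 100, 200], [0.0, 100.0, 200.0])

# Toy 1 x 2 x 3 volume; only the first ray passes through bone.
volume = [[[100, 200, 300], [50, 50, 50]]]
bone_mask = [[[1, 1, 0], [0, 0, 0]]]
ct_abmd = areal_density(volume, bone_mask, slope, intercept, voxel_depth_cm=0.1)
print(ct_abmd)
```

Summing along the projection axis is what turns a volumetric density into an areal one, which is why the result is comparable in units to a DXA measurement.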
Objectives. To define Patient Acceptable Symptom State (PASS) thresholds for the Oxford hip score (OHS) and Oxford knee score (OKS) at mid-term follow-up. Methods. In a prospective multicentre cohort study, OHS and OKS were collected at a mean follow-up of three years (1.5 to 6.0), combined with a numeric rating scale (NRS) for satisfaction and an external validation question assessing the patient’s willingness to undergo surgery again. A total of 550 patients underwent total hip replacement (THR) and 367 underwent total knee replacement (TKR). Results. Receiver operating characteristic (ROC) curves identified a PASS threshold of 42 for the OHS after THR and 37 for the OKS after TKR. THR patients with an OHS ≥ 42 and TKR patients with an OKS ≥ 37 had a higher NRS for satisfaction and a greater likelihood of being willing to undergo surgery again. Conclusions. PASS thresholds appear larger at mid-term follow-up than at six months after surgery. Without
Aims. To develop prediction models using machine-learning (ML) algorithms for 90-day and one-year mortality prediction in femoral neck fracture (FNF) patients aged 50 years or older based on the Hip fracture Evaluation with Alternatives of Total Hip arthroplasty versus Hemiarthroplasty (HEALTH) and Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trials. Methods. This study included 2,388 patients from the HEALTH and FAITH trials, with 90-day and one-year mortality proportions of 3.0% (71/2,388) and 6.4% (153/2,388), respectively. The mean age was 75.9 years (SD 10.8) and 65.9% of patients (1,574/2,388) were female. The algorithms included patient and injury characteristics. Six algorithms were developed, internally validated and evaluated across discrimination (c-statistic; discriminative ability between those with risk of mortality and those without), calibration (observed outcome compared to the predicted probability), and the Brier score (composite of discrimination and calibration).
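The three evaluation measures named above can be illustrated on a toy example. The sketch below is not taken from the study: the outcome labels and predicted probabilities are invented, the function names are hypothetical, and calibration is reduced to its simplest form (mean predicted risk compared with the observed event rate) instead of a full calibration curve.

```python
def c_statistic(y_true, y_prob):
    """Share of (event, non-event) pairs in which the event case
    received the higher predicted risk; ties count as one half."""
    events = [p for y, p in zip(y_true, y_prob) if y == 1]
    non_events = [p for y, p in zip(y_true, y_prob) if y == 0]
    concordant = 0.0
    for p_event in events:
        for p_non in non_events:
            if p_event > p_non:
                concordant += 1.0
            elif p_event == p_non:
                concordant += 0.5
    return concordant / (len(events) * len(non_events))


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted risk and outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def calibration_in_the_large(y_true, y_prob):
    """Return (observed event rate, mean predicted risk)."""
    return sum(y_true) / len(y_true), sum(y_prob) / len(y_prob)


# Invented toy data: 1 = died within the follow-up window, 0 = survived.
y_true = [0, 0, 1, 0, 1, 0]
y_prob = [0.05, 0.10, 0.40, 0.20, 0.15, 0.02]

c_stat = c_statistic(y_true, y_prob)
brier = brier_score(y_true, y_prob)
observed, predicted = calibration_in_the_large(y_true, y_prob)
print(f"c-statistic: {c_stat:.3f}")
print(f"Brier score: {brier:.3f}")
print(f"observed rate {observed:.3f} vs mean predicted {predicted:.3f}")
```

In this toy case the model ranks patients well (c-statistic 0.875) but underestimates risk overall, which shows why discrimination and calibration are reported separately.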
Literature surrounding artificial intelligence (AI)-related applications for hip and knee arthroplasty has proliferated. However, meaningful advances that fundamentally transform the practice and delivery of joint arthroplasty are yet to be realized, despite the broad range of applications as we continue to search for meaningful and appropriate use of AI. AI literature in hip and knee arthroplasty between 2018 and 2021 regarding image-based analyses, value-based care, remote patient monitoring, and augmented reality was reviewed. Concerns surrounding meaningful use and appropriate methodological approaches of AI in joint arthroplasty research are summarized. Of the 233 AI-related orthopaedics articles published, 178 (76%) constituted original research, while the rest consisted of editorials or reviews. A total of 52% of original AI-related research concerns hip and knee arthroplasty (n = 92), and a narrative review is described. Three studies were externally validated. Pitfalls surrounding present-day research include conflating vernacular (“AI/machine learning”), repackaging limited registry data, prematurely releasing internally validated prediction models, appraising model architecture instead of inputted data, withholding code, and evaluating studies using antiquated regression-based guidelines. While AI has been applied to a variety of hip and knee arthroplasty applications with limited clinical impact, the future remains promising if the question is meaningful, the methodology is rigorous and transparent, the data are rich, and the model is externally validated. Simple checkpoints for meaningful AI adoption include ensuring applications focus on: administrative support over clinical evaluation and management; necessity of the advanced model; and the novelty of the question being answered.
Aims. To determine the major risk factors for unplanned reoperations (UROs) following corrective surgery for adult spinal deformity (ASD) and their interactions, using machine learning-based prediction algorithms and game theory. Methods. Patients who underwent surgery for ASD, with a minimum of two-year follow-up, were retrospectively reviewed. In total, 210 patients were included and randomly allocated into training (70% of the sample size) and test (the remaining 30%) sets to develop the machine learning algorithm. Risk factors were included in the analysis, along with clinical characteristics and parameters acquired through diagnostic radiology.
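The random 70/30 allocation described above can be sketched in a few lines. This is an illustration only: the abstract does not say how the randomization was performed, so the seeded shuffle, the function name `split_train_test`, and the integer patient identifiers are assumptions.

```python
import random


def split_train_test(ids, train_fraction=0.7, seed=0):
    """Shuffle the identifiers reproducibly, then cut at the training fraction."""
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


patient_ids = list(range(210))  # placeholder identifiers for 210 patients
train_ids, test_ids = split_train_test(patient_ids)
print(len(train_ids), len(test_ids))  # 147 63
```

With 210 patients this gives 147 for training and 63 for testing; fixing the seed keeps the allocation reproducible between runs.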
Prediction tools are commonly used in oncology to estimate prognosis and to support more personalized clinical decision-making. Their popularity is shown by the increasing number of prediction tools described in the medical literature. Many of these tools have been shown to be useful in the field of soft-tissue sarcoma of the extremities (eSTS). In this annotation, we aim to provide an overview of the available prediction tools for eSTS, provide an approach for clinicians to evaluate the performance and usefulness of the available tools for their own patients, and discuss their possible applications in the management of patients with an eSTS.
The October 2023 Hip & Pelvis Roundup360 looks at: Femoroacetabular impingement syndrome at ten years – how do athletes do?; Venous thromboembolism in patients following total joint replacement: are transfusions to blame?; What changes in pelvic sagittal tilt occur 20 years after total hip arthroplasty?; Can stratified care in hip arthroscopy predict successful and unsuccessful outcomes?; Hip replacement into your nineties; Can large language models help with follow-up?; The most taxing of revisions – proximal femoral replacement for periprosthetic joint infection – what’s the benefit of dual mobility?
Aims. The risk factors for recurrent instability (RI) following a primary traumatic anterior shoulder dislocation (PTASD) remain unclear. In this study, we aimed to determine the rate of RI in a large cohort of patients managed nonoperatively after PTASD and to develop a clinical prediction model. Methods. A total of 1,293 patients with PTASD managed nonoperatively were identified from a trauma database (mean age 23.3 years (15 to 35); 14.3% female). We assessed the prevalence of RI, and used multivariate regression modelling to evaluate which demographic- and injury-related factors were independently predictive for its occurrence.