Aims. Our aim was to develop and validate nomograms that would predict the cumulative incidence of sarcoma-specific death (CISSD) and disease progression (CIDP) in patients with localized high-grade primary central and dedifferentiated chondrosarcoma. Methods. The study population consisted of 391 patients from two international sarcoma centres (development cohort) who had undergone definitive surgery for a localized high-grade (histological grade II or III) conventional primary central chondrosarcoma or dedifferentiated chondrosarcoma. Disease progression captured the first event of either metastasis or local recurrence. An independent cohort of 221 patients from three additional hospitals was used for
Aims. The number of convolutional neural networks (CNN) available for fracture detection and classification is rapidly increasing.
Aims. To develop and externally validate a parsimonious statistical prediction model of 90-day mortality after elective total hip arthroplasty (THA), and to provide a web calculator for clinical usage. Methods. We included 53,099 patients with cemented THA due to osteoarthritis from the Swedish Hip Arthroplasty Registry for model derivation and internal validation, as well as 125,428 patients from England and Wales recorded in the National Joint Register for England, Wales, Northern Ireland, the Isle of Man, and the States of Guernsey (NJR) for
The purpose of this study was to develop a convolutional neural network (CNN) for fracture detection, classification, and identification of greater tuberosity displacement ≥ 1 cm, neck-shaft angle (NSA) ≤ 100°, shaft translation, and articular fracture involvement, on plain radiographs. The CNN was trained and tested on radiographs sourced from 11 hospitals in Australia and externally validated on radiographs from the Netherlands. Each radiograph was paired with corresponding CT scans to serve as the reference standard based on dual independent evaluation by trained researchers and attending orthopaedic surgeons. Presence of a fracture, classification (non- to minimally displaced; two-part, multipart, and glenohumeral dislocation), and four characteristics were determined on 2D and 3D CT scans and subsequently allocated to each series of radiographs. Fracture characteristics included greater tuberosity displacement ≥ 1 cm, NSA ≤ 100°, shaft translation (0% to < 75%, 75% to 95%, > 95%), and the extent of articular involvement (0% to < 15%, 15% to 35%, or > 35%).Aims
Methods
Aims. To examine whether natural language processing (NLP) using a clinically based large language model (LLM) could be used to predict patient selection for total hip or total knee arthroplasty (THA/TKA) from routinely available free-text radiology reports. Methods. Data pre-processing and analyses were conducted according to the Artificial intelligence to Revolutionize the patient Care pathway in Hip and knEe aRthroplastY (ARCHERY) project protocol. This included use of de-identified Scottish regional clinical data of patients referred for consideration of THA/TKA, held in a secure data environment designed for artificial intelligence (AI) inference. Only preoperative radiology reports were included. NLP algorithms were based on the freely available GatorTron model, a LLM trained on over 82 billion words of de-identified clinical text. Two inference tasks were performed: assessment after model-fine tuning (50 Epochs and three cycles of k-fold cross validation), and
Aims. Machine-learning (ML) prediction models in orthopaedic trauma hold great promise in assisting clinicians in various tasks, such as personalized risk stratification. However, an overview of current applications and critical appraisal to peer-reviewed guidelines is lacking. The objectives of this study are to 1) provide an overview of current ML prediction models in orthopaedic trauma; 2) evaluate the completeness of reporting following the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement; and 3) assess the risk of bias following the Prediction model Risk Of Bias Assessment Tool (PROBAST) tool. Methods. A systematic search screening 3,252 studies identified 45 ML-based prediction models in orthopaedic trauma up to January 2023. The TRIPOD statement assessed transparent reporting and the PROBAST tool the risk of bias. Results. A total of 40 studies reported on training and internal validation; four studies performed both development and
Aims. The aim of this study was to identify factors associated with five-year cancer-related mortality in patients with limb and trunk soft-tissue sarcoma (STS) and develop and validate machine learning algorithms in order to predict five-year cancer-related mortality in these patients. Methods. Demographic, clinicopathological, and treatment variables of limb and trunk STS patients in the Surveillance, Epidemiology, and End Results Program (SEER) database from 2004 to 2017 were analyzed. Multivariable logistic regression was used to determine factors significantly associated with five-year cancer-related mortality. Various machine learning models were developed and compared using area under the curve (AUC), calibration, and decision curve analysis. The model that performed best on the SEER testing data was further assessed to determine the variables most important in its predictive capacity. This model was externally validated using our institutional dataset. Results. A total of 13,646 patients with STS from the SEER database were included, of whom 35.9% experienced five-year cancer-related mortality. The random forest model performed the best overall and identified tumour size as the most important variable when predicting mortality in patients with STS, followed by M stage, histological subtype, age, and surgical excision. Each variable was significant in logistic regression.
Aims. This study aimed to explore the biological and clinical importance of dysregulated key genes in osteoarthritis (OA) patients at the cartilage level to find potential biomarkers and targets for diagnosing and treating OA. Methods. Six sets of gene expression profiles were obtained from the Gene Expression Omnibus database. Differential expression analysis, weighted gene coexpression network analysis (WGCNA), and multiple machine-learning algorithms were used to screen crucial genes in osteoarthritic cartilage, and genome enrichment and functional annotation analyses were used to decipher the related categories of gene function. Single-sample gene set enrichment analysis was performed to analyze immune cell infiltration. Correlation analysis was used to explore the relationship among the hub genes and immune cells, as well as markers related to articular cartilage degradation and bone mineralization. Results. A total of 46 genes were obtained from the intersection of significantly upregulated genes in osteoarthritic cartilage and the key module genes screened by WGCNA. Functional annotation analysis revealed that these genes were closely related to pathological responses associated with OA, such as inflammation and immunity. Four key dysregulated genes (cartilage acidic protein 1 (CRTAC1), iodothyronine deiodinase 2 (DIO2), angiopoietin-related protein 2 (ANGPTL2), and MAGE family member D1 (MAGED1)) were identified after using machine-learning algorithms. These genes had high diagnostic value in both the training cohort and
Aims. Machine learning (ML), a branch of artificial intelligence that uses algorithms to learn from data and make predictions, offers a pathway towards more personalized and tailored surgical treatments. This approach is particularly relevant to prevalent joint diseases such as osteoarthritis (OA). In contrast to end-stage disease, where joint arthroplasty provides excellent results, early stages of OA currently lack effective therapies to halt or reverse progression. Accurate prediction of OA progression is crucial if timely interventions are to be developed, to enhance patient care and optimize the design of clinical trials. Methods. A systematic review was conducted in accordance with PRISMA guidelines. We searched MEDLINE and Embase on 5 May 2024 for studies utilizing ML to predict OA progression. Titles and abstracts were independently screened, followed by full-text reviews for studies that met the eligibility criteria. Key information was extracted and synthesized for analysis, including types of data (such as clinical, radiological, or biochemical), definitions of OA progression, ML algorithms, validation methods, and outcome measures. Results. Out of 1,160 studies initially identified, 39 were included. Most studies (85%) were published between 2020 and 2024, with 82% using publicly available datasets, primarily the Osteoarthritis Initiative. ML methods were predominantly supervised, with significant variability in the definitions of OA progression: most studies focused on structural changes (59%), while fewer addressed pain progression or both. Deep learning was used in 44% of studies, while automated ML was used in 5%. There was a lack of standardization in evaluation metrics and limited
Aims. Precise implant positioning, tailored to individual spinopelvic biomechanics and phenotype, is paramount for stability in total hip arthroplasty (THA). Despite a few studies on instability prediction, there is a notable gap in research utilizing artificial intelligence (AI). The objective of our pilot study was to evaluate the feasibility of developing an AI algorithm tailored to individual spinopelvic mechanics and patient phenotype for predicting impingement. Methods. This international, multicentre prospective cohort study across two centres encompassed 157 adults undergoing primary robotic arm-assisted THA. Impingement during specific flexion and extension stances was identified using the virtual range of motion (ROM) tool of the robotic software. The primary AI model, the Light Gradient-Boosting Machine (LGBM), used tabular data to predict impingement presence, direction (flexion or extension), and type. A secondary model integrating tabular data with plain anteroposterior pelvis radiographs was evaluated to assess for any potential enhancement in prediction accuracy. Results. We identified nine predictors from an analysis of baseline spinopelvic characteristics and surgical planning parameters. Using fivefold cross-validation, the LGBM achieved 70.2% impingement prediction accuracy. With impingement data, the LGBM estimated direction with 85% accuracy, while the support vector machine (SVM) determined impingement type with 72.9% accuracy. After integrating imaging data with a multilayer perceptron (tabular) and a convolutional neural network (radiograph), the LGBM’s prediction was 68.1%. Both combined and LGBM-only had similar impingement direction prediction rates (around 84.5%). Conclusion. This study is a pioneering effort in leveraging AI for impingement prediction in THA, utilizing a comprehensive, real-world clinical dataset. Our machine-learning algorithm demonstrated promising accuracy in predicting impingement, its type, and direction. While the addition of imaging data to our deep-learning algorithm did not boost accuracy, the potential for refined annotations, such as landmark markings, offers avenues for future enhancement. Prior to clinical integration,
Aims. The primary aim of this study was to develop a reliable, effective radiological score to assess the healing of humeral shaft fractures, the Radiographic Union Score for HUmeral fractures (RUSHU). The secondary aim was to assess whether the six-week RUSHU was predictive of nonunion at six months after the injury. Patients and Methods. Initially, 20 patients with radiographs six weeks following a humeral shaft fracture were selected at random from a trauma database and scored by three observers, based on the Radiographic Union Scale for Tibial fractures system. After refinement of the RUSHU criteria, a second group of 60 patients with radiographs six weeks after injury, 40 with fractures that united and 20 with fractures that developed nonunion, were scored by two blinded observers. Results. After refinement, the interobserver intraclass correlation coefficient (ICC) was 0.79 (95% confidence interval (CI) 0.67 to 0.87), indicating substantial agreement. At six weeks after injury, patients whose fractures united had a significantly higher median score than those who developed nonunion (10 vs 7; p < 0.001). A receiver operating characteristic curve determined that a RUSHU cut-off of < 8 was predictive of nonunion (area under the curve = 0.84, 95% CI 0.74 to 0.94). The sensitivity was 75% and specificity 80% with a positive predictive value (PPV) of 65% and a negative predictive value of 86%. Patients with a RUSHU < 8 (n = 23) were more likely to develop nonunion than those with a RUSHU ≥ 8 (n = 37, odds ratio 12.0, 95% CI 3.4 to 42.9). Based on a PPV of 65%, if all patients with a RUSHU < 8 underwent fixation, the number of procedures needed to avoid one nonunion would be 1.5. Conclusion. The RUSHU is reliable and effective in identifying patients at risk of nonunion of a humeral shaft fracture at six weeks after injury. This tool requires
Despite the vast quantities of published artificial intelligence (AI) algorithms that target trauma and orthopaedic applications, very few progress to inform clinical practice. One key reason for this is the lack of a clear pathway from development to deployment. In order to assist with this process, we have developed the Clinical Practice Integration of Artificial Intelligence (CPI-AI) framework – a five-stage approach to the clinical practice adoption of AI in the setting of trauma and orthopaedics, based on the IDEAL principles ( Cite this article:
This study aimed to develop and validate a fully automated system that quantifies proximal femoral bone mineral density (BMD) from CT images. The study analyzed 978 pairs of hip CT and dual-energy X-ray absorptiometry (DXA) measurements of the proximal femur (DXA-BMD) collected from three institutions. From the CT images, the femur and a calibration phantom were automatically segmented using previously trained deep-learning models. The Hounsfield units of each voxel were converted into density (mg/cm3). Then, a deep-learning model trained by manual landmark selection of 315 cases was developed to select the landmarks at the proximal femur to rotate the CT volume to the neutral position. Finally, the CT volume of the femur was projected onto the coronal plane, and the areal BMD of the proximal femur (CT-aBMD) was quantified. CT-aBMD correlated to DXA-BMD, and a receiver operating characteristic (ROC) analysis quantified the accuracy in diagnosing osteoporosis.Aims
Methods
Objectives. To define Patient Acceptable Symptom State (PASS) thresholds
for the Oxford hip score (OHS) and Oxford knee score (OKS) at mid-term
follow-up. Methods. In a prospective multicentre cohort study, OHS and OKS were collected
at a mean follow-up of three years (1.5 to 6.0), combined with a
numeric rating scale (NRS) for satisfaction and an external validation
question assessing the patient’s willingness to undergo surgery
again. A total of 550 patients underwent total hip replacement (THR)
and 367 underwent total knee replacement (TKR). Results. Receiver operating characteristic (ROC) curves identified a PASS
threshold of 42 for the OHS after THR and 37 for the OKS after TKR.
THR patients with an OHS ≥ 42 and TKR patients with an OKS ≥ 37
had a higher NRS for satisfaction and a greater likelihood of being
willing to undergo surgery again. Conclusions. PASS thresholds appear larger at mid-term follow-up than at six
months after surgery. With- out
To develop prediction models using machine-learning (ML) algorithms for 90-day and one-year mortality prediction in femoral neck fracture (FNF) patients aged 50 years or older based on the Hip fracture Evaluation with Alternatives of Total Hip arthroplasty versus Hemiarthroplasty (HEALTH) and Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trials. This study included 2,388 patients from the HEALTH and FAITH trials, with 90-day and one-year mortality proportions of 3.0% (71/2,388) and 6.4% (153/2,388), respectively. The mean age was 75.9 years (SD 10.8) and 65.9% of patients (1,574/2,388) were female. The algorithms included patient and injury characteristics. Six algorithms were developed, internally validated and evaluated across discrimination (c-statistic; discriminative ability between those with risk of mortality and those without), calibration (observed outcome compared to the predicted probability), and the Brier score (composite of discrimination and calibration).Aims
Methods
To determine the major risk factors for unplanned reoperations (UROs) following corrective surgery for adult spinal deformity (ASD) and their interactions, using machine learning-based prediction algorithms and game theory. Patients who underwent surgery for ASD, with a minimum of two-year follow-up, were retrospectively reviewed. In total, 210 patients were included and randomly allocated into training (70% of the sample size) and test (the remaining 30%) sets to develop the machine learning algorithm. Risk factors were included in the analysis, along with clinical characteristics and parameters acquired through diagnostic radiology.Aims
Methods
Literature surrounding artificial intelligence (AI)-related applications for hip and knee arthroplasty has proliferated. However, meaningful advances that fundamentally transform the practice and delivery of joint arthroplasty are yet to be realized, despite the broad range of applications as we continue to search for meaningful and appropriate use of AI. AI literature in hip and knee arthroplasty between 2018 and 2021 regarding image-based analyses, value-based care, remote patient monitoring, and augmented reality was reviewed. Concerns surrounding meaningful use and appropriate methodological approaches of AI in joint arthroplasty research are summarized. Of the 233 AI-related orthopaedics articles published, 178 (76%) constituted original research, while the rest consisted of editorials or reviews. A total of 52% of original AI-related research concerns hip and knee arthroplasty (n = 92), and a narrative review is described. Three studies were externally validated. Pitfalls surrounding present-day research include conflating vernacular (“AI/machine learning”), repackaging limited registry data, prematurely releasing internally validated prediction models, appraising model architecture instead of inputted data, withholding code, and evaluating studies using antiquated regression-based guidelines. While AI has been applied to a variety of hip and knee arthroplasty applications with limited clinical impact, the future remains promising if the question is meaningful, the methodology is rigorous and transparent, the data are rich, and the model is externally validated. Simple checkpoints for meaningful AI adoption include ensuring applications focus on: administrative support over clinical evaluation and management; necessity of the advanced model; and the novelty of the question being answered. Cite this article:
The October 2023 Hip & Pelvis Roundup360 looks at: Femoroacetabular impingement syndrome at ten years – how do athletes do?; Venous thromboembolism in patients following total joint replacement: are transfusions to blame?; What changes in pelvic sagittal tilt occur 20 years after total hip arthroplasty?; Can stratified care in hip arthroscopy predict successful and unsuccessful outcomes?; Hip replacement into your nineties; Can large language models help with follow-up?; The most taxing of revisions – proximal femoral replacement for periprosthetic joint infection – what’s the benefit of dual mobility?
Prediction tools are instruments which are commonly used to estimate the prognosis in oncology and facilitate clinical decision-making in a more personalized manner. Their popularity is shown by the increasing numbers of prediction tools, which have been described in the medical literature. Many of these tools have been shown to be useful in the field of soft-tissue sarcoma of the extremities (eSTS). In this annotation, we aim to provide an overview of the available prediction tools for eSTS, provide an approach for clinicians to evaluate the performance and usefulness of the available tools for their own patients, and discuss their possible applications in the management of patients with an eSTS. Cite this article:
The risk factors for recurrent instability (RI) following a primary traumatic anterior shoulder dislocation (PTASD) remain unclear. In this study, we aimed to determine the rate of RI in a large cohort of patients managed nonoperatively after PTASD and to develop a clinical prediction model. A total of 1,293 patients with PTASD managed nonoperatively were identified from a trauma database (mean age 23.3 years (15 to 35); 14.3% female). We assessed the prevalence of RI, and used multivariate regression modelling to evaluate which demographic- and injury-related factors were independently predictive for its occurrence.Aims
Methods
The October 2023 Spine Roundup360 looks at: Cutting through surgical smoke: the science of cleaner air in spinal operations; Unlocking success: key factors in thoracic spine decompression and fusion for ossification of the posterior longitudinal ligament; Deep learning algorithm for identifying cervical cord compression due to degenerative canal stenosis on radiography; Surgeon experience influences robotics learning curve for minimally invasive lumbar fusion; Decision-making algorithm for the surgical treatment of degenerative lumbar spondylolisthesis of L4/L5; Response to preoperative steroid injections predicts surgical outcomes in patients undergoing fusion for isthmic spondylolisthesis.
The August 2023 Hip & Pelvis Roundup360 looks at: Using machine learning to predict venous thromboembolism and major bleeding events following total joint arthroplasty; Antibiotic length in revision total hip arthroplasty; Preoperative colonization and worse outcomes; Short stem cemented total hip arthroplasty; What are the outcomes of one- versus two-stage revisions in the UK?; To cement or not to cement? The best approach in hemiarthroplasty; Similar re-revisions in cemented and cementless femoral revisions for periprosthetic femoral fractures in total hip arthroplasty; Are hip precautions still needed?
Economic evaluation provides a framework for assessing the costs and consequences of alternative programmes or interventions. One common vehicle for economic evaluations in the healthcare context is the decision-analytic model, which synthesizes information on parameter inputs (for example, probabilities or costs of clinical events or health states) from multiple sources and requires application of mathematical techniques, usually within a software program. A plethora of decision-analytic modelling-based economic evaluations of orthopaedic interventions have been published in recent years. This annotation outlines a number of issues that can help readers, reviewers, and decision-makers interpret evidence from decision-analytic modelling-based economic evaluations of orthopaedic interventions. Cite this article:
Within healthcare, several measures are used to quantify and compare the severity of health conditions. Two common measures are disability weight (DW), a context-independent value representing severity of a health state, and utility weight (UW), a context-dependent measure of health-related quality of life. Neither of these measures have previously been determined for developmental dysplasia of the hip (DDH). The aim of this study is to determine the DW and country-specific UWs for DDH. A survey was created using three different methods to estimate the DW: a preference ranking exercise, time trade-off exercise, and visual analogue scale (VAS). Participants were fully licensed orthopaedic surgeons who were contacted through national and international orthopaedic organizations. A global DW was calculated using a random effects model through an inverse-variance approach. A UW was calculated for each country as one minus the country-specific DW composed of the time trade-off exercise and VAS.Aims
Methods
To identify variables independently associated with same-day discharge (SDD) of patients following revision total knee arthroplasty (rTKA) and to develop machine learning algorithms to predict suitable candidates for outpatient rTKA. Data were obtained from the American College of Surgeons National Quality Improvement Programme (ACS-NSQIP) database from the years 2018 to 2020. Patients with elective, unilateral rTKA procedures and a total hospital length of stay between zero and four days were included. Demographic, preoperative, and intraoperative variables were analyzed. A multivariable logistic regression (MLR) model and various machine learning techniques were compared using area under the curve (AUC), calibration, and decision curve analysis. Important and significant variables were identified from the models.Aims
Methods
Rotator cuff tear (RCT) is the leading cause of shoulder pain, primarily associated with age-related tendon degeneration. This study aimed to elucidate the potential differential gene expressions in tendons across different age groups, and to investigate their roles in tendon degeneration. Linear regression and differential expression (DE) analyses were performed on two transcriptome profiling datasets of torn supraspinatus tendons to identify age-related genes. Subsequent functional analyses were conducted on these candidate genes to explore their potential roles in tendon ageing. Additionally, a secondary DE analysis was performed on candidate genes by comparing their expressions between lesioned and normal tendons to explore their correlations with RCTs.Aims
Methods
The use of artificial intelligence (AI) is rapidly growing across many domains, of which the medical field is no exception. AI is an umbrella term defining the practical application of algorithms to generate useful output, without the need of human cognition. Owing to the expanding volume of patient information collected, known as ‘big data’, AI is showing promise as a useful tool in healthcare research and across all aspects of patient care pathways. Practical applications in orthopaedic surgery include: diagnostics, such as fracture recognition and tumour detection; predictive models of clinical and patient-reported outcome measures, such as calculating mortality rates and length of hospital stay; and real-time rehabilitation monitoring and surgical training. However, clinicians should remain cognizant of AI’s limitations, as the development of robust reporting and validation frameworks is of paramount importance to prevent avoidable errors and biases. The aim of this review article is to provide a comprehensive understanding of AI and its subfields, as well as to delineate its existing clinical applications in trauma and orthopaedic surgery. Furthermore, this narrative review expands upon the limitations of AI and future direction. Cite this article:
This study aimed to compare the performance of survival prediction models for bone metastases of the extremities (BM-E) with pathological fractures in an Asian cohort, and investigate patient characteristics associated with survival. This retrospective cohort study included 469 patients, who underwent surgery for BM-E between January 2009 and March 2022 at a tertiary hospital in South Korea. Postoperative survival was calculated using the PATHFx3.0, SPRING13, OPTIModel, SORG, and IOR models. Model performance was assessed with area under the curve (AUC), calibration curve, Brier score, and decision curve analysis. Cox regression analyses were performed to evaluate the factors contributing to survival.Aims
Methods
Understanding spinopelvic mechanics is important for the success of total hip arthroplasty (THA). Despite significant advancements in appreciating spinopelvic balance, numerous challenges remain. It is crucial to recognize the individual variability and postoperative changes in spinopelvic parameters and their consequential impact on prosthetic component positioning to mitigate the risk of dislocation and enhance postoperative outcomes. This review describes the integration of advanced diagnostic approaches, enhanced technology, implant considerations, and surgical planning, all tailored to the unique anatomy and biomechanics of each patient. It underscores the importance of accurately predicting postoperative spinopelvic mechanics, selecting suitable imaging techniques, establishing a consistent nomenclature for spinopelvic stiffness, and considering implant-specific strategies. Furthermore, it highlights the potential of artificial intelligence to personalize care. Cite this article:
To map the Oxford Knee Score (OKS) and High Activity Arthroplasty Score (HAAS) items to a common scale, and to investigate the psychometric properties of this new scale for the measurement of knee health. Patient-reported outcome measure (PROM) data measuring knee health were obtained from the NHS PROMs dataset and Total or Partial Knee Arthroplasty Trial (TOPKAT). Assumptions for common scale modelling were tested. A graded response model (fitted to OKS item responses in the NHS PROMs dataset) was used as an anchor to calibrate paired HAAS items from the TOPKAT dataset. Information curves for the combined OKS-HAAS model were plotted. Bland-Altman analysis was used to compare common scale scores derived from OKS and HAAS items. A conversion table was developed to map between HAAS, OKS, and the common scale.Aims
Methods
Artificial intelligence (AI) is, in essence, the concept of ‘computer thinking’, encompassing methods that train computers to perform and learn from executing certain tasks, called machine learning, and methods to build intricate computer models that both learn and adapt, called complex neural networks. Computer vision is a function of AI by which machine learning and complex neural networks can be applied to enable computers to capture, analyze, and interpret information from clinical images and visual inputs. This annotation summarizes key considerations and future perspectives concerning computer vision, questioning the need for this technology (the ‘why’), the current applications (the ‘what’), and the approach to unlocking its full potential (the ‘how’). Cite this article:
Hip dysplasia (HD) leads to premature osteoarthritis. Timely detection and correction of HD has been shown to improve pain, functional status, and hip longevity. Several time-consuming radiological measurements are currently used to confirm HD. An artificial intelligence (AI) software named HIPPO automatically locates anatomical landmarks on anteroposterior pelvis radiographs and performs the needed measurements. The primary aim of this study was to assess the reliability of this tool as compared to multi-reader evaluation in clinically proven cases of adult HD. The secondary aims were to assess the time savings achieved and evaluate inter-reader assessment. A consecutive preoperative sample of 130 HD patients (256 hips) was used. This cohort included 82.3% females (n = 107) and 17.7% males (n = 23) with median patient age of 28.6 years (interquartile range (IQR) 22.5 to 37.2). Three trained readers’ measurements were compared to AI outputs of lateral centre-edge angle (LCEA), caput-collum-diaphyseal (CCD) angle, pelvic obliquity, Tönnis angle, Sharp’s angle, and femoral head coverage. Intraclass correlation coefficients (ICC) and Bland-Altman analyses were obtained.Aims
Methods
Obtaining solid implant fixation is crucial in revision total knee arthroplasty (rTKA) to avoid aseptic loosening, a major reason for re-revision. This study aims to validate a novel grading system that quantifies implant fixation across three anatomical zones (epiphysis, metaphysis, diaphysis). Based on pre-, intra-, and postoperative assessments, the novel grading system allocates a quantitative score (0, 0.5, or 1 point) for the quality of fixation achieved in each anatomical zone. The criteria used by the algorithm to assign the score include the bone quality, the size of the bone defect, and the type of fixation used. A consecutive cohort of 245 patients undergoing rTKA from 2012 to 2018 were evaluated using the current novel scoring system and followed prospectively. In addition, 100 first-time revision cases were assessed radiologically from the original cohort and graded by three observers to evaluate the intra- and inter-rater reliability of the novel radiological grading system.Aims
Methods
Total hip arthroplasty (THA) and total knee arthroplasty (TKA) are common orthopaedic procedures requiring postoperative radiographs to confirm implant positioning and identify complications. Artificial intelligence (AI)-based image analysis has the potential to automate this postoperative surveillance. The aim of this study was to prepare a scoping review to investigate how AI is being used in the analysis of radiographs following THA and TKA, and how accurate these tools are. The Embase, MEDLINE, and PubMed libraries were systematically searched to identify relevant articles. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for scoping reviews and Arksey and O’Malley framework were followed. Study quality was assessed using a modified Methodological Index for Non-Randomized Studies tool. AI performance was reported using either the area under the curve (AUC) or accuracy.Aims
Methods
Accurate identification of the ankle joint centre is critical for estimating tibial coronal alignment in total knee arthroplasty (TKA). The purpose of the current study was to leverage artificial intelligence (AI) to determine the accuracy and effect of using different radiological anatomical landmarks to quantify mechanical alignment in relation to a traditionally defined radiological ankle centre. Patients with full-limb radiographs from the Osteoarthritis Initiative were included. A sub-cohort of 250 radiographs were annotated for landmarks relevant to knee alignment and used to train a deep learning (U-Net) workflow for angle calculation on the entire database. The radiological ankle centre was defined as the midpoint of the superior talus edge/tibial plafond. Knee alignment (hip-knee-ankle angle) was compared against 1) midpoint of the most prominent malleoli points, 2) midpoint of the soft-tissue overlying malleoli, and 3) midpoint of the soft-tissue sulcus above the malleoli.Aims
Methods
We obtained information from the Elective Orthopaedic
Centre on 1523 patients with baseline and six-month Oxford hip scores
(OHS) after undergoing primary hip replacement (THR) and 1784 patients
with Oxford knee scores (OKS) for primary knee replacement (TKR)
who completed a six-month satisfaction questionnaire. Receiver operating characteristic curves identified an absolute
change in OHS of 14 points or more as the point that discriminates
best between patients’ satisfaction levels and an 11-point change
for the OKS. Satisfaction is highest (97.6%) in patients with an
absolute change in OHS of 14 points or more, compared with lower
levels of satisfaction (81.8%) below this threshold. Similarly,
an 11-point absolute change in OKS was associated with 95.4% satisfaction
compared with 76.5% below this threshold. For the six-month OHS
a score of 35 points or more distinguished patients with the highest
satisfaction level, and for the six-month OKS 30 points or more identified
the highest level of satisfaction. The thresholds varied according
to patients’ pre-operative score, where those with severe pre-operative
pain/function required a lower six-month score to achieve the highest
levels of satisfaction. Our data suggest that the choice of a six-month follow-up to
assess patient-reported outcomes of THR/TKR is acceptable. The thresholds
help to differentiate between patients with different levels of
satisfaction, but
There is increasing popularity in the use of artificial intelligence and machine-learning techniques to provide diagnostic and prognostic models for various aspects of Trauma & Orthopaedic surgery. However, correct interpretation of these models is difficult for those without specific knowledge of computing or health data science methodology. Lack of current reporting standards leads to the potential for significant heterogeneity in the design and quality of published studies. We provide an overview of machine-learning techniques for the lay individual, including key terminology and best practice reporting guidelines. Cite this article:
Adverse spinal motion or balance (spine mobility) and adverse pelvic mobility, in combination, are often referred to as adverse spinopelvic mobility (SPM). A stiff lumbar spine, large posterior standing pelvic tilt, and severe sagittal spinal deformity have been identified as risk factors for increased hip instability. Adverse SPM can create functional malposition of the acetabular components and hence is an instability risk. Adverse pelvic mobility is often, but not always, associated with abnormal spinal motion parameters. Dislocation rates for dual-mobility articulations (DMAs) have been reported to be between 0% and 1.1%. The aim of this study was to determine the early survivorship from the Australian Orthopaedic Association National Joint Replacement Registry (AOANJRR) of patients with adverse SPM who received a DMA. A multicentre study was performed using data from 227 patients undergoing primary total hip arthroplasty (THA), enrolled consecutively. All the patients who had one or more adverse spine or pelvic mobility parameter had a DMA inserted at the time of their surgery. The mean age was 76 years (22 to 93) and 63% were female (n = 145). At a mean of 14 months (5 to 31) postoperatively, the AOANJRR was analyzed for follow-up information. Reasons for revision and types of revision were identified.Aims
Methods
The primary aim was to estimate the cost-effectiveness of routine operative fixation for all patients with humeral shaft fractures. The secondary aim was to estimate the health economic implications of using a Radiographic Union Score for HUmeral fractures (RUSHU) of < 8 to facilitate selective fixation for patients at risk of nonunion. From 2008 to 2017, 215 patients (mean age 57 yrs (17 to 18), 61% female (n = 130/215)) with a nonoperatively managed humeral diaphyseal fracture were retrospectively identified. Union was achieved in 77% (n = 165/215) after initial nonoperative management, with 23% (n = 50/215) uniting after surgery for nonunion. The EuroQol five-dimension three-level health index (EQ-5D-3L) was obtained via postal survey. Multiple regression was used to determine the independent influence of patient, injury, and management factors upon the EQ-5D-3L. An incremental cost-effectiveness ratio (ICER) of < £20,000 per quality-adjusted life-year (QALY) gained was considered cost-effective.Aims
Methods
The aim of this study was to develop and internally validate a prognostic nomogram to predict the probability of gaining a functional range of motion (ROM ≥ 120°) after open arthrolysis of the elbow in patients with post-traumatic stiffness of the elbow. We developed the Shanghai Prediction Model for Elbow Stiffness Surgical Outcome (SPESSO) based on a dataset of 551 patients who underwent open arthrolysis of the elbow in four institutions. Demographic and clinical characteristics were collected from medical records. The least absolute shrinkage and selection operator regression model was used to optimize the selection of relevant features. Multivariable logistic regression analysis was used to build the SPESSO. Its prediction performance was evaluated using the concordance index (C-index) and a calibration graph. Internal validation was conducted using bootstrapping validation.Aims
Methods
The aim of this study was to review the current evidence surrounding curve type and morphology on curve progression risk in adolescent idiopathic scoliosis (AIS). A comprehensive search was conducted by two independent reviewers on PubMed, Embase, Medline, and Web of Science to obtain all published information on morphological predictors of AIS progression. Search items included ‘adolescent idiopathic scoliosis’, ‘progression’, and ‘imaging’. The inclusion and exclusion criteria were carefully defined. Risk of bias of studies was assessed with the Quality in Prognostic Studies tool, and level of evidence for each predictor was rated with the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) approach. In all, 6,286 publications were identified with 3,598 being subjected to secondary scrutiny. Ultimately, 26 publications (25 datasets) were included in this review.Aims
Methods
The aims of this study were to assess mapping models to predict the three-level version of EuroQoL five-dimension utility index (EQ-5D-3L) from the Oxford Knee Score (OKS) and validate these before and after total knee arthroplasty (TKA). A retrospective cohort of 5,857 patients was used to create the prediction models, and a second cohort of 721 patients from a different centre was used to validate the models, all of whom underwent TKA. Patient characteristics, BMI, OKS, and EQ-5D-3L were collected preoperatively and one year postoperatively. Generalized linear regression was used to formulate the prediction models.Aims
Methods
Artificial intelligence and machine-learning analytics have gained extensive popularity in recent years due to their clinically relevant applications. A wide range of proof-of-concept studies have demonstrated the ability of these analyses to personalize risk prediction, detect implant specifics from imaging, and monitor and assess patient movement and recovery. Though these applications are exciting and could potentially influence practice, it is imperative to understand when these analyses are indicated and where the data are derived from, prior to investing resources and confidence into the results and conclusions. In this article, we review the current benefits and potential limitations of machine-learning for the orthopaedic surgeon with a specific emphasis on data quality.
To develop and internally validate a preoperative clinical prediction model for acute adjacent vertebral fracture (AVF) after vertebral augmentation to support preoperative decision-making, named the after vertebral augmentation (AVA) score. In this prognostic study, a multicentre, retrospective single-level vertebral augmentation cohort of 377 patients from six Japanese hospitals was used to derive an AVF prediction model. Backward stepwise selection (p < 0.05) was used to select preoperative clinical and imaging predictors for acute AVF after vertebral augmentation for up to one month, from 14 predictors. We assigned a score to each selected variable based on the regression coefficient and developed the AVA scoring system. We evaluated sensitivity and specificity for each cut-off, area under the curve (AUC), and calibration as diagnostic performance. Internal validation was conducted using bootstrapping to correct the optimism.Aims
Methods
In recent years, machine learning (ML) and artificial neural networks (ANNs), a particular subset of ML, have been adopted by various areas of healthcare. A number of diagnostic and prognostic algorithms have been designed and implemented across a range of orthopaedic sub-specialties to date, with many positive results. However, the methodology of many of these studies is flawed, and few compare the use of ML with the current approach in clinical practice. Spinal surgery has advanced rapidly over the past three decades, particularly in the areas of implant technology, advanced surgical techniques, biologics, and enhanced recovery protocols. It is therefore regarded an innovative field. Inevitably, spinal surgeons will wish to incorporate ML into their practice should models prove effective in diagnostic or prognostic terms. The purpose of this article is to review published studies that describe the application of neural networks to spinal surgery and which actively compare ANN models to contemporary clinical standards allowing evaluation of their efficacy, accuracy, and relatability. It also explores some of the limitations of the technology, which act to constrain the widespread adoption of neural networks for diagnostic and prognostic use in spinal care. Finally, it describes the necessary considerations should institutions wish to incorporate ANNs into their practices. In doing so, the aim of this review is to provide a practical approach for spinal surgeons to understand the relevant aspects of neural networks. Cite this article:
To develop and validate patient-centred algorithms that estimate individual risk of death over the first year after elective joint arthroplasty surgery for osteoarthritis. A total of 763,213 hip and knee joint arthroplasty episodes recorded in the National Joint Registry for England and Wales (NJR) and 105,407 episodes from the Norwegian Arthroplasty Register were used to model individual mortality risk over the first year after surgery using flexible parametric survival regression.Aims
Methods
While preoperative bloodwork is routinely ordered, its value in determining which patients are at risk of postoperative readmission following total knee arthroplasty (TKA) and total hip arthroplasty (THA) is unclear. The objective of this study was to determine which routinely ordered preoperative blood markers have the strongest association with acute hospital readmission for patients undergoing elective TKA and THA. Two population-based retrospective cohorts were assembled for all adult primary elective TKA (n = 137,969) and THA (n = 78,532) patients between 2011 to 2018 across 678 North American hospitals using the American College of Surgeons National Quality Improvement Programme (ACS-NSQIP) registry. Six routinely ordered preoperative blood markers - albumin, haematocrit, platelet count, white blood cell count (WBC), estimated glomerular filtration rate (eGFR), and sodium level - were queried. The association between preoperative blood marker values and all-cause readmission within 30 days of surgery was compared using univariable analysis and multivariable logistic regression adjusted for relevant patient and treatment factors.Aims
Methods
The purpose of this study was to develop a personalized outcome prediction tool, to be used with knee arthroplasty patients, that predicts outcomes (lengths of stay (LOS), 90 day readmission, and one-year patient-reported outcome measures (PROMs) on an individual basis and allows for dynamic modifiable risk factors. Data were prospectively collected on all patients who underwent total or unicompartmental knee arthroplasty at a between July 2015 and June 2018. Cohort 1 (n = 5,958) was utilized to develop models for LOS and 90 day readmission. Cohort 2 (n = 2,391, surgery date 2015 to 2017) was utilized to develop models for one-year improvements in Knee Injury and Osteoarthritis Outcome Score (KOOS) pain score, KOOS function score, and KOOS quality of life (QOL) score. Model accuracies within the imputed data set were assessed through cross-validation with root mean square errors (RMSEs) and mean absolute errors (MAEs) for the LOS and PROMs models, and the index of prediction accuracy (IPA), and area under the curve (AUC) for the readmission models. Model accuracies in new patient data sets were assessed with AUC.Aims
Methods
Failure of irrigation and debridement (I&D) for prosthetic joint infection (PJI) is influenced by numerous host, surgical, and pathogen-related factors. We aimed to develop and validate a practical, easy-to-use tool based on machine learning that may accurately predict outcome following I&D surgery taking into account the influence of numerous factors. This was an international, multicentre retrospective study of 1,174 revision total hip (THA) and knee arthroplasties (TKA) undergoing I&D for PJI between January 2005 and December 2017. PJI was defined using the Musculoskeletal Infection Society (MSIS) criteria. A total of 52 variables including demographics, comorbidities, and clinical and laboratory findings were evaluated using random forest machine learning analysis. The algorithm was then verified through cross-validation.Aims
Methods
To assess the effect of physical exercise (PE) on the histological and transcriptional characteristics of proteoglycan-induced arthritis (PGIA) in BALB/c mice. Following PGIA, mice were subjected to treadmill PE for ten weeks. The tarsal joints were used for histological and genetic analysis through microarray technology. The genes differentially expressed by PE in the arthritic mice were obtained from the microarray experiments. Bioinformatic analysis in the DAVID, STRING, and Cytoscape bioinformatic resources allowed the association of these genes in biological processes and signalling pathways.Aims
Methods
The aim of this study was to determine if the Oxford Knee and Hip Score (OKHS) can accurately predict when a primary knee or hip referral is deemed nonsurgical We retrospectively reviewed pre-consultation OKHS for all consecutive primary total knee arthroplasty (TKA) and total hip arthroplasty (THA) consultations of a single surgeon over three years. The 1436 knees (1016 patients) and 478 hips (388 patients) included were categorized based on the surgeon’s decision into those offered surgery during the first consultation Aims
Patients and Methods
We investigated whether blood metal ion levels could effectively
identify patients with bilateral Birmingham Hip Resurfacing (BHR)
implants who have adverse reactions to metal debris (ARMD). Metal ion levels in whole blood were measured in 185 patients
with bilateral BHRs. Patients were divided into those with ARMD
who either had undergone a revision for ARMD or had ARMD on imaging
(n = 30), and those without ARMD (n = 155). Receiver operating characteristic
analysis was used to determine the optimal thresholds of blood metal
ion levels for identifying patients with ARMD.Aims
Patients and Methods
Graft-tunnel mismatch of the bone-patellar tendon-bone
(BPTB) graft is a major concern during anatomical anterior cruciate
ligament (ACL) reconstruction if the femoral tunnel is positioned
using a far medial portal technique, as the femoral tunnel tends
to be shorter compared with that positioned using a transtibial
portal technique. This study describes an accurate method of calculating
the ideal length of bone plugs of a BPTB graft required to avoid
graft–tunnel mismatch during anatomical ACL reconstruction using
a far medial portal technique of femoral tunnel positioning. Based on data obtained intra-operatively from 60 anatomical ACL
reconstruction procedures, we calculated the length of bone plugs
required in the BPTB graft to avoid graft–tunnel mismatch. When
this was prevented in all the 60 cases, we found that the mean length
of femoral bone plug that remained in contact with the interference
screw within the femoral tunnel was 14 mm (12 to 22) and the mean
length of tibial bone plug that remained in contact with the interference
screw within the tibial tunnel was 23 mm (18 to 28). These results
were used to validate theoretical formulae developed to predict
the required length of bone plugs in BPTB graft during anatomical
ACL reconstruction using a far medial portal technique. Cite this article:
Clinical prediction algorithms are used to differentiate
transient synovitis from septic arthritis. These algorithms typically
include the erythrocyte sedimentation rate (ESR), although in clinical practice
measurement of the C-reactive protein (CRP) has largely replaced
the ESR. We evaluated the use of CRP in a predictive algorithm. The records of 311 children with an effusion of the hip, which
was confirmed on ultrasound, were reviewed (mean age 5.3 years (0.2
to 15.1)). Of these, 269 resolved without intervention and without
long-term sequelae and were considered to have had transient synovitis.
The remaining 42 underwent arthrotomy because of suspicion of septic
arthritis. Infection was confirmed in 29 (18 had micro-organisms
isolated and 11 had a high synovial fluid white cell count). In
the remaining 13 no evidence of infection was found and they were
also considered to have had transient synovitis. In total 29 hips
were categorised as septic arthritis and 282 as transient synovitis.
The temperature, weight-bearing status, peripheral white blood cell
count and CRP was reviewed in each patient. A CRP >
20 mg/l was the strongest independent risk factor for
septic arthritis (odds ratio 81.9, p <
0.001). A multivariable
prediction model revealed that only two determinants (weight-bearing
status and CRP >
20 mg/l) were independent in differentiating septic
arthritis from transient synovitis. Individuals with neither predictor
had a <
1% probability of septic arthritis, but those with both
had a 74% probability of septic arthritis. A two-variable algorithm
can therefore quantify the risk of septic arthritis, and is an excellent
negative predictor.
The December 2014 Oncology Roundup360 looks at: metaphyseal and diaphyseal osteosarcoma subtly different beasts; sports and endoprosthetic reconstruction of the knee; is curettage without tissue diagnosis sensible in cartilaginous tumours?; autoclaved autograft in bone tumour reconstruction; vascularised graft a step too far in bone defects?; interdigitated neoadjuvant chemoradiotherapy in high-grade sarcoma; predicting life expectancy in patients with painful metastasis; and osteolytic lesions of the hands and feet.
The April 2014 Research Roundup360 looks at: scientific writing needed in orthopaedic papers; antiseptics and osteoblasts; thromboembolic management in orthopaedic patients; nicotine and obesity in post-operative complications; defining the “Patient Acceptable Symptom State”; and cheap and nasty implants of poor quality.
The February 2014 Knee Roundup360 looks at: whether sham surgery is as good as arthroscopic meniscectomy; distraction in knee osteoarthritis; whether trans-tibial tunnel placement increases the risk of graft failure in ACL surgery; whether joint replacements prevent cardiac events; the size of the pulmonary embolism problem; tranexamic acid and knee replacement haemostasis; matching the demand for knee replacement and follow-up; predicting the length of stay after knee replacement; and popliteal artery injury in TKR.
The February 2013 Oncology Roundup360 looks at: proximal fibular tumours; radiotherapy-induced chondrosarcoma; mega-prosthesis; CRP predictions of sarcoma survival; predicting survival in metastatic disease; MRI for recurrence in osteoid osteoma; and a sarcoma refresher
The revised Tokuhashi, Tomita and modified Bauer
scores are commonly used to make difficult decisions in the management
of patients presenting with spinal metastases. A prospective cohort
study of 199 consecutive patients presenting with spinal metastases,
treated with either surgery and/or radiotherapy, was used to compare
the three systems. Cox regression, Nagelkerke’s R2 and
Harrell’s concordance were used to compare the systems and find their
best predictive items. The three systems were equally good in terms
of overall prognostic performance. Their most predictive items were
used to develop the Oswestry Spinal Risk Index (OSRI), which has
a similar concordance, but a larger coefficient of determination
than any of these three scores. A bootstrap procedure was used to
internally validate this score and determine its prediction optimism. The OSRI is a simple summation of two elements: primary tumour
pathology (PTP) and general condition (GC): OSRI = PTP + (2 – GC). This simple score can predict life expectancy accurately in patients
presenting with spinal metastases. It will be helpful in making
difficult clinical decisions without the delay of extensive investigations. Cite this article:
The June 2012 Oncology Roundup360 looks at: avoiding pelvic hemipelvectomy; proximal femoral metastasis; extendible prostheses; rotationplasty; soft-tissue sarcomas; osteosarcoma of the pelvis; recurrent chondrosarcoma ; MRI and the differentiation between benign and malignant lesions; and malignant fibrous histiocytoma.
Objectives. We aimed first to summarise minimal clinically important differences
(MCIDs) after total hip (THR) or knee replacement (TKR) in health-related
quality of life (HRQoL), measured using the Short-Form 36 (SF-36).
Secondly, we aimed to improve the precision of MCID estimates by
means of meta-analysis. Methods. We conducted a systematic review of English and non-English articles
using MEDLINE, the Cochrane Controlled Trials Register (1960–2011),
EMBASE (1991–2011), Web of Science, Academic Search Premier and
Science Direct. Bibliographies of included studies were searched
in order to find additional studies. Search terms included MCID
or minimal clinically important change, THR or TKR and Short-Form
36. We included longitudinal studies that estimated MCID of SF-36
after THR or TKR. Results. Three studies met our inclusion criteria, describing a distinct
study population: primary THR, primary TKR and revision THR. No
synthesis of study results can be given. Conclusions. Although we found MCIDs in HRQoL after THR or TKR have limited
precision and are not