m (Cinmemj moved page Draft Samper 251045735 to Rodriguez-Rodriguez et al 2017a) |
|
(No difference)
|
Different machine learning methods have been used to develop predictive models of high quality and precision [1]. Among them, Random Survival Forests (RSF) has been proposed as an alternative to traditional survival models [2], being able to overcome most of the limitation of traditional survival techniques, such as Cox proportional hazards models. Objectives Our objective was to develop and internally validate a predictive model for rheumatoid arthritis (RA) mortality using Random Survival Forests (RSF). Methods Retrospective longitudinal study involving 1,461 patients diagnosed with RA between January 1994 and August 2011, and followed at the outpatient clinic of the Rheumatology Department of the Hospital Clínico San Carlos (Madrid, Spain) until death or September 2013. Demographic and clinical-related variables collected during the first two years after disease diagnosis were used. RSF models were developed, based on 1,000 trees. 100 iterations of each model were performed to measure the mean and standard deviation (SD) of the predictive error and the integrated Brier score (IBS). Missing values were imputed using the function implemented by the randomForestSRC package [3]. The predictive capacity of the variables was assessed using the “variable importance” (VIMP). Two models were constructed using the log-rank (MLG) or log-rank score (MLGS) splitting rules. The model with the lowest prediction error was selected. Next, those variables with negative VIMP were excluded and a final model developed. Results 148 patients died (10.1%). MLG showed the lowest prediction error. All variables exhibited a positive VIMP. Final model showed a mean (SD) prediction error and IBS of 0.187 (0.002) and 0.150 (0.003) respectively. The most important predictor variables were age at diagnosis, median erythrocyte sedimentation rate and number of hospital admissions in the first 2 years after RA diagnosis. Conclusions We developed an accurate and precise model for RA mortality using RSF. Age and disease activity showed the highest influence in mortality.
Published on 01/01/2017
DOI: 10.1136/annrheumdis-2017-eular.6139
Licence: CC BY-NC-SA license
Are you one of the authors of this document?