Comparison of the National Early Warning Score+Lactate score with the pre-endoscopic Rockall, Glasgow-Blatchford, and AIMS65 scores in patients with upper gastrointestinal bleeding
Article information
Abstract
Objective
We compared the predictive value of the National Early Warning Score+Lactate (NEWS+L) score with those of other parameters such as the pre-endoscopic Rockall score (PERS), Glasgow-Blatchford score (GBS), and albumin, international normalized ratio, altered mental status, systolic blood pressure, age older than 65 years score (AIMS65) among patients with upper gastrointestinal bleeding (UGIB).
Methods
We conducted a retrospective study of patients with UGIB during 2 consecutive years. The primary outcome was the composite of in-hospital death, intensive care unit admission, and the need for ≥5 packs of red blood cell transfusion within 24 hours.
Results
Among 530 included patients, the composite outcome occurred in 59 patients (19 in-hospital deaths, 13 intensive care unit admissions, and 40 transfusions of ≥5 packs of red blood cells within 24 hours). The area under the receiver operating characteristic curve of the NEWS+L score for the composite outcome was 0.76 (95% confidence interval, 0.70 to 0.82), which demonstrated a significant difference compared to PERS (0.66, 0.59–0.73, P=0.004), but not to GBS (0.70, 0.64–0.77, P=0.141) and AIMS65 (0.76, 0.70–0.83, P=0.999). The sensitivities of NEWS+L scores of 3 (n=34, 6.4%), 4 (n=92, 17.4%), and 5 (n=171, 32.3%) were 100%, 98.3%, and 96.6%, respectively, while the sensitivity of an AIMS65 score of 0 (n=159, 30.0%) was 91.5%.
Conclusion
The NEWS+L score showed better discriminative performance than the PERS and comparable discriminative performance to the GBS and AIMS65. The NEWS+L score may be used to identify low-risk patients among patients with UGIB.
INTRODUCTION
Upper gastrointestinal bleeding (UGIB) is defined as bleeding that results from a source proximal to the ligament of Treitz and accounts for up to 85% of gastrointestinal bleeding episodes [1]. The annual incidence of UGIB is approximately 100 per 100,000 individuals [2,3]. UGIB is a potentially life-threatening condition, with a mortality rate of 6% to 10% [4-8]. To manage patients with UGIB, substantial medical resources, such as urgent endoscopy or blood transfusion, are often required; however, this may not always be the case. Therefore, the European Society of Gastrointestinal Endoscopy recently recommended stratifying patients with UGIB into high and low risk groups when physicians make decisions regarding the timing of endoscopy or hospital discharge [9]. High-risk patients have higher risks of requiring blood transfusions, admission to the intensive care unit (ICU), and death. To date, several scoring systems have been developed to predict outcomes in patients with UGIB such as the pre-endoscopic Rockall score (PERS) [10], Glasgow-Blatchford score (GBS) [11], and albumin, international normalized ratio, altered mental status, systolic blood pressure, age older than 65 years score (AIMS65) [12]. However, a recent study reported that the aforementioned three scoring systems showed only fair to poor discriminative values for 30-day mortality. Moreover, the PERS and AIMS65 showed poor discriminative values for the need for transfusions [13].
Recently, serum lactate has been identified as a significant factor associated with mortality in critically ill patients with UGIB [14-16]. However, the aforementioned risk scoring systems for patients with UGIB do not include the serum lactate level. To this end, the National Early Warning Score+Lactate (NEWS+L) score [17], which the authors originally designed, and which is composed of a physiologic component (NEWS) and a laboratory component (the serum lactate level), has been developed.
Herein, we hypothesized that the NEWS+L score, which includes the serum lactate level as a component, would have a better predictive value for clinically important outcomes than the PERS, GBS, and AIMS65 score among patients with UGIB. Accordingly, the primary aim of this study was to compare the predictive value of the NEWS+L score with those of the PERS, GBS, and AIMS65 score regarding the composite of mortality, ICU admission, and need for red blood cell (RBC) transfusions.
METHODS
Study design and setting
We performed a retrospective cohort study that included consecutive patients with UGIB between January 1, 2014 and December 31, 2015. This study was approved by the Institutional Review Board of the Chonbuk National University Hospital (2016-09-021), which waived the requirement of informed consent for all subjects in this study. The study hospital is a 1,200-bed urban academic, tertiary-care, university hospital.
Selection of participants
All patients 18 years of age or older who visited the study hospital with melena, hematochezia, hematemesis, and syncope were screened for eligibility in this study. For patients to be enrolled in the study, they needed to have confirmed UGIB by esophagogastroduodenoscopy (EGD) evaluation. The exclusion criteria were a lack of UGIB by EGD evaluation and/or an unavailable serum lactate level at the time of presentation to the emergency department (ED).
Measurement and data collection
A trained abstractor collected data from the charts using a structured data-collection form. The medical record review and data abstraction were performed by an emergency medicine resident who had 3 years of emergency medicine training. The resident underwent data-collection training that included how to define the eligibility criteria and other variables that were included in this study. After data collection was completed, random chart reviews were performed to ensure data accuracy. Screening was performed using a list that extracted symptoms of melena, hematochezia, hematemesis, or syncope from the electronic medical record.
The data points that were included were as follows: age, sex, comorbidities (chronic liver disease, chronic kidney disease, active malignancy, heart failure, or coronary artery disease), presenting symptoms (hematemesis, melena, hematochezia, or syncope), systolic blood pressure and heart rate at triage, National Early Warning Score (NEWS) at triage, lactate level, albumin, prothrombin time, INR, hemoglobin, blood urea nitrogen, endoscopic diagnosis (bleeding ulcer, bleeding varix, hemorrhagic gastritis, bleeding tumor, or Mallory-Weiss syndrome), endoscopic treatment (endoscopic variceal ligation, endoscopic variceal obturation, hemostasis [using hypertonic saline-epinephrine solution], or hemoclip), disposition (discharge, admission to the ICU, or ward), number of RBC transfusions within 24 hours, and survival status at hospital discharge. During the examination, comorbidities were checked based on the patients’ or guardians’ statements on the medical chart by ED physicians. At the study hospital, NEWS is automatically calculated using vital signs at triage and registered in the electronic medical record. The serum lactate level is a routine laboratory test at the study hospital ED.
The serum lactate level was primarily measured using arterial blood, but the venous blood lactate level was also permitted. The lactate level was measured using a Stat Profile Critical Care Xpress Analyzer (Nova Biomedical, Waltham, MA, USA). The measurement capacity for lactate in this instrument ranges from 0.3 to 20 mmol/L. This machine was periodically inspected by staff employed by the manufacturer.
The PERS, GBS, and AIMS65 score were calculated from the collected variables. The details of the NEWS+L score are shown in Table 1.
Outcome measures
The primary outcome was the composite of in-hospital death, ICU admission, and RBC transfusion of ≥5 packs within 24 hours. Each component of the primary outcome was then set as a secondary outcome. All deaths during the hospital stay were counted, regardless of the cause. Five packs of RBCs were determined arbitrarily as the cut-off; however, it seemed to be a meaningful number considering the result of a previous randomized control trial that compared restrictive transfusion with liberal transfusion among patients with UGIB. In the liberal strategy group, the mean number of transfused RBC packs was 3.7±3.8 [18].
Statistical analysis
Continuous data are presented as the mean and standard deviation. Continuous data that were not normally distributed are presented as the median and interquartile range. Discrete data are presented as both counts and percentages.
Student’s t-test for independent samples was used to compare the means of normally distributed variables. The Mann-Whitney U-test was used for variables that were not normally distributed. For categorical data, the chi-square test or the chi-square test with a Fisher exact test for 2×2 tables was used. The results were considered significant at a threshold of P<0.05 (two-tailed).
Pearson’s correlation test or Spearman’s rank correlation was used to investigate the correlations between the NEWS+L score and other scores according to the variable distribution. Calibration was assessed using the Hosmer-Lemeshow goodness-of-fit test.
An area under the receiver operating characteristic curve (AUROC) analysis was used to determine the predictive value of the NEWS+L score and other risk scores including the NEWS. The standard error of the mean and P-values for the AUROC and comparisons between the NEWS+L and other risk scores were calculated using the method described by Hanley and McNeil [19]. Sensitivity and specificity analyses were also performed for various cut-off points of the NEWS+L score and for the specific points of the other risk scores that were calculated for the low-risk group.
The logistic regression analysis results are presented as odds ratios with 95% confidence intervals (CIs). To reveal the associations between the serum lactate level and outcomes, multivariable logistic regression analyses were performed after adjusting for other laboratory variables. Trend factors with P<0.1 were included in the multivariable logistic model.
We referred to the STARD (Standards for Reporting Diagnostic Accuracy) statement for the analysis of the results [20]. All analyses were conducted using Stata ver. 11.1 (StataCorp., College Station, TX, USA) and SAS ver. 9.1 (SAS Institute Inc., Cary, NC, USA).
RESULTS
A total of 780 patients who presented with melena, hematochezia, hematemesis, or syncope were screened. Among those, 95 patients were not examined using EGD evaluation. In addition, 111 patients had lower gastrointestinal bleeding, 26 patients did not have UGIB when the EGD evaluation was conducted, and 2 patients were examined before ED arrival. After exclusion of the aforementioned patients, 546 patients were eligible. Among those patients, 16 patients were excluded due to a lack of an initially recorded lactate level. thus, 530 patients were finally enrolled in the analysis (Fig. 1). Of those patients, the composite outcome occurred in 59 patients; in-hospital death occurred in 19 patients, 13 patients were admitted to the ICU, and 40 patients received ≥5 packs of RBC transfusions within 24 hours.
Table 2 shows the characteristics of the entire cohort, the non-composite outcome group, and composite outcome group. The mean age of the entire cohort was 64.5±14.9 years, and 371 (70.0%) patients were male. There was no significant difference between the groups. Chronic liver disease and malignancy were the most frequent co-morbidities, and malignancies were found more often in the composite outcome group. Hematemesis was the most frequent presenting symptom, followed by melena, hematochezia, and syncope. Hematemesis occurred slightly more often in the composite outcome group, while melena occurred more frequently in the non-composite outcome group. The physiologic and laboratory values were significantly worse in the composite outcome group. Bleeding ulcer was the most frequent diagnosis, followed by bleeding varices, hemorrhagic gastritis, bleeding tumor, and Mallory-Weiss syndrome. Bleeding varices were the most frequent diagnosis in the composite outcome group, whereas bleeding ulcers were the most frequent diagnosis in the non-composite outcome group. Endoscopic variceal ligation was the most frequent endoscopic treatment.
The mean time to endoscopy was 11.5±17.9 hours, and there was no significant difference between the composite outcome and non-composite outcome groups. In the entire cohort, the mean values were 7.1±3.9 for the NEWS+L score, 3.0±1.9 for the PERS, 9.0±4.0 for the GBS, and 1.1±1.0 for the AIMS65 score. All scores were worse in the composite outcome group. The mean number of packs of RBCs used for transfusions was 1.5±1.0. The composite outcome group received more RBC transfusions (2.6±0.7) than the non-composite outcome group (1.4±0.9, P<0.001). In the entire cohort, 396 patients (74.7%) were admitted to the ward, and 42 (10.6%) of those patients were in the composite outcome group. Radiologic intervention was more frequent in the composite outcome group (20.3% vs. 6.2%, P<0.01). There were no significant differences in the time to intervention and time to operation.
Quartile graphing was used for the easy presentation of the distribution and accordant composite outcomes of the risk scores, except the AIMS65 score (Fig. 2). It was difficult to divide the AIMS65 scores into quartiles due to the inherent nature of this narrow score spectrum. Instead of quartiles, the data presentation of the AIMS65 scores was performed as scores of 0, 1, 2, and ≥3. The proportion of the primary outcome (composite outcome) for the NEWS+L score were 0.8% for the 1st quartile (1Q), 9.0% for 2Q, 11.1% for 3Q, and 23.8% for 4Q. For the PERS, the corresponding proportions were 4.8%, 8.2%, 15.7%, and 18.9%, respectively. For the GBS, they were 2.3%, 7.4%, 17.2%, and 20.4%, respectively. For the AIMS65, the outcomes were 3.1% for 0 points, 4.4% for 1 points, 25.2% for 2 points, and 29.8% for ≥3 points. For patients classified as 1Q of the NEWS+L score, there were no in-hospital deaths or ICU admissions.
The NEWS+L score showed a moderately (0.3 to 0.5) positive association with other risk scores (with PERS: Spearman’s correlation coefficient r=0.414, P<0.001; with GBS: r=0.407, P<0.001; and with AIMS65: r=0.300, P<0.001).
As measured by the Hosmer-Lemeshow chi-square test for assessment of the goodness of fit for the predicted composite outcome, the observed probability and predicted probability measured by the NEWS+L score were not significantly different (Hosmer-Lemeshow χ2=135.75, 10 degrees of freedom, P=0.538), and neither were those for the PERS (Hosmer-Lemeshow χ2=2.94, 10 degrees of freedom, P=0.916) and GBS (Hosmer-Lemeshow χ2=22.23, 10 degrees of freedom, P=0.176). Only the AIMS65 score showed a significant difference (Hosmer-Lemeshow χ2=11.30, 10 degrees of freedom, P=0.010).
Fig. 3 shows the discriminative power of the risk scores evaluated by the AUROC analysis. For the composite outcome, the AUROC value of the NEWS+L score was the highest among the risk scores (0.76; 95% CI, 0.70 to 0.82), and was significantly higher than that of the PERS (0.66; 95% CI, 0.59 to 0.73; P=0.004); however, there was no significant difference compared to the GBS (0.70; 95% CI, 0.64 to 0.77; P=0.141) and AIMS65 score (0.76; 95% CI, 0.70 to 0.83; P=0.999). For the in-hospital death and ICU admission rates, the AUROC values of the NEWS+L score were also the highest and these were significantly higher or comparable to those of the other risk scores. For RBC transfusions ≥5 packs within 24 hours, the AUROC value of the AIMS65 was the highest, despite no significant difference being observed compared with that of the NEWS+L score. For all primary and secondary outcomes, the NEWS+L score showed a better discriminative power than the NEWS score alone.
Table 3 shows the diagnostic performances of the risk scores, including the sensitivity, specificity, positive predictive value, negative predictive value, positive likelihood ratio, and negative likelihood ratio. Generally, high sensitivity is required for the risk scores in order to identify the low-risk group for a certain outcome. The cut-off points for the PERS, GBS, and AIMS65 scores were determined according to the low-risk group point for each risk score. However, the NEWS+L score did not report any specific point for the low-risk group, only patients with a NEWS+L score of 7 or more were regarded as the high-risk group. Therefore, we tested various cut-off points that discriminated a similar number of patients for the low-risk group compared to other risk scores. The sensitivities of NEWS+L scores of 3, 4, and 5 for the composite outcome were 100%, 98.3%, and 96.6%, respectively. The sensitivity of a GBS of 0 was the same as the sensitivity of a NEWS+L score of 4, but the GBS discriminated only 13 patients (2.5%) while the NEWS+L score discriminated 92 patients (17.4%) as the low-risk group. The sensitivities of the PERS and AIMS65 score were lower than that of the NEWS+L score.
Logistic regression analyses were performed to investigate the associations between the laboratory variables and the primary and secondary outcomes (Table 4). When tested as a continuous variable, serum lactate was a powerful predictor for most outcomes (adjusted odds ratio [AOR] 1.22 [1.11 to 1.34], P<0.001 for the composite outcome; 1.16 [1.04 to 1.30], P=0.008 for in-hospital death; and 1.24 [1.12 to 1.37], P<0.001 for RBC transfusions requiring ≥5 packs within 24 hours) except for ICU admissions (1.13 [0.99 to 1.29], P=0.071). Only serum albumin was significantly associated with all outcomes. Similar results were obtained when tested as categorical variables (AOR of lactate ≥4 mmol/L, 6.60 [2.94 to 14.82], P<0.001 for the composite outcome; AOR of lactate ≥4 mmol/L, 11.10 [2.77 to 44.40], P=0.001 for in-hospital death; AOR of lactate ≥4 mmol/L, 5.09 [1.11 to 23.38], P=0.036 for the rate of ICU admission; and AOR of lactate ≥4 mmol/L, 5.98 [2.29 to 15.62], P<0.001 for RBC transfusions requiring ≥5 packs within 24 hours). No variable except lactate was significantly associated with the outcomes.
DISCUSSION
In this study, the NEWS+L score showed better predictive value for the composite outcome of in-hospital mortality, ICU admission, and RBC transfusions requiring ≥5 packs within 24 hours than the PERS, and comparable predictive value to the GBS and AIMS65 score among patients with UGIB. For the secondary outcomes, the NEWS+L score also showed better or comparable predictive values over the other risk scores. The sensitivities of NEWS+L scores of 3, 4, and 5 were 100%, 98.3%, and 96.6%, respectively, which were higher than those of the other risk scores. The serum lactate level was a significant predictor for outcomes even after adjusting for other laboratory variables.
The Rockall score was developed in 1996 to identify patients at low risk for further bleeding or death; this allowed those patients to be discharged without waiting for a diagnostic endoscopy [10]. The complete Rockall score includes age; the presence of shock, as measured by the pulse rate and systolic blood pressure; comorbidities; diagnosis; and endoscopic stigmata of recent hemorrhage, while the PERS only includes age, presence of shock, and comorbidities. In the original cohort, patients who scored 2 or less were regarded as being at low-risk for re-bleeding (4.3%) and death (0.1%). Of note, no laboratory markers are included in the Rockall score, which may be due to the limited use of laboratory machines in previous decades.
The GBS was developed in 2000 to identify a patient’s need for treatment [11]. However, the outcome is a mixture of a treatment portion, which includes any blood transfusion or any operative or endoscopic intervention, and another portion, which includes a substantial decrease in the hemoglobin level, re-bleeding, or death. In addition, the pulse rate, systolic blood pressure, comorbidities, presenting symptoms, and blood urea and hemoglobin levels are included in the GBS. This score showed an excellent discriminative performance (AUROC 0.92) in the original cohort [11]. In a previous meta-analysis, a GBS of 0 was associated with a low likelihood (0.02) of the need for urgent endoscopic intervention [21].
The AIMS65 score developed in 2011 was designed to provide an easy calculation for in-hospital mortality prediction [12]. The AIMS65 score includes albumin levels, the INR, (altered) mental status, systolic blood pressure, and age older than 65 years. In the original cohort, the AUROC was 0.80 for in-hospital mortality; among patients with an AIMS65 score of 0, 0.3% were deceased.
However, a recent study reported that the aforementioned three scores showed only fair to poor discriminative value for 30-day mortality. Moreover, the PERS and AIMS65 score showed poor discriminative value for the need for transfusions [15]. Similarly, Robertson et al. [22] reported that the GBS and PERS showed fair discriminative values for mortality and that the three scores showed fair to poor discriminative values for ICU admission.
In addition to the components of the aforementioned risk scores, researchers have revealed other factors that are associated with mortality among patients with UGIB. Most of those factors are related to patient co-morbidities, including glucocorticosteroid use [23], smoking, alcohol consumption, chronic obstructive pulmonary disease [24], diabetes mellitus and malignancy [25]. Meanwhile, laboratory findings have not been vigorously tested to identify potential risk factors among patients with UGIB.
In 2014, Shah et al. [15] reported that hyperlactatemia (≥4 mmol/L) showed an odds ratio of 6.4 for in-hospital mortality among 1,644 patients with gastrointestinal bleeding. In addition, after controlling for age, initial hematocrit, and heart rate, every 1-point increase in lactate had an odds ratio of 1.4 for mortality. In 2015, El-Kersh et al. [16] reported that the AUROC of lactate was 0.802 to predict in-hospital mortality among 133 patients with UGIB who were admitted to the ICU. The AOR of lactate levels above 2.1 mmol/L was 8.2.
Consistent findings with previous report were confirmed in the present study. In the multivariable logistic regression analyses, the serum lactate level remained a significant predictive factor for in-hospital death, ICU admission, and the need for ≥5 packs of RBC transfusion. However, no other laboratory markers were identified as significant factors.
In this context, we investigated the predictive value of the NEWS+L score, which includes the serum lactate level, among patients with UGIB and compared the performance of the NEWS+L score to traditional risk scores, namely the PERS, GBS, and AIMS65 score. Originally, the NEWS+L score was designed from data of critically ill medical patients and showed comparable or better predictive performance for in-hospital mortality over the Acute Physiology And Chronic Health Evaluation II (APACHE II), Simplified Acute Physiology Score (SAPS) II, and SAPS III [17]. The NEWS+L score performed better than the Trauma and Injury Severity Score (TRISS) in blunt trauma patients [26] and was comparable to the Pneumonia Severity Index (PSI) in community-acquired pneumonia patients [27]. Additionally, the NEWS+L score showed an excellent discriminative value for predicting 2-day mortality in general ED patients [28]. As expected, the NEWS+L score showed better or comparable prediction of the primary and secondary outcomes among patients with UGIB than the other risk scores in the present study.
The serum lactate level is increased via anaerobic glycolysis, which is representative of tissue hypoxia. Decreased hemoglobin levels can lead to reduced arterial oxygen content. When the cardiac output is unable to compensate for the decreased arterial oxygen content, the patient will inevitably face tissue hypoxia. Furthermore, even normal ranges of hematocrit and vital signs do not convey the absence of tissue hypoxia among patients with UGIB. Actually, Shah et al. [15] reported the occurrence of hyperlactatemia despite normal hematocrit and heart rate values. This finding implied that hyperlactatemia may be a first presenting sign among clinically available parameters in UGIB. Other mechanisms for hyperlactatemia in sepsis, such as mitochondrial dysfunction, increased aerobic glycolysis via tissue cytokine-mediated glucose uptake, and catecholamine-enhanced Na-K skeletal-muscle pump activity, do not seem to be directly linked to UGIB; however, further detailed research may be needed.
Endoscopy is a major treatment for UGIB. However, endoscopy utilizes substantial hospital resources and tends to be limited after work hours. In the United Kingdom, only 52% of hospitals performed emergency endoscopy after work hours [29]. Therefore, for emergency physicians, it is important to discriminate patients who are at a low-risk for an adverse outcome and for whom the emergent call to a gastroenterologist can be postponed. To date, traditional clinical scores seem to play a given role for this purpose. However, 20 and 15 years have passed since the PERS [10] and GBS [11] were introduced, respectively. In the future, we expect that the NEWS+L score could be used to identify low-risk groups among patients with UGIB.
The present study has some limitations. First, this was a retrospective, observational, single center study. The inherent weaknesses relating to the study design should be addressed. Uncertainty for generalizability to other settings such as different hospitals, regions, or nations is a major concern that should be noted. However, while there is a report that patient-related factors are associated with poorer outcomes, surrounding factors such as country, the size of the hospital, or the profile of the team managing the event have not been shown to be associated with those poor outcomes [30]. This previous report favors the possibility that the results of this study could be replicated in another setting. The risk for impairments in data collection, biases, or a lack of randomly distributed exposure should also be considered. However, most of the variables in this study are objective and easy to retrieve, and possible biases were avoided whenever possible. Finally, it should be noted that the data abstractor was not blinded to the outcome.
In conclusion, the NEWS+L score showed better discriminative performance than the PERS, and comparable discriminative performance to both the GBS and AIMS65 values for the composite of in-hospital death, ICU admission, and requiring RBC transfusion of ≥5 packs within 24 hours. Thus, the NEWS+L score may be used to identify the low-risk group among patients with UGIB.
Notes
No potential conflict of interest relevant to this article was reported.
References
Article information Continued
Notes
Capsule Summary
What is already known
Recently, the serum lactate level has been shown to be a significant predictor of mortality in upper gastrointestinal bleeding (UGIB). However, the traditional risk scores that are currently used for UGIB do not contain the serum lactate level as one of their components.
What is new in the current study
The National Early Warning Score+Lactate score showed better discriminative performance than the pre-endoscopic Rockall score and comparable discriminative performance to the Glasgow-Blatchford score and albumin, international normalized ratio, altered mental status, systolic blood pressure, age older than 65 years score for the composite of in-hospital death, intensive care unit admission, and requiring red blood cell transfusion of ≥5 packs within 24 hours.