Abstract
Background Identification of symptoms is challenging with surveys, which are time-intensive and low-throughput. Natural language processing (NLP) could be utilized to identify symptoms from narrative documentation in the electronic health record (EHR).
Methods We utilized NLP to parse notes for maintenance hemodialysis (HD) patients from two EHR databases (BioMe and MIMIC-III) to identify fatigue, nausea/vomiting, anxiety, depression, cramping, itching, and pain. We compared NLP performance with International Classification of Diseases (ICD) codes and validated the performance of both NLP and codes against manual chart review in a representative subset.
Results We identified 1034 and 929 HD patients from BioMe and MIMIC-III, respectively. The symptoms most frequently identified by NLP in both cohorts were fatigue, pain, and nausea and/or vomiting. NLP was significantly more sensitive than ICD codes for nearly all symptoms. In the BioMe dataset, sensitivity for NLP ranged from 0.85-0.99 vs. 0.09-0.59 for ICD codes. In the MIMIC-III dataset, NLP sensitivity was 0.80-0.98 vs. 0.02-0.53 for ICD. ICD codes were significantly more specific for nausea and/or vomiting (NLP 0.57 vs. ICD 0.97, P=0.03) in BioMe and for depression (NLP 0.67 vs. ICD 0.99, P=0.002) in MIMIC-III. A majority of patients in both cohorts had ≥4 symptoms. The more encounters available for a patient, the more likely NLP was to identify a symptom.
Conclusions NLP outperformed ICD codes for the identification of symptoms on several test parameters, including sensitivity for a majority of symptoms. NLP may be useful for the high-throughput identification of patient-centered outcomes from the EHR.
Significance Statement Patients on maintenance hemodialysis experience a high frequency of symptoms. However, symptoms have traditionally been measured with time-intensive surveys. This paper compares natural language processing (NLP) with administrative codes for the identification of seven key symptoms from two cohorts with electronic health records, with validation by manual chart review. NLP identified high rates of symptoms; the most common were fatigue, pain, and nausea and/or vomiting. A majority of patients had ≥4 symptoms. NLP was significantly more sensitive at identifying symptoms than administrative codes for nearly all symptoms, while specificity was not significantly different. This paper demonstrates the utility of a high-throughput method of identifying symptoms from the EHR, which may advance the field of patient-centered research in nephrology.
Introduction
In the U.S. there are over 450,000 patients on hemodialysis.1 As mortality has decreased by over 30% over the past decade, improving the quality of life of HD patients has become a clinical and research priority. Symptom burden is extremely high in HD patients; prior published survey data indicate that patients report a median of 9 symptoms over a seven-day period.2 A recent initiative, the Standardized Outcomes in Nephrology (SONG)-HD, has identified outcomes important not only to physicians but also to patients.3 The top tier includes fatigue, cardiovascular disease, vascular access, and mortality. Middle and lower tier outcomes include symptoms such as pain, depression, anxiety, and cramps. While cardiovascular disease and mortality are easily tracked and identified, symptoms such as fatigue and depression are more difficult to identify and usually require prospective surveys of patients. However, this method is low-throughput and time-consuming (many surveys exceed 30 questions), and it only provides a view of symptom burden at the time of survey administration.4,5
Electronic health records (EHRs) have been widely implemented in most major hospital systems and dialysis units.6 Granular clinical information for patient-centered care is routinely collected in EHRs. For example, during a hemodialysis treatment, patients are regularly observed for adverse signs and symptoms by nurses, technicians, and physicians. Additionally, they are under the care of a nutritionist and a social worker. All of these encounters are routinely documented in EHRs as “free text” in progress notes and only infrequently as structured data such as International Classification of Diseases (ICD) codes.7 While analysis of free-text progress notes has traditionally been done via manual chart review, the advent of natural language processing (NLP) makes high-throughput, rapid identification of symptoms from progress notes possible.
We undertook this study to determine the ability of NLP to retrospectively identify symptoms in HD patients from the EHR. We then compared the performance of NLP and ICD identification against manual chart review.
Methods
Study Population
From an original cohort of 38,575 participants in the Charles Bronfman Institute of Personalized Medicine BioMe Biobank at the Icahn School of Medicine at Mount Sinai, we included patients with end stage renal disease (ESRD) who were on HD. The BioMe Biobank is a prospective registry of racially and ethnically diverse patients who are recruited from primary care and subspecialty clinics in the Mount Sinai Healthcare System. Participants have consented to having their EHR data available for biomedical research, and linkage has been performed with the United States Renal Data System (USRDS) to ascertain dialysis status. The institutional review board approved the BioMe protocols, and informed consent was obtained from all subjects.
We retrieved all clinical notes of BioMe participants available from the centralized DataMart up to December 31, 2017. HD patients were identified as patients with ESRD according to the USRDS, excluding patients who received a kidney transplant and did not have a first dialysis date. Peritoneal dialysis (PD) patients were excluded using ICD-9 and ICD-10 codes, as PD and HD procedures are markedly different and associated with different symptoms and core outcomes (Supplemental Table 1).
Additionally, we utilized the Medical Information Mart for Intensive Care (MIMIC-III) database to identify HD patients who were admitted to the intensive care unit.8 MIMIC-III is a freely accessible critical care database of patients from a large, single-center tertiary care hospital (Beth Israel Deaconess Medical Center in Boston, Massachusetts) spanning 2001 to 2012.8 This database includes patient demographics, billing codes, radiology reports, progress notes, and discharge summaries in deidentified form. We reviewed all clinical notes from the MIMIC-III database. As MIMIC-III is a de-identified cohort, no linkage to the USRDS could be done. Instead, ESRD patients were identified as those who had an ESRD code and a code for a dialysis procedure or diagnosis. PD patients were excluded using PD procedure codes (Supplemental Table 1). As MIMIC-III is a de-identified, publicly available database, evaluation of data from this source was considered IRB exempt.
Study Design
We utilized the CLiX NLP engine produced by Clinithink (London, UK) to parse documents from HD patients in the BioMe Biobank and in MIMIC-III. No restriction was placed on the number or types of notes. CLiX NLP is NLP software that parses free text and matches it to SNOMED clinical terms.9 SNOMED is a comprehensive healthcare terminology resource that is used in over fifty countries around the world. SNOMED has an inherent hierarchy consisting of overarching concepts (parent terms) that encompass more specific concepts (child terms). Supplemental Figure 1 includes an example of how “cramp” is represented in the SNOMED hierarchy; a search for cramps would also identify the seven child terms bathing cramp, cramp in limb, cramp in lower limb, hand cramps, heat cramps, recumbency cramps, and stomach cramps. CLiX NLP is also equipped to handle typographical errors, sentence context, and negation.
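The hierarchy expansion described above can be sketched in a few lines of Python; the `CHILDREN` map below is a hypothetical miniature of the SNOMED is-a relations for “cramp” (real SNOMED CT concepts carry numeric identifiers and many more relations), illustrating the idea rather than the CLiX implementation:

```python
# Hypothetical miniature of the SNOMED is-a hierarchy for "cramp".
CHILDREN = {
    "cramp": ["bathing cramp", "cramp in limb", "hand cramps",
              "heat cramps", "recumbency cramps", "stomach cramps"],
    "cramp in limb": ["cramp in lower limb"],
}

def descendants(concept: str) -> set[str]:
    """Collect a concept and all of its child terms recursively,
    so a query for the parent also matches text mapped to any child."""
    found = {concept}
    for child in CHILDREN.get(concept, []):
        found |= descendants(child)
    return found
```

With this toy hierarchy, `descendants("cramp")` returns the parent plus its seven child terms, mirroring the example in Supplemental Figure 1.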
We selected clinical outcomes from all tiers of outcomes identified by the SONG-HD initiative.3 Specifically, we queried for the following: fatigue, depression, pain, nausea and/or vomiting, anxiety, cramps, and itching. The list of SNOMED concepts and child terms is included in Supplemental Table 2. These terms were selected because they cannot readily be identified from structured data, as opposed to outcomes such as hospitalizations, mortality, and dialysis adequacy, which can be identified using administrative codes or other structured data.
CLiX NLP mapped text from clinical notes to the corresponding SNOMED clinical terms. Identification was performed first at the document level and then at the patient level. For fatigue, pain, nausea and/or vomiting, anxiety, cramps, and itching, NLP identification in at least one note was considered test positive. For depression, as the disease is more chronic in nature, NLP identification in at least two notes on at least two different dates was necessary to be considered test positive. We performed two iterations of NLP parsing, with manual chart review guiding the second iteration. We rectified errors in identification in the NLP engine prior to the final parsing. Examples included phrases such as “The patient was advised to call for any fever or for prolonged or severe pain or bleeding” and “EKG sinus tach with V4, V5 depressions”. We modified the NLP algorithm to recognize these as negative expressions. We report results in this manuscript from the final NLP query.
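The patient-level positivity rule can be expressed as a short function; `patient_test_positive` and `hit_dates` (the dates of notes in which NLP matched the symptom) are hypothetical names illustrating the rule, not the CLiX implementation:

```python
from datetime import date

def patient_test_positive(symptom, hit_dates):
    """Return True if a patient is test positive for a symptom.

    Most symptoms require an NLP match in at least one note; depression,
    being more chronic, requires matches in at least two notes on at
    least two different dates.
    """
    if symptom == "depression":
        return len(hit_dates) >= 2 and len(set(hit_dates)) >= 2
    return len(hit_dates) >= 1

# Two depression mentions on the same date do not qualify,
# but a single mention of pain does.
```

The `set` deduplicates dates, so two depression notes written on the same day are not sufficient for test positivity.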
We compared the performance of ICD-CM codes with the results obtained from CLiX NLP. ICD-CM codes were chosen as described in prior literature when available and through extensive physician review of ICD-CM codes when not available.10–12 ICD-9 and ICD-10 codes were used in BioMe, while only ICD-9 codes were available in MIMIC-III (Supplemental Table 3). Finally, both methods were compared with independent chart review by two physicians (LC and KB). When there was disagreement between manual validations for a patient, joint review of the patient’s chart was performed until consensus was reached.
As an additional validation of our NLP results, specifically for the depression query, we identified patients who had undergone Patient Health Questionnaire-9 (PHQ-9) screening, since this is a common survey-based instrument administered to dialysis patients.13,14 We considered depression screening positive if patients scored ≥10. If scores were <10 or there was discrepancy between depression as identified by PHQ-9, ICD, or NLP, additional manual chart review was done to identify evidence of a history of depression (e.g., cognitive behavioral therapy, antidepressant medications, or prior suicide attempts). For patients with multiple PHQ-9 scores, the highest score was used.
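The PHQ-9 screening rule reduces to taking the maximum of a patient's scores and comparing it against the cutoff; `phq9_positive` is a hypothetical helper sketching that rule:

```python
def phq9_positive(scores):
    """Screen positive for depression if the highest of the patient's
    PHQ-9 scores is at or above the cutoff of 10 used in this study."""
    return max(scores) >= 10
```

Patients with all scores below 10 would then fall through to the manual chart review for a clinical history of depression, as described above.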
Statistical Analyses
We calculated the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1-score of NLP and ICD-9/10 codes. For cells of the 2×2 table where the value was 0, we entered 0.5 to allow calculation of test statistics. We compared estimates of sensitivity, specificity, and F1-score using the McNemar test, with significance set at a two-sided P value of <0.05. We compared NPV and PPV using the generalized score statistic.15 All analyses were done using SAS version 9.4 (SAS Institute, Cary, NC).
Results
Patient Characteristics
Out of 1152 patients with ESRD identified by the USRDS in BioMe, we identified 1034 (90%) patients receiving maintenance HD. These HD patients had a mean age of 63.6±13.3 years, 42% were women, and 42% self-reported as African American. As expected of HD patients, there was a high prevalence of diabetes (51%), hypertension (85%), coronary artery disease (53%), and congestive heart failure (31%) (Table 1). The median number of encounters was 109 (interquartile range [IQR] 41-241).
From MIMIC-III, we identified 929 HD patients utilizing ICD-9 codes. The mean age of patients was 67.4±37 years, 41% were women, and 63% self-reported as white. Prevalence rates of chronic comorbidities were similarly high: diabetes (59%), hypertension (92%), coronary artery disease (49%), and congestive heart failure (57%) (Table 1). Encounter analysis could not be done in MIMIC-III, as the database only included encounters with ICU stays and nearly 80% of patients had only one ICU admission.
Symptom Identification using NLP vs Administrative Codes
In BioMe HD patients, NLP identified symptoms more frequently than did ICD-9 and ICD-10 codes (Figure 1A). The most frequent symptoms identified were pain (NLP 93% vs. ICD 46%, P<0.001), fatigue (NLP 84% vs. ICD 41%, P<0.001), and nausea and/or vomiting (NLP 74% vs. ICD 19%, P<0.001). The symptoms identified most frequently by both NLP and ICD codes were pain (45%), fatigue (40%), and depression (19%).
In the MIMIC-III cohort, NLP again identified symptoms more commonly than ICD-9 codes (Figure 1B). The symptoms with the highest prevalence according to NLP were pain (NLP 96% vs. ICD 6%, P=0.16), fatigue (NLP 70% vs. ICD 41%, P<0.001), and nausea and/or vomiting (NLP 63% vs. ICD 19%, P<0.001). ICD-9 codes were best able to identify depression (17%), and no ICD-9 code for cramps was found in MIMIC-III.
Manual Chart Validation
Overall, NLP was superior to ICD for identifying symptoms in both cohorts. In the BioMe dataset, sensitivity for NLP ranged from 0.85 to 0.99, while sensitivity for ICD ranged from 0.09 to 0.59. In the MIMIC-III dataset, sensitivity for NLP ranged from 0.80 to 0.98, while sensitivity for ICD ranged from 0.02 to 0.53 (Table 2A/B).
However, specificity was highly variable. In the BioMe dataset, specificity for NLP ranged from 0.5 to 0.96, while specificity for ICD ranged from 0.5 to 0.98. In the MIMIC-III dataset, specificity for NLP ranged from 0.33 to 0.96, while for ICD it ranged from 0.86 to 0.99. ICD codes were more specific for nausea and/or vomiting (NLP 0.57 vs. ICD 0.97, P=0.03) in BioMe and more specific for depression (NLP 0.67 vs. ICD 0.99, P=0.002) in MIMIC-III (Table 2A/B).
Twenty-five patients were identified by NLP as having undergone PHQ-9 depression screening. While 3 patients had PHQ-9 scores <10, 2 of the 3 had a clinical history of depression (active group therapy, inpatient psychiatric admissions for depression, or prior suicide attempts) but were receiving adequate treatment, resulting in low PHQ-9 scores. The last patient had no evidence (ICD, NLP, or chart review) of depression. Of the 24 patients who were depression positive by PHQ-9 and/or clinical history, NLP correctly identified 22 (92%) patients while ICD-9/10 codes identified 20 (83%) patients.
Symptom Burden
In the BioMe cohort, symptom burden was high among HD patients. NLP identified at least 1 symptom in 96% of patients and 4 or more symptoms in 50% of patients (Figure 2A). The number of symptoms identified increased with the number of encounters in BioMe. Patients who did not have any symptoms identified by NLP had a median of 7 encounters (IQR 1-32), while patients with all 7 symptoms had a median of 230 encounters (IQR 141-419) (Figure 3). In MIMIC-III, NLP identified at least 1 symptom in 97% of patients and 4 or more symptoms in 48% of patients (Figure 2B). Encounter analysis could not be performed for MIMIC-III.
Discussion
High-throughput retrospective assessment of symptoms in HD patients from the EHR is difficult. NLP is one potential solution to this problem. We demonstrated that NLP had better sensitivity than ICD codes at identifying seven symptoms, with validation across two different cohorts.3 The symptom burden was high, with a majority of patients having 4 or more symptoms. Finally, identification of symptoms was highly dependent on the number of encounters that HD patients had.
As the care of HD patients is improving, focus has shifted to improving how patients feel (i.e., patient-centered outcomes). The SONG-HD initiative has identified several key outcomes important to all stakeholders (patients and physicians) and has emphasized the importance of clinical research that includes these symptoms as both predictors and outcomes. Prior research that employed patient-centered outcomes as endpoints has required prospective surveys for its execution.2,16 This approach is labor-intensive and only allows assessment of symptom burden at the time of the survey. By using NLP, notes can be processed in a high-throughput manner. In addition, benchmarking and reporting these patient-centered outcomes across multiple dialysis providers could provide a unique opportunity to improve clinical practice.
There are currently few studies in nephrology that have utilized NLP. The predominant use has been the identification of risk factors for progression of chronic kidney disease (CKD) and the identification of CKD from the EHR.17–22 Additionally, the studies that have utilized NLP to identify risk factors for CKD progression included few symptoms or patient-centered outcomes in their models. Prior studies in other chronic diseases, such as prostate cancer, have demonstrated the utility of NLP to identify the patient-centered outcomes of urinary incontinence and erectile dysfunction from the EHR.23 Our study supports the use of NLP for identification of patient-centered outcomes in HD patients, given the higher sensitivity of NLP compared with identification by ICD codes.
We found that the overall symptom prevalence identified in the BioMe cohort by NLP is similar to prior published survey data on symptoms; for example, the prevalence of fatigue was reported to be 69-87% in the literature, and we found a prevalence of 84%.16,24,25 Certain symptoms were less commonly found, such as itching (NLP 48% vs. literature 52-70%) and cramps (NLP 45% vs. literature 43-74%), while other symptoms, such as nausea and/or vomiting (NLP 74% vs. literature 26-35%), were more commonly found. These differences are likely due to differences in cohorts and settings.
While surveys are administered to patients who are stable at their outpatient hemodialysis centers, BioMe provider notes consist not only of preventative care visits but also of acute inpatient and outpatient notes in which more severe symptoms are documented. As MIMIC-III consists of progress notes from hospital visits that required an ICU admission, symptoms were identified at an even lower rate. One potential reason is that patients admitted to ICUs are critically ill and may have altered mental status or be mechanically ventilated, which prevents them from verbalizing their symptoms. Additionally, patient care and billing are often focused on the admission diagnosis and contributing comorbidities, while symptoms and psychosocial comorbidities may not be as well addressed in notes. We chose not to place limitations on the number, timing, or type of notes, which may have increased the likelihood of NLP identifying a symptom. However, the comparator measures via ICD-9/10 codes were also identified without limitations on encounters, allowing for a fair comparison.
Despite these differences in cohorts, NLP was significantly more sensitive than ICD codes for identification of nearly all symptoms in both BioMe and MIMIC-III. ICD-9/10 codes are commonly used for the identification of several disease processes from administrative data, and we found that NLP outperformed ICD-9/10 codes at identification of all symptoms in both BioMe and MIMIC-III.26–29 As ICD-9/10 codes are administrative codes, clinicians may be less inclined to use them to document symptoms experienced by HD patients. When ICD codes for symptoms were present in our data, they identified symptoms with high specificity.
While NLP was more sensitive at identifying depression, ICD codes were more specific. A substantial portion of the false positives for depression was due to the use of the word “depression” in other clinical contexts (e.g., ST-segment depressions on EKG). As there was no consistent way this was documented across notes, it could not be easily addressed in our NLP algorithm.
Our study should be interpreted in light of some limitations, including the dependence of symptom identification on the number of encounters and notes available: the more encounters available, the more likely a provider was to document a symptom. However, this is a common issue with EHR systems, in which both sicker patients and patients with longer follow-up have more data.30 Additionally, only symptoms that providers screen for are documented, and therefore NLP may miss symptoms that patients do not discuss with their providers. Neither the BioMe nor the MIMIC-III dataset is exclusive to outpatient HD patients, which makes comparison with prior published data on outpatient HD patients difficult. However, the prevalence of symptoms in our study is similar to prior published survey data.24,25 As we did not have concurrent survey data available, we used manual chart review as our gold standard. We did extract PHQ-9 survey results to further validate our findings; however, only a small portion of patients had this screening done. Finally, the sensitivity, specificity, PPV, NPV, and F1 scores were relatively consistent across the BioMe and MIMIC-III cohorts, suggesting that our NLP algorithm would have good generalizability across different medical systems.
In conclusion, we utilized NLP to identify important patient symptoms from the EHRs of HD patients in two diverse medical systems. The prevalence of symptoms identified by NLP was similar to previously published survey studies. NLP outperformed ICD codes with regard to sensitivity, NPV, and F1 score for a majority of symptoms in both cohorts. Additional refinement of the NLP algorithm and testing in the EHRs of outpatient HD units are needed to further validate our findings.
Author Contributions: LC, SC, and GNN designed the study. TVV parsed the data. LC, KC, and ND carried out the analysis. LC and KB performed the manual chart review. ND and AS made the figures and tables. All authors drafted and revised the manuscript and approved the final version of the manuscript.
Disclosures
LC is supported in part by the NIH (5T32DK007757 – 18). G.N.N. and S.G.C. are co-founders of RenalytixAI and G.N.N. and S.G.C. are members of the advisory board of RenalytixAI and own equity in the same. G.N.N has received operational funding from Goldfinch Bio. G.N.N has received consulting fees for BioVie Inc. S.G.C. has received consulting fees from Goldfinch Bio, CHF Solutions, Quark Biopharma, Janssen Pharmaceuticals, and Takeda Pharmaceuticals. G.N.N. and S.G.C. are on the advisory board for pulseData and have received consulting fees and equity in return. G.N.N. is supported by a career development award from the National Institutes of Health (NIH) (K23DK107908) and is also supported by R01DK108803, U01HG007278, U01HG009610, and U01DK116100. S.G.C. is supported by the following grants from the NIH: U01DK106962, R01DK106085, R01HL85757, R01DK112258, and U01OH011326. TTVV was part of launching Clinithink and retains a financial interest in the company.
Acknowledgements
We want to thank all participants of BioMe and MIMIC-III.