TY - JOUR T1 - An Empirical Analysis of Topic Modeling for Mining Cancer Clinical Notes JF - bioRxiv DO - 10.1101/062307 SP - 062307 AU - Katherine Redfield Chang AU - Xinghua Lou AU - Theofanis Karaletsos AU - Christopher Crosbie AU - Stuart Gardos AU - David Artz AU - Gunnar Rätsch Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/07/06/062307.abstract N2 - Using a variety of techniques including Topic Modeling, Principal Component Analysis and Bi-clustering, we explore electronic patient records in the form of unstructured clinical notes and genetic mutation test results. Our ultimate goal is to gain insight into a unique body of clinical data, specifically regarding the topics discussed within the note content and relationships between patient clinical notes and their underlying genetics. ER -