TY - JOUR T1 - A unified encyclopedia of human functional DNA elements through fully automated annotation of 164 human cell types JF - bioRxiv DO - 10.1101/086025 SP - 086025 AU - Maxwell W. Libbrecht AU - Oscar Rodriguez AU - Zhiping Weng AU - Jeffrey A. Bilmes AU - Michael M. Hoffman AU - William S. Noble Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/11/07/086025.abstract N2 - Semi-automated genome annotation methods such as Segway enable understanding of chromatin activity. Here we present chromatin state annotations of 164 human cell types using 1,615 genomics data sets. To produce these annotations, we developed a fully-automated annotation strategy in which we train separate unsupervised annotation models on each cell type and use a machine learning classifier to automate the state interpretation step. Using these annotations, we developed a measure of the functional importance of each genomic position called the “functionality score”, which allows us to aggregate information across cell types into a multi-cell type view. This score provides a measure of importance directly attributable to a specific activity in a specific set of cell types. In contrast to evolutionary conservation, this measure is not biased to detect only elements shared with related species. Using the functionality score, we combined all our annotations into a single cell type-agnostic encyclopedia that catalogs all human functional regulatory elements, enabling easy and intuitive interpretation of the effect of genome variants on phenotype, such as in disease-associated, evolutionary conserved or positively selected loci. These resources, including cell type-specific annotations, enyclopedia and a visualization server, are publicly available online at http://noble.gs.washington.edu/proj/encyclopedia/. ER -