PT - JOURNAL ARTICLE
AU - Omri Tal
AU - Tat Dat Tran
AU - Jacobus Portegies
TI - From Typical Sequences to Typical Genotypes
AID - 10.1101/079491
DP - 2016 Jan 01
TA - bioRxiv
PG - 079491
4099 - http://biorxiv.org/content/early/2016/10/13/079491.short
4100 - http://biorxiv.org/content/early/2016/10/13/079491.full
AB - We demonstrate an application of a core notion of information theory, that of typical sequences and their related properties, to analysis of population genetic data. Based on the asymptotic equipartition property (AEP) for non-stationary discrete-time sources producing independent symbols, we introduce the concepts of typical genotypes and population entropy rate and cross entropy rate. We analyze three perspectives on typical genotypes: a set perspective on the interplay of typical sets of genotypes from two populations, a geometric perspective on their structure in high dimensional space, and a statistical learning perspective on the prospects of constructing typical-set based classifiers. In particular, we show that such classifiers have a surprising resilience to noise originating from small population samples, and highlight the potential for further links between inference and communication.