PT - JOURNAL ARTICLE AU - Joseph Kaplinsky AU - Ramy Arnaout TI - Robust Estimates of Overall Immune-Repertoire Diversity from High-Throughput Measurements on Samples AID - 10.1101/024612 DP - 2016 Jan 01 TA - bioRxiv PG - 024612 4099 - http://biorxiv.org/content/early/2016/03/02/024612.short 4100 - http://biorxiv.org/content/early/2016/03/02/024612.full AB - The diversity of a person’s B- and T-cell repertoires is both clinically important and a key measure of immunological complexity. However, diversity is hard to estimate by current methods, due to inherent uncertainty in the number of B- and T-cell clones that will be missing from a blood or tissue sample by chance (the missing-species problem), inevitable sampling bias, and experimental noise. To solve this problem we developed Recon, a modified maximum-likelihood method that outputs the overall diversity of a repertoire from measurements on a sample. Recon outputs accurate, robust estimates by any of a vast set of complementary diversity measures, including species richness and entropy, at fractional repertoire coverage. It also outputs error bars and power tables, allowing robust comparisons of diversity between individuals and over time. We apply Recon to in silico and experimental immune-repertoire sequencing datasets as proof of principle for measuring diversity in large, complex systems.