PT - JOURNAL ARTICLE AU - Tatyana Goldberg AU - Shrikant Vinchurkar AU - Juan Miguel Cejuela AU - Lars Juhl Jensen AU - Burkhard Rost TI - Linked annotations: a middle ground for manual curation of biomedical databases and text corpora AID - 10.1101/014274 DP - 2015 Jan 01 TA - bioRxiv PG - 014274 4099 - http://biorxiv.org/content/early/2015/01/23/014274.short 4100 - http://biorxiv.org/content/early/2015/01/23/014274.full AB - Annotators of text corpora and biomedical databases carry out the same labor-intensive task to manually extract structured data from unstructured text. Tasks are needlessly repeated because text corpora are widely scattered. We envision that a linked annotation resource unifying many corpora could be a game changer. Such an open forum will help focus on novel annotations and on optimally benefiting from the energy of many experts. As proof-of-concept, we annotated protein subcellular localization in 100 abstracts cited by UniProtKB. The detailed comparison between our new corpus and the original UniProtKB annotations revealed sustained novel annotations for 42% of the entries (proteins). In a unified linked annotation resource these could immediately extend the utility of text corpora beyond the text-mining community. Our example motivates the central idea that linked annotations from text corpora can complement database annotations.