TY - JOUR T1 - Linked annotations: a middle ground for manual curation of biomedical databases and text corpora JF - bioRxiv DO - 10.1101/014274 SP - 014274 AU - Tatyana Goldberg AU - Shrikant Vinchurkar AU - Juan Miguel Cejuela AU - Lars Juhl Jensen AU - Burkhard Rost Y1 - 2015/01/01 UR - http://biorxiv.org/content/early/2015/01/23/014274.abstract N2 - Annotators of text corpora and biomedical databases carry out the same labor-intensive task to manually extract structured data from unstructured text. Tasks are needlessly repeated because text corpora are widely scattered. We envision that a linked annotation resource unifying many corpora could be a game changer. Such an open forum will help focus on novel annotations and on optimally benefiting from the energy of many experts. As proof-of-concept, we annotated protein subcellular localization in 100 abstracts cited by UniProtKB. The detailed comparison between our new corpus and the original UniProtKB annotations revealed sustained novel annotations for 42% of the entries (proteins). In a unified linked annotation resource these could immediately extend the utility of text corpora beyond the text-mining community. Our example motivates the central idea that linked annotations from text corpora can complement database annotations. ER -