Abstract
High-throughput RNA-seq has revolutionized the process of small RNA (sRNA) discovery, leading to a rapid expansion of sRNA categories. In addition to previously well-characterized sRNAs such as miRNAs, piRNAs and snoRNAs, recent emerging studies have spotlighted on tsRNAs (tRNA-derived small RNAs) and rsRNAs (rRNA-derived small RNAs) as new categories of sRNAs that bear versatile functions. Since existing software and pipelines for sRNA annotation are mostly focusing on analyzing miRNAs or piRNAs, here we developed SPORTS1.0 (small RNA annotation pipeline optimized for rRNA- and tRNA-derived small RNAs), which is optimized for analyzing tsRNAs and rsRNAs from sRNA-seq data, also with the capacity to annotate canonical sRNAs such as miRNAs and piRNAs. In addition, SPORTS1.0 can predict potential RNA modification sites basing on nucleotide mismatches within sRNAs. SPORTS1.0 is precompiled to annotate sRNAs for a wide range of 68 species across bacteria, yeast, plant and animal kingdoms additional species for analyses could be readily expanded upon end users’ input. As an example, SPORTS1.0 revealed distinct tsRNA and rsRNA signatures from different mice tissues/cells; and discovered that tsRNAs bear the highest mismatch rate compared with other sRNA species, which is consistent with their highly modified nature. SPORTS1.0 is an open-source software deposited at https://github.com/junchaoshi/sports1.0.