Dynamic marine viral infections and major contribution to photosynthetic processes shown by regional and seasonal picoplankton metatranscriptomes

Ella T. Sieradzki; J. Cesar Ignacio-Espinoza; David M. Needham; Erin B. Fichot; Jed A. Fuhrman

doi:10.1101/176644

Abstract

Viruses are an important top-down control on microbial communities, yet their direct study in natural environments has been hindered by culture limitations^1-3. The advance of sequencing and bioinformatics over the last decade has enabled the cultivation-independent study of viruses. Many studies focus on assembling new viral genomes^4-6 and studying viral diversity using marker genes amplified from free viruses^7,8. We used cellular metatranscriptomics to study community-wide viral infections at three coastal California sites throughout a year. Generation of and recruitment to viral contigs (>5 kbp, N=66) allowed tracking of infection dynamics over time and space. Here we show that while these assemblies represent viral populations, they are likely biased towards clonal or low-diversity assemblages. Furthermore, we demonstrate that published T4-like cyanophages (N=50) and pelagiphages (N=4), having genomic continuity between close relatives, are better tracked using marker genes. Additionally, we demonstrate determination of potential hosts by matching infection dynamics with microbial community composition. Finally, we quantify the relative contribution of various cyanobacteria and viruses to photosystem-II psbA expression in our study sites. We show that sometimes >50% of all cyanobacterial+viral psbA expression we observed is of viral origin, which highlights the proportion of infected cells and makes viruses a remarkable contributor to photosynthesis and oxygen production.

We sampled surface seawater in different seasons at three sites across the San Pedro Channel, California, USA: the Port of Los Angeles (POLA), Santa Catalina Island Two Harbors (CAT) and the San Pedro Ocean Time-series (SPOT). These sites represent a gradient of human impact, with POLA being the most impacted and SPOT resembling open ocean conditions. At all of these sites free virus-like particles outnumber bacteria and archaea roughly 10:1 (sup. fig. 1). We examined only the 0.2-1 μm size fraction, which includes most bacteria, archaea and some picoeukaryotes. Via assembly of metatranscriptomes, we obtained 1455 contigs longer than 5 kbp, of which 57 (3.9%) were characterized as viral using VirSorter and VirFinder (see methods). Additionally, a cross-assembly of the metatranscriptomic viral contigs with metagenomes of the same samples (N=12) yielded 9 more contigs (mean length 26,563 bp) characterized as viral. Most of the contigs represent dsDNA viruses (N=65), as apparent from their presence in metagenomes, but one appears to be an RNA virus possibly infecting a eukaryotic host. This contig contained an RNA-dependent RNA polymerase whose nearest match in the NCBI non-redundant database was marine Antarctic phytoplankton RNA virus PAL_E4⁹. These 66 viral contigs revealed varied patterns of presence (in metagenomes) and activity (in metatranscriptomes) at the three sites over a year (fig. 1).

Figure 1:

Mean coverage of 66 viral contigs across three sites (Port of LA – POLA, San Pedro Ocean Time-series – SPOT and Two Harbors – CAT) and four dates (July 2012, October 2012, January 2013 and April 2013) in metagenomes (MG) and metatranscriptomes (MT). The bar heights are normalized to the highest mean coverage within the sample. Each cell in the color bar on the bottom represents a contig and corresponds with the column above it in all samples. Mean coverage was calculated excluding contig positions in the 4th quartile of coverage depth, which can be biased by recruitment localized to a small portion of the contig (sup. fig. 3).

Active non-synchronized viral infection would manifest as recruitment to an entire contig in both the metagenome and the metatranscriptome of the same sample. We found that patterns of mean coverage of our assembled viral contigs usually differed, not just between metagenomes and metatranscriptomes but also between dates and locations, implying widespread boom-bust dynamics of infection. While some variation may be due to synchronization known for some photosynthetic and heterotrophic bacteria in the ocean^10,11 and for some of their phages¹², this explanation is less likely as samples were collected from all sites within the same 4-hour morning window.

Some regional patterns were evident, e.g. some viral contigs were unique to the Port of LA (fig. 1), and that site always clustered separately from SPOT and CAT by Bray-Curtis similarity of expression of viral contigs (sup. fig. 1B). This pattern corresponds to the difference in biotic parameters between the port and the other sites (sup. fig. 2), though the port did not cluster separately in microbial community composition by 16S-rRNA (sup. fig. 1A). The latter may reflect offshore microbes brought in with the tide but less active than port organisms. Clustering by metagenomic recruitment to viral contigs did not reveal consistent patterns by site or date (sup. fig. 1C).

Figure 2:

Metagenomic read recruitment to (A) an assembled cyanophage contig and (B) Prochlorococcus phage P-HM2 genome. Most recruitment to the assembled contig is at 99-100% identity (high density near 100% is not fully evident from the graph due to overlaps, see C), whereas P-HM2 reveals a genomic continuum. (C) Recruitment as a function of percent ID of reads demonstrates that assembled contigs mostly recruit at 100% ID and have few moderately close relatives (top) whereas published genomes of cyanophages reveal clouds of moderately close relatives but few matches near 100% (middle), and pelagiphages range from 100% down (bottom).

Ephemeral infections dominated the assembled landscape, as 56 out of 66 of the contigs appeared in only a few metatranscriptomes, presumably reflecting sporadic infections. Persistent infections (mean coverage >= 0.75x in at least 3 out of 4 samples per site, 10 out of 66) were limited to CAT and SPOT except for one that was persistent at all three sites. Moniruzzaman et al.¹³ also recently demonstrated dominance of ephemeral dynamics in infections of marine single-cell eukaryotes during an algal bloom. Bray-Curtis dissimilarity of the viral contigs within each site was 80-100%, whereas the dissimilarity of microbial communities within each site was distributed around 50-70%. High dissimilarity indicates that even within a site different viruses are actively infecting in different seasons (sup. fig. 1D+E).
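The persistence criterion above can be written as a simple rule. The following is a minimal sketch only: the function names, the dictionary layout and the coverage values are hypothetical, and only the thresholds (mean coverage >= 0.75x in at least 3 of 4 samples per site) are taken from the text.

```python
def is_persistent(coverages, min_coverage=0.75, min_samples=3):
    """Return True if enough samples from one site reach the coverage cutoff."""
    return sum(1 for c in coverages if c >= min_coverage) >= min_samples


def persistent_sites(coverage_by_site):
    """Return the sorted names of sites at which a contig counts as persistent."""
    return sorted(
        site for site, cov in coverage_by_site.items() if is_persistent(cov)
    )


# Hypothetical mean coverages of one contig (four sampling dates per site).
example = {
    "POLA": [0.0, 2.1, 0.1, 0.0],
    "SPOT": [1.3, 0.9, 0.8, 0.2],
    "CAT": [0.75, 1.0, 4.2, 3.3],
}
print(persistent_sites(example))
```

A contig with an empty list for every site would be treated as ephemeral under this rule.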

Moreover, assembled viral contigs appeared to be biased towards low-microdiversity (i.e. more clonal) viruses. High diversity, extremely common in marine microorganisms¹⁴, tends to break assemblies created with either read overlaps or de Bruijn graphs^15,16. We expect that low virus diversity could result from a boom-bust lifestyle due to bottlenecks during “bursts”. This might lead to a method bias towards ephemerally infecting viruses. Indeed, all the viral contigs we assembled in this study appear to have many nearly identical relatives but few moderately close ones as shown by recruitment plots (most recruitment at 98-100% identity and little recruitment at 90-97%, fig. 2C), while some of the published pelagiphages had recruitment along most of the genome and high mean coverage at up to 100% identity and yet did not assemble (fig. 2C, sup. table 1).

The recruitment plots also reveal a common pattern of recruitment to short fragments near 100% identity whereas the rest of the genome or contig is only recruited to at lower percentage if at all (sup. fig. 3). This pattern highlights two issues: (1) some genes are so conserved or so often laterally transferred that their partial sequences cannot be used to identify which phage is present and (2) that mean coverage of contigs could be highly biased by these conserved regions which needs to be considered when evaluating abundance of the contigs and for coverage-based binning of genomes.
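The effect of such short, heavily recruited regions on mean coverage, and the quartile-based correction described in the legend of Figure 1, can be illustrated with a small sketch. It assumes per-position coverage depths are already available as a list; the function name and the depth values are hypothetical, and the quartile definition used here (`statistics.quantiles` with the inclusive method) is one possible choice that the text does not specify.

```python
from statistics import mean, quantiles


def mean_coverage_q1_q3(depths):
    """Mean per-position depth after dropping positions above the third quartile.

    Positions whose depth equals the third quartile are kept.
    """
    q3 = quantiles(depths, n=4, method="inclusive")[2]
    kept = [d for d in depths if d <= q3]
    return mean(kept)


# Hypothetical contig: low, even coverage plus a short, heavily recruited region.
depths = [1, 1, 1, 1, 1, 1, 100, 100]
print(mean(depths))                 # plain mean, inflated by the conserved region
print(mean_coverage_q1_q3(depths))  # mean over quartiles 1-3 only
```

In this toy case the plain mean is dominated by two positions, whereas the trimmed mean reflects the coverage of the rest of the contig.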

A previous report indicated that Synechococcus phage genomes occur in discrete “clouds” with a discontinuity in recruitment below ∽95% identity¹⁷. While this pattern exists for some cyanophage genomes, and we often saw some gaps in coverage at ∽90-95% consistent with that idea (sup. fig. 3), it is by no means the rule in our data, especially for pelagiphages (fig. 2C). We also note that widely used recruitment algorithms only map reads with a local or end-to-end match at a very high percent identity, and would therefore miss much genetic diversity that may be relevant (fig. 2B).

We were surprised not to find multiple cyanophage (especially myovirus) contigs, because cyanomyoviruses belong to the family Myoviridae, whose members are some of the most common dsDNA viruses in the ocean¹⁸, and we know this region has a diverse community of myoviruses and cyanobacteria^7,14. Few of the assembled viral contigs contained myoviral marker genes (e.g. capsid protein gp23) (sup. table 2). The only assembled contig that is with high certainty from a cyanophage is a putative podovirus (see below). Recruitment of reads to published cyanophage genomes revealed the likely reason for so few such contigs: high genomic diversity (fig. 2B), which probably broke assemblies of T4-like cyanophages. We lacked assemblies despite persistent myovirus activity. We assigned translated reads identified by a Gp23 HMM (hidden Markov model) to published and assembled Gp23 proteins. Most versions of this marker gene from published genomes as well as the nine assembled Gp23 ORFs were expressed persistently throughout all sites and dates (sup. fig. 4). While the exact published genomes themselves were not present in our samples (fig. 2B), we posit that other T4-like cyanophages closely related to those published are present and persistently infecting their hosts.

Matching viral contigs and hosts is challenging, but we were able to use physiological information and distributions among samples to make a likely match. Many cyanophages contain a variety of genes that maintain photosynthetic activity in the host during infection, from “spare parts” for photosynthetic reaction centers through regulation and optimization of those apparatuses¹⁹. In particular, viruses were shown to maintain photosystem II function during infection in order to supply energy to the host, as transcription of host genes is shut down during infection and PS-II proteins have a short lifetime^20,21. Our assembled cyanophage contig contained genes coding for photosystem-II protein D1 (psbA) and high-light induced protein (hli), reportedly widespread in cyanophages⁸. The putative cyanophage from which this contig was derived was actively transcribed (presumably infecting its host) at all three sites only in October 2012 (fig. 3A). The cyanobacterial community by 16S-rRNA was dominated in October by two operational taxonomic units (OTUs): one Synechococcus and one Prochlorococcus. Both OTUs were present at SPOT and CAT in October, but only Synechococcus was also present at POLA (fig. 3B).

Thus, we propose that this assembled contig is from a phage that infects Synechococcus OTU 10, which has a 16S sequence over the amplified region 100% identical to Synechococcus CC9902 of clade IV. On a phylogenetic tree of PS-II D1, the translated PS-II D1 of this phage clustered closely with a different phage isolated on Synechococcus (sup. fig. 5).

Figure 3:

Presence and activity of the assembled cyanophage and its potential hosts in October 2012: (A) Mean coverage (quartiles 1-3) of the assembled cyanophage; (B) OTU relative abundance by 16S-rRNA of the two most abundant cyanobacterial OTUs, in order: Prochlorococcus in DNA, Prochlorococcus in RNA, Synechococcus in DNA, Synechococcus in RNA. Note the near-absence of Prochlorococcus in POLA, in contrast to Synechococcus and the phage, leading us to infer the phage infects Synechococcus.

Because viruses and hosts both code for photosynthetic functions, a comparison of viral and host-coded contributions to activity is possible. Sharon et al.²² previously showed that viral psbA genes can outnumber cyanobacterial psbA genes in metagenomes from the Mediterranean, and that viral gene expression is evident. We extended this to quantitatively partition gene expression into a bacterial contribution from Synechococcus and Prochlorococcus and a viral contribution from cyanomyoviruses and cyanopodoviruses, based on translated reads identified by HMM and placed onto our PS-II D1 phylogenetic tree. We found that psbA transcripts of T4-like cyanomyovirus origin generally accounted for roughly 50% of cyanobacterial and cyanophage psbA transcripts. Prochlorococcus transcripts were almost always comparable to the T4-like contribution. On several occasions, the viral version exceeded the cyanobacterial version in read count (fig. 4).

Figure 4:

Distribution of psbA of T4-like phages, Synechococcus, Prochlorococcus, and T7-like phages in (A) metagenomes and (B) metatranscriptomes.

We can roughly estimate the proportion of infected cyanobacteria from our psbA data and compare it to previously published estimates. For cyanobacteria in marine systems, the highest estimates of infection are roughly 50-60% infected at any given time^2,17,23,24. One consideration when calculating the proportion of infected cyanobacteria is that during host infection, the number of phage psbA mRNAs increases quickly during early infection until the phage becomes the exclusive source of psbA transcripts in the cell^20,21. Another consideration is that, regardless of their source, host or virus, the abundance of psbA transcripts is comparable in infected and uninfected cells²³. What we observe in the samples is a comparable contribution of T4-like phages and cyanobacteria (fig. 5 D) at a ratio of 1.2±0.6 (mean ± standard deviation) phage/cyanobacteria, which suggests that on average about half of the cyanobacteria are infected. This is in accordance with the high end of published estimates, confirming that infection is an important part of cyanobacterial ecology.
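The step from the observed ratio to "about half" can be made explicit under two simplifications drawn from the considerations above: every cell carries a comparable number of psbA transcripts, and in infected cells all of them are phage-derived (which ignores the early phase of infection). If f is the infected fraction, the phage/cyanobacteria transcript ratio is r = f/(1-f), so f = r/(1+r). The sketch below only encodes this arithmetic; the function name is hypothetical.

```python
def infected_fraction(phage_to_host_ratio):
    """Infected fraction f implied by a phage/host psbA transcript ratio r.

    Assumes equal psbA transcript abundance per cell and exclusively
    phage-derived psbA transcripts in infected cells, so r = f / (1 - f).
    """
    if phage_to_host_ratio < 0:
        raise ValueError("ratio must be non-negative")
    return phage_to_host_ratio / (1.0 + phage_to_host_ratio)


# Mean ratio reported in the text, plus and minus one standard deviation.
for ratio in (0.6, 1.2, 1.8):
    print(ratio, infected_fraction(ratio))
```

With the mean ratio of 1.2 this gives roughly 0.55, and the values one standard deviation away give a range of roughly 0.4 to 0.65.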

In both metagenomes and metatranscriptomes, there is minor consistent recruitment to T7-like cyanopodovirus psbA. However, in every sample the contribution of T7-like cyanopodoviruses was very low compared to that of T4-like cyanomyoviruses. This could be due to the more specific host range reported for cyanopodoviruses compared to cyanomyoviruses^25-27. As T4-like and T7-like cyanophages are reported to be strictly lytic²⁸, their presence in metagenomes results from late infection genomic copies or virions within host cells, pseudolysogeny or phages that adsorbed to cells or particles.

Extending metatranscriptomics methods as recently applied to marine eukaryotic viral infection^13,29,30, we show the power of multiple approaches to track viral infection and dynamics within the broad picoplankton community, using metatranscriptomes of the cellular fraction, with particular examples in the cyanobacteria. Use of marker genes is especially important to study viruses with many close relatives in the same environment (whose contigs assemble poorly), whereas assemblies are useful for tracking ephemeral, more clonal viruses. The observed infection dynamics can sometimes be used in combination with microbial community structure and viral marker genes found within contigs to deduce a host. Use of metagenomes and metatranscriptomes provides an insight into quantifiable viral contribution to photosynthesis and to estimating the fraction of infected cyanobacteria.

Methods

Sample collection

Surface seawater was collected by bucket on 7/15/2012, 10/19/2012, 1/9/2013 and 4/24/2013 in three locations: The Port of Los Angeles (33°42.75’N 118°15.55’W), the San Pedro Ocean Time-series (33°33.00’N 118°24.01’W) and Two Harbors, Santa Catalina Island (33°27.18’N 118°28.51’W). Duplicate samples of 20 liters were filtered in each location through an 80 μm mesh followed by a glass fiber syringe prefilter (Gelman, 4523) which collected the >1 μm size fraction and a 0.2 μm PES Sterivex filter (Millipore, SVGPB1010) which collected the free-living size fraction. RNAlater (Thermo-Fisher, AM7020) was added to each filter and filters were flash frozen no more than 5 minutes post-filtration.

Library preparation

DNA and RNA were extracted simultaneously from Sterivex filters by bead-beating followed by an AllPrep kit (Qiagen, 80204). An internal standard (ERCC RNA Spike-In Mix, Thermo-Fisher 4456740) was added into the lysate after bead-beating for quality assurance. RNA was enriched for mRNA with RiboZero (Illumina, MRZB12424). Resulting mRNA was reverse transcribed using SuperScript-III (Invitrogen, 18080-051). DNA and cDNA were sheared with Covaris m2 and size-selected for products larger than 300 bp. RNA libraries were prepared and barcoded using NEBNext Ultra Directional RNA library Prep Kit for Illumina (E74205). DNA libraries were prepared and barcoded with Ovation UltraLow Library Prep V2 (Nugen, 0344).

Metagenomes were sequenced on Illumina HiSeq 2x125 bp or 2x150 bp. Metatranscriptomes were sequenced on Illumina HiSeq 2x250 bp.

Read processing and assembly

Raw metagenomic and metatranscriptomic reads were quality trimmed and filtered with Trimmomatic³¹ version 0.33 with parameters LEADING:20 TRAILING:20 SLIDINGWINDOW:15:25. Metatranscriptomic reads were merged with PEAR³² using the default settings, and residual ribosomal reads as well as the internal standard were removed informatically. Merged reads from each sample were assembled separately with MEGAHIT.

Contigs smaller than 2 kbp from all samples were co-assembled with Newbler³³ version 2.9 (Roche) (minimum overlap 40 bp, minimum identity 99%) and contigs larger than 2 kbp from all samples were co-assembled with minimus2³⁴ (minimum overlap 40 bp, minimum identity 99%). Only contigs larger than 5 kbp were further analyzed.
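The final length cutoff is straightforward to reproduce. The following is a minimal, dependency-free sketch of such a filter on FASTA-formatted contigs; the function names and the toy records are hypothetical and this is not the script used in the study.

```python
def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def filter_by_length(records, min_length=5000):
    """Keep records whose sequence is longer than min_length bases."""
    return [(h, s) for h, s in records if len(s) > min_length]


# Toy input: one contig above the cutoff (split over two lines) and one below.
toy = [">contig_long", "A" * 3000, "C" * 3000, ">contig_short", "ACGT"]
kept = filter_by_length(read_fasta(toy))
print([(name, len(seq)) for name, seq in kept])
```

For real data the `lines` argument would simply be an open file handle.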

Identification and annotation of viral contigs

Viral contigs were identified by VirSorter³⁵ using RefSeq on the CyVerse platform and only contigs classified as category 1 or category 2 were considered. In addition, the contigs were ranked using VirFinder³⁶ (rank >= 0.95). Prodigal³⁷ was used to predict ORFs in those contigs, and the amino acid sequences were searched against the nr database (August 12th, 2016) using blastp³⁸ with a maximum E-value of 10⁻⁵. The annotations were used to verify viral contigs from the VirFinder results. Contigs were verified to be non-chimeric by even recruitment.

Quality filtered metagenomic and metatranscriptomic reads were mapped back to these contigs with Bowtie2 version 2.2.6 using the default settings and the expression patterns were identified and visualized with Anvi’o³⁹ version 2.1.0.

Microbial community composition analysis

The V4-V5 regions of the 16S-rRNA coding gene were amplified from DNA and cDNA from all samples using the 515-N-F and 926-R primers, and sequenced on an Illumina MiSeq 2x300 bp (UC Davis genome center) along with a mock community as described in Parada et al.⁴⁰.

The ends of the resulting reads were trimmed with PRINSEQ⁴¹ to a quality score higher than 20. The trimmed reads were merged with USEARCH7⁴², allowing for 3 mismatches in the overlap region. Retained assembled reads were clustered with mothur⁴³ version 1.38.0 according to the MiSeq SOP and classified with SILVA version 119. Bray-Curtis dissimilarity and dendrograms were calculated and plotted with the R package vegan⁴⁴.
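For readers unfamiliar with the measure, Bray-Curtis dissimilarity between two samples is the sum of absolute abundance differences divided by the total abundance of both samples. The analysis itself was done with vegan in R; the Python function below is only an illustrative sketch of the same formula, with hypothetical abundance vectors.

```python
def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two abundance vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("abundance vectors must have the same length")
    total = sum(a) + sum(b)
    if total == 0:
        raise ValueError("both samples are empty")
    return sum(abs(x - y) for x, y in zip(a, b)) / total


# Hypothetical coverages of three contigs in two samples.
sample_1 = [6, 7, 4]
sample_2 = [10, 0, 6]
print(bray_curtis(sample_1, sample_2))
```

A value of 0 means identical composition and 1 means no shared members, which is how the 80-100% values for viral contigs should be read.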

Analysis of PS-II D1 protein sequences

A curated set of PS-II D1 amino acid sequences of myoviruses, podoviruses, cyanobacteria and eukaryotes (chloroplast) from Pfam⁴⁵ and RefSeq release 80 was downloaded. All sequences of marine viral PS-II D1 were retained in addition to sequences of bacterial and eukaryotic taxa that were identified in the 16S-rRNA community composition. One of the assembled contigs contained a psbA gene coding for PS-II D1. The translated amino acid sequences were added to the set of proteins.

Merged reads from the metatranscriptomes and unmerged forward reads from the metagenomes were aligned with blastx³⁸ against this set, requiring a maximum E-value of 10⁻⁵. The reads that passed the filter were translated into amino acids using Biopython⁴⁶ according to the reading frame indicated by the blastx start and end values.
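The translation step can be sketched as follows. The study used Biopython; the version below is a dependency-free illustration that assumes the usual blastx convention of 1-based query coordinates with start > end for hits on the reverse strand. All function and variable names are hypothetical, and codons containing ambiguous bases are rendered as X.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_hit(read, qstart, qend):
    """Translate the part of a read covered by a blastx hit.

    qstart and qend are 1-based query coordinates; qstart > qend marks a
    hit on the reverse strand.
    """
    read = read.upper()
    if qstart <= qend:
        region = read[qstart - 1:qend]
    else:
        region = reverse_complement(read[qend - 1:qstart])
    codons = (region[i:i + 3] for i in range(0, len(region) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


print(translate_hit("ccATGGCC", 3, 8))  # forward-strand hit starting at base 3
print(translate_hit("GGCCAT", 6, 1))    # reverse-strand hit over the whole read
```

Because the hit coordinates already fix the frame, no six-frame translation is needed.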

Following the protocol used in Ignacio-Espinoza et al.⁴⁷, a total of 158 sequences were aligned with mafft⁴⁸ version 7.305b with parameters set to globalpair, gap open penalty 1.5, gap extension penalty 0.5 and scoring matrix BLOSUM30. Informative blocks were identified using Gblocks⁴⁹ version 0.91b with a minimum block length of 5, blocks representing at least half of the sequences and gaps allowed (b3=50, b4=5, b5=h). The blocks were used to build a maximum likelihood phylogenetic tree using RAxML⁵⁰ (best of 20 trees, gamma model and WAG substitution matrix). A hidden Markov model (HMM) of the same set was also built with hmmer 3.0⁵¹. The translated metagenomic and metatranscriptomic amino acid sequences were searched using the HMM with an E-value threshold of 10⁻⁵. A total of 190,928 translated metatranscriptomic reads and 72,292 metagenomic reads from all samples remained after this step. Those reads were locally aligned to the HMM using the hmmer 3.0 function hmmalign and placed into the phylogenetic tree using pplacer⁵² version 1.1.alpha17 (sup. fig. 6).

Analysis of gp23 protein sequences

Metatranscriptomic and metagenomic reads were searched against a set of T4-like clusters of orthologous groups (COGs) with an E-value threshold of 10⁻⁵. In total, 89,768 metatranscriptomic reads and 134,995 metagenomic reads were annotated as gp23. An HMM of gp23 was built as described above for PS-II D1, and translated reads were searched and placed with pplacer. The tree was visualized with the Interactive Tree Of Life (iTOL)⁵³.

Recruitment to phage genomes

The four currently available full pelagiphage genomes were downloaded from NCBI and concatenated with assembled viral contigs from the metatranscriptomes and metagenomes as well as with published cyanophage genomes downloaded from NCBI RefSeq. Metagenomic and metatranscriptomic reads were searched against the genomes dataset with blastn using default settings. For metagenomes only hits longer than 100 bp were retained, and for metatranscriptomes only hits longer than 200 bp. Hits were then plotted against the genomes using R⁵⁴.
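The hit filtering and the identity summary behind the recruitment plots can be sketched as below. This assumes the blastn results were written in BLAST+ tabular format (`-outfmt 6`, with its default twelve columns), which the text does not state; the function names, thresholds other than the length cutoffs, and the example lines are hypothetical. Plotting was done in R and is not reproduced here.

```python
def parse_hits(lines, min_length):
    """Yield (subject, percent_identity, start, end) for hits longer than min_length.

    Expects BLAST+ tabular lines: qseqid sseqid pident length mismatch
    gapopen qstart qend sstart send evalue bitscore.
    """
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        subject, pident, length = fields[1], float(fields[2]), int(fields[3])
        sstart, send = int(fields[8]), int(fields[9])
        if length > min_length:
            yield subject, pident, min(sstart, send), max(sstart, send)


def identity_fractions(hits, high=98.0, low=90.0):
    """Fractions of hits at >= high identity and at low <= identity < high."""
    hits = list(hits)
    if not hits:
        return 0.0, 0.0
    n_high = sum(1 for h in hits if h[1] >= high)
    n_mid = sum(1 for h in hits if low <= h[1] < high)
    return n_high / len(hits), n_mid / len(hits)


# Hypothetical metagenomic hits; the third is too short and is discarded.
example = [
    "read1\tcontigA\t100.0\t120\t0\t0\t1\t120\t500\t619\t1e-60\t222",
    "read2\tcontigA\t93.5\t110\t7\t0\t1\t110\t900\t791\t1e-40\t150",
    "read3\tcontigA\t99.0\t80\t1\t0\t1\t80\t10\t89\t1e-30\t140",
]
hits = list(parse_hits(example, min_length=100))
print(hits)
print(identity_fractions(hits))
```

For metatranscriptomes the same function would be called with `min_length=200`.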

Data availability

All data can be found on EMBL-ENA under project number PRJEB12234. Raw metatranscriptomics sequences accession numbers are ERS1864892-ERS1864903, and negative control library sequences accession number is ERR2089009. Raw metagenomic sequences accession numbers are ERS1869885-ERS1869896 and negative control accession number is ERS1872073. Assembled viral contigs accession numbers are ERZ474118-ERZ474183.

Acknowledgements

The authors would like to thank R. Sachdeva, N. Ahlgren, A. Parada, L. Berdjeb, E. Graham, M. Lee, J. Ren, F. Sun and T. Delmont for insightful discussions and advice on bioinformatics analyses. We thank Catherine Roney-Garcia, the Sundiver crew and the USC Wrigley Institute of Environmental Studies for logistic support. This work was supported by NSF grant 1136818, Gordon and Betty Moore Foundation Marine Microbiology Initiative grant GBMF3779 and Norma and Jerol Sonosky summer fellowship to E.T.S.

References:

1.↵
Proctor, L. M., & Fuhrman, J. A. (1990). Viral mortality of marine bacteria and cyanobacteria. Nature, 343(6253), 60.
OpenUrl CrossRef Web of Science
2.↵
Suttle, C. A. (2007). Marine viruses-major players in the global ecosystem. Nature reviews. Microbiology, 5(10), 801.
OpenUrl CrossRef PubMed Web of Science
3.↵
Lima-Mendez, G., Faust, K., Henry, N., Decelle, J., Colin, S., Carcillo, F., … & Bittner, L. (2015). Determinants of community structure in the global plankton interactome. Science, 348(6237), 1262073.
OpenUrl Abstract/FREE Full Text
4.↵
Brum, J. R., Ignacio-Espinoza, J. C., Roux, S., Doulcier, G., Acinas, S. G., Alberti, A., … & Gorsky, G. (2015). Patterns and ecological drivers of ocean viral communities. Science, 348(6237), 1261498.
OpenUrl Abstract/FREE Full Text
5.
Paez-Espino, D., Eloe-Fadrosh, E. A., Pavlopoulos, G. A., Thomas, A. D., Huntemann, M., Mikhailova, N., … & Kyrpides, N. C. (2016). Uncovering Earth’s virome. Nature, 536 (7617).
6.↵
Nishimura, Y., Watai, H., Honda, T., Mihara, T., Omae, K., Roux, S., … & Sullivan, M. B. (2017). Environmental viral genomes shed new light on virus-host interactions in the ocean. mSphere, 2(2), e00359–16.
OpenUrl
7.↵
Chow, C. E. T., & Fuhrman, J. A. (2012). Seasonality and monthly dynamics of marine myovirus communities. Environmental microbiology, 14(8), 2171–2183.
OpenUrl CrossRef PubMed Web of Science
8.↵
Adriaenssens, E. M., & Cowan, D. A. (2014). Using signature genes as tools to assess environmental viral ecology and diversity. Applied and environmental microbiology, 80(15), 4470–4480.
OpenUrl Abstract/FREE Full Text
9.↵
Miranda, J. A., Culley, A. I., Schvarcz, C. R., & Steward, G. F. (2016). RNA viruses as major contributors to Antarctic virioplankton. Environmental microbiology, 18(11), 3714–3727.
OpenUrl CrossRef
10.↵
Ottesen, E. A., Young, C. R., Eppley, J. M., Ryan, J. P., Chavez, F. P., Scholin, C. A., & DeLong, E. F. (2013). Pattern and synchrony of gene expression among sympatric marine microbial populations. Proceedings of the National Academy of Sciences, 110(6), E488–E497.
OpenUrl Abstract/FREE Full Text
11.↵
Ottesen, E. A., Young, C. R., Gifford, S. M., Eppley, J. M., Marin, R., Schuster, S. C., … & DeLong, E. F. (2014). Multispecies diel transcriptional oscillations in open ocean heterotrophic bacterial assemblages. Science, 345(6193), 207–212.
OpenUrl Abstract/FREE Full Text
12.↵
Jia, Y., Shan, J., Millard, A., Clokie, M. R., & Mann, N. H. (2010). Light-dependent adsorption of photosynthetic cyanophages to Synechococcus sp. WH7803. FEMS microbiology letters, 310(2), 120–126.
OpenUrl CrossRef PubMed Web of Science
13.↵
Moniruzzaman, M., Wurch, L. L., Alexander, H., Dyhrman, S. T., Gobler, C. J., & Wilhelm, S. W. (2017). Virus-host relationships of marine single-celled eukaryotes resolved from metatranscriptomics. Nature Communications, 8.
14.↵
Needham, D. M., Sachdeva, R., & Fuhrman, J. A. (2017). Ecological dynamics and cooccurrence among marine phytoplankton, bacteria and myoviruses shows microdiversity matters. The ISME Journal.
15.↵
Awad, S., Irber, L., & Brown, C. T. (2017). Evaluating Metagenome Assembly on a Simple Defined Community with Many Strain Variants. bioRxiv, 155358.
16.↵
Martinez-Hernandez, F., Fornas, O., Gomez, M. L., Bolduc, B., de la Cruz Pena, M. J., Martínez, J. M., … & Sullivan, M. B. (2017). Single-virus genomics reveals hidden cosmopolitan and abundant viruses. Nature Communications, 8.
17.↵
Deng, L., Ignacio-Espinoza, J. C., Gregory, A. C., Poulos, B. T., Weitz, J. S., Hugenholtz, P., & Sullivan, M. B. (2014). Viral tagging reveals discrete populations in Synechococcus viral genome sequence space. Nature, 513(7517), 242.
OpenUrl CrossRef PubMed
18.↵
Williamson, S. J., Allen, L. Z., Lorenzi, H. A., Fadrosh, D. W., Brami, D., Thiagarajan, M., … & Venter, J. C. (2012). Metagenomic exploration of viruses throughout the Indian Ocean. PLoS One, 7(10), e42047.
OpenUrl CrossRef PubMed
19.↵
Hurwitz, B. L., & U’Ren, J. M. (2016). Viral metabolic reprogramming in marine ecosystems. Current opinion in microbiology, 31, 161–168.
OpenUrl CrossRef
20.↵
Lindell, D., Jaffe, J. D., Johnson, Z. I., Church, G. M., & Chisholm, S. W. (2005). Photosynthesis genes in marine viruses yield proteins during host infection. Nature, 438(7064), 86.
OpenUrl CrossRef PubMed Web of Science
21.↵
Clokie, M. R., Shan, J., Bailey, S., Jia, Y., Krisch, H. M., West, S., & Mann, N. H. (2006). Transcription of a ’photosynthetic’T4-type phage during infection of a marine cyanobacterium. Environmental Microbiology, 8(5), 827–835.
OpenUrl CrossRef PubMed Web of Science
22.↵
Sharon, I., Tzahor, S., Williamson, S., Shmoish, M., Man-Aharonovich, D., Rusch, D. B., … & Adir, N. (2007). Viral photosynthetic reaction center genes and transcripts in the marine environment. The ISME journal, 1(6), 492.
OpenUrl
23.↵
Proctor, L. M., & Fuhrman, J. A. (1990). Viral mortality of marine bacteria and cyanobacteria. Nature, 343(6253), 60.
OpenUrl CrossRef Web of Science
24.↵
Wommack, K. E., & Colwell, R. R. (2000). Virioplankton: viruses in aquatic ecosystems. Microbiology and molecular biology reviews, 64(1), 69–114.
OpenUrl Abstract/FREE Full Text
25.↵
Sullivan, M. B., Waterbury, J. B., & Chisholm, S. W. (2003). Cyanophages infecting the oceanic cyanobacterium Prochlorococcus. Nature, 424(6952), 1047.
OpenUrl CrossRef PubMed Web of Science
26.
Millard, A. D., & Mann, N. H. (2006). A temporal and spatial investigation of cyanophage abundance in the Gulf of Aqaba, Red Sea. Journal of the Marine Biological Association of the United Kingdom, 86(3), 507–515.
OpenUrl
27.↵
Wang, K., & Chen, F. (2008). Prevalence of highly host-specific cyanophages in the estuarine environment. Environmental microbiology, 10(2), 300–312.
OpenUrl CrossRef PubMed Web of Science
28.↵
Martin, E., & Benson, R. (1988). Phages of cyanobacteria. The bacteriophages, 2, 607–645.
OpenUrl
29.↵
Dupont, C. L., McCrow, J. P., Valas, R., Moustafa, A., Walworth, N., Goodenough, U., … & Mann, E. (2015). Genomes and gene expression across light and productivity gradients in eastern subtropical Pacific microbial communities. The ISME journal, 9(5), 1076.
OpenUrl
30.↵
Allen, L. Z., McCrow, J. P., Ininbergs, K., Dupont, C. L., Badger, J. H., Hoffman, J. M., … & Venter, J. C. (2017). The Baltic Sea Virome: Diversity and Transcriptional Activity of DNA and RNA Viruses. mSystems, 2(1), e00125–16.
OpenUrl
31.↵
Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 30(15), 2114–2120.
OpenUrl CrossRef PubMed Web of Science
32.↵
Zhang, J., Kobert, K., Flouri, T., & Stamatakis, A. (2013). PEAR: a fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics, 30(5), 614–620.
OpenUrl PubMed Web of Science
33.↵
Margulies, M., Egholm, M., Altman, W. E., Attiya, S., Bader, J. S., Bemben, L. A., … & Dewell, S. B. (2005). Genome sequencing in open microfabricated high density picoliter reactors. Nature, 437(7057), 376.
OpenUrl CrossRef PubMed Web of Science
34.↵
Sommer, D. D., Delcher, A. L., Salzberg, S. L., & Pop, M. (2007). Minimus: a fast, lightweight genome assembler. BMC bioinformatics, 8(1), 64.
OpenUrl CrossRef PubMed
35.↵
Roux, S., Enault, F., Hurwitz, B. L., & Sullivan, M. B. (2015). VirSorter: mining viral signal from microbial genomic data. PeerJ, 3, e985.
OpenUrl CrossRef
36.↵
Ren, J., Ahlgren, N. A., Lu, Y. Y., Fuhrman, J. A., & Sun, F. (2017). VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data. Microbiome, 5(1), 69.
OpenUrl CrossRef
37.↵
Hyatt, D., Chen, G. L., LoCascio, P. F., Land, M. L., Larimer, F. W., & Hauser, L. J. (2010). Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC bioinformatics, 11(1), 119.
OpenUrl CrossRef PubMed
38.↵
Camacho, C., Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K., & Madden, T. L. (2009). BLAST+: architecture and applications. BMC bioinformatics, 10(1), 421.
OpenUrl CrossRef PubMed
39.↵
Eren, A. M., Esen, Ö. C., Quince, C., Vineis, J. H., Morrison, H. G., Sogin, M. L., & Delmont, T. O. (2015). Anvi’o: an advanced analysis and visualization platform for ’omics data. PeerJ, 3, e1319.
OpenUrl CrossRef PubMed
40.↵
Parada, A. E., Needham, D. M., & Fuhrman, J. A. (2016). Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time series and global field samples. Environmental microbiology, 18(5), 1403–1414.
OpenUrl CrossRef
41.↵
Schmieder, R., & Edwards, R. (2011). Quality control and preprocessing of metagenomic datasets. Bioinformatics, 27(6), 863–864.
42.
Edgar, R. C. (2010). Search and clustering orders of magnitude faster than BLAST. Bioinformatics, 26(19), 2460–2461.
43.
Schloss, P. D., Westcott, S. L., Ryabin, T., Hall, J. R., Hartmann, M., Hollister, E. B., … & Sahl, J. W. (2009). Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Applied and environmental microbiology, 75(23), 7537–7541.
44.
Oksanen, J., Kindt, R., Legendre, P., O’Hara, B., & Stevens, M. H. H. (2007). The vegan package. Community ecology package, 10, 631–637. http://vegan.r-forge.r-project.org
45.
Finn, R. D., Coggill, P., Eberhardt, R. Y., Eddy, S. R., Mistry, J., Mitchell, A. L., … & Salazar, G. A. (2016). The Pfam protein families database: towards a more sustainable future. Nucleic acids research, 44(D1), D279–D285.
46.
Cock, P. J., Antao, T., Chang, J. T., Chapman, B. A., Cox, C. J., Dalke, A., … & De Hoon, M. J. (2009). Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics, 25(11), 1422–1423.
47.
Ignacio-Espinoza, J. C., & Sullivan, M. B. (2012). Phylogenomics of T4 cyanophages: lateral gene transfer in the ‘core’ and origins of host genes. Environmental microbiology, 14(8), 2113–2126.
48.
Katoh, K., & Standley, D. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular biology and evolution, 30(4), 772–780.
49.
Castresana, J. (2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Molecular biology and evolution, 17(4), 540–552.
50.
Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics, 30(9), 1312–1313.
51.
Johnson, L. S., Eddy, S. R., & Portugaly, E. (2010). Hidden Markov model speed heuristic and iterative HMM search procedure. BMC bioinformatics, 11(1), 431.
52.
Matsen, F. A., Kodner, R. B., & Armbrust, E. V. (2010). pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC bioinformatics, 11(1), 538.
53.
Letunic, I., & Bork, P. (2016). Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic acids research, 44(W1), W242–W245.
54.
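R Core Team (2016). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/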

Posted August 17, 2017.

Subject Area

Ecology