Uncovering the unexplored diversity of thioamidated ribosomal peptides in Actinobacteria using the RiPPER genome mining tool

Javier Santos-Aberturas; Govind Chandra; Luca Frattaruolo; Rodney Lacret; Thu H. Pham; Natalia M. Vior; Tom H. Eyles; Andrew W. Truman

doi:10.1101/494286

ABSTRACT

The rational discovery of new specialized metabolites by genome mining represents a very promising strategy in the quest for new bioactive molecules. Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a major class of natural product that derive from genetically encoded precursor peptides. However, RiPP gene clusters are particularly refractory to reliable bioinformatic predictions due to the absence of a common biosynthetic feature across all pathways. Here, we describe RiPPER, a new tool for the family-independent identification of RiPP precursor peptides and apply this methodology to search for novel thioamidated RiPPs in Actinobacteria. Until now, thioamidation was believed to be a rare post-translational modification, which is catalyzed by a pair of proteins (YcaO and TfuA) in Archaea. In Actinobacteria, the thioviridamide-like molecules are a family of cytotoxic RiPPs that feature multiple thioamides, and it has been proposed that a YcaO-TfuA pair of proteins also catalyzes their formation. Potential biosynthetic gene clusters encoding YcaO and TfuA protein pairs are common in Actinobacteria but the chemical diversity generated by these pathways is almost completely unexplored. A RiPPER analysis reveals a highly diverse landscape of precursor peptides encoded in previously undescribed gene clusters that are predicted to make thioamidated RiPPs. To illustrate this strategy, we describe the first rational discovery of a new family of thioamidated natural products, the thiovarsolins from Streptomyces varsoviensis.

INTRODUCTION

Microorganisms have provided humankind with a vast plethora of specialized metabolites with invaluable applications in medicine and agriculture.¹ The advent of widespread genome sequencing has shown that the metabolic potential of bacteria had been substantially underestimated, as their genomes contain many more biosynthetic gene clusters (BGCs) than known compounds.^2,3 Much of this enormous potential is either unexplored or undetectable under laboratory culture conditions, and is likely to include structurally novel bioactive specialized metabolites. Among the main classes of specialized metabolites produced by microorganisms, the ribosomally synthesized and post-translationally modified peptides⁴ (RiPPs) may harbor the largest amount of unexplored structural diversity. This is due to the inherent difficulties related to the in silico prediction of their BGCs, as RiPP biosynthetic pathways lack any kind of universally shared feature apart from the existence of a pathway-specific precursor peptide.

RiPP BGCs can be identified by the co-occurrence of specific RiPP tailoring enzymes (RTEs) alongside a precursor peptide that contains sequence motifs that are characteristic of a given RiPP family. This makes it relatively simple to identify further examples of known RiPP families,^5,6 but the identification of currently undiscovered RiPP families remains a significant unsolved problem. Unlike specialized metabolites such as polyketides, non-ribosomal peptides and terpenes, there are no genetic features that are common to all RiPP BGCs to aid in their identification. Furthermore, genes encoding precursor peptides are often missed during genome annotation due to their small size, yet the reliable prediction of precursor peptides constitutes a crucial task, as this starting scaffold is essential for RiPP structural prediction. Numerous analyses of specific RiPP classes signal the existence of a wide array of uncharacterized RiPP families,^7-9 but currently available prediction tools still rely on precursor peptide features that are associated with known RiPP families, thereby limiting the discovery of new RiPP families.^10-14

YcaO domain proteins are a widespread superfamily of enzymes with an intriguing catalytic potential in RiPP biosynthesis.¹⁵ These were originally shown to be responsible for the introduction of oxazoline and thiazoline heterocycles in the PP backbone of microcins,¹⁶ and were very recently demonstrated to catalyze the formation of the macroamidine ring of bottromycin.^17-19 YcaO proteins act as cyclodehydratases, activating the amide bond substrate by nucleophilic attack, which is followed by ATP-driven O-phosphorylation of the hemiorthoamide intermediate and subsequent elimination of phosphate. In most azoline-containing RiPPs, this catalytic activity requires a partner protein (E1-like or Ocin-ThiF-like proteins that are clustered with or fused to the YcaO domain), which acts as a docking element to bring the precursor peptide to the active site of the cyclodehydratase. YcaO proteins can also act as standalone proteins, as in bottromycin biosynthesis,^17-19 and many YcaO proteins are encoded in genomes without E1-like or Ocin-ThiF-like partner proteins,^9,15 including in the BGCs of thioviridamide-like molecules.^6,20-24

Thioviridamide and related compounds are cytotoxic RiPPs that contain multiple thioamide groups (Figure 1), but no azole or macroamidine rings. Thioamides are rare in nature^25-31 and it has been hypothesized that YcaO proteins could be responsible for this rare amide bond modification in thioviridamide biosynthesis, potentially in cooperation with TfuA domain proteins¹⁵ (Figure 1). This protein pair has been identified elsewhere in nature, including in archaea, where they are involved in the ATP-dependent thioamidation of a glycine residue of methyl-coenzyme M reductase.^32,33 We therefore hypothesized that the identification of tfuA-like genes could be employed as a rational criterion for the identification of BGCs responsible for the production of novel thioamidated RiPPs in bacteria.

Figure 1

An example of a thioviridamide-like molecule, thioalbamide, and inset, a proposed biochemical route to thioamides. Thioamides are highlighted in blue and other post-translational modifications are colored red.

An exploration of the diversity of tfuA-containing BGCs required methodology to identify precursor peptides that have no homology to known precursor peptides. Here, we report RiPPER (RiPP Precursor Peptide Enhanced Recognition), a method for the identification of precursor peptides that requires no information about RiPP structural class (available at https://github.com/streptomyces/ripper). This evaluates regions surrounding any putative RTE for short open reading frames (ORFs) based on the likelihood that these are truly peptide-coding genes. Peptide similarity networking is then used to identify putative RiPP families. We apply this methodology to identify RiPP BGCs encoding TfuA proteins in Actinobacteria, which reveals a highly diverse landscape of BGC families that are predicted to make thioamidated RiPPs. This analysis informed the discovery of the thioamidated thiovarsolins from Streptomyces varsoviensis, which are predicted to belong to a wider family of related thioamidated RiPPs and represents the first rational discovery of a new family of thioamidated compounds from nature.

RESULTS AND DISCUSSION

Development of a family-independent RiPP genome mining tool

Within a given RiPP family, all BGCs usually encode at least one tailoring enzyme and one precursor peptide that each feature domains conserved across the RiPP family.⁴ This has led to the development of genome mining methodology that can identify these well-characterized RiPP families with high accuracy.^11-13 However, there is a growing number of widespread RiPP BGCs with little or no homology to known RiPP BGCs.^7,34 Theoretically, backbone modification like thioamidation or epimerization³⁵ can occur on any residue. In addition, well-characterized RiPP tailoring enzymes can be associated with unusual precursor peptides that lack homology to known RiPP classes.⁹ We therefore sought to develop a method to identify likely precursor peptides that was independent of PP sequence and could be applicable for any RiPP family. The starting point for this method was to employ the functionality of RODEO^{13, 14} to identify genomic regions associated with a series of putative RTEs. RODEO uses a mixture of heuristic scoring and support vector machine classification to identify precursor peptides for lasso peptides¹³ and thiopeptides,¹⁴ but does not accurately identify other precursor peptides, whose sequences are highly variable and are often not annotated in genomes.

To enable the sequence independent discovery of precursor peptides, we sought to identify short ORFs that possess similar genetic features as other genes in a given gene cluster, including ribosome binding sites, codon usage and GC content. Prodigal (PROkaryotic DYnamic programming Gene-finding ALgorithm) uses these criteria to identify bacterial ORFs.³⁶ Therefore, following RODEO retrieval of nucleotide data, we implemented a modified form of this algorithm to specifically search for ORFs that encode for peptides of between 20 and 120 amino acids within apparently non-coding regions near to a predicted RTE (Figure 2A). Given the prevalence of characterized precursor peptides that are encoded on the same strand as a tailoring gene, a same strand score is added (custom parameter; default = 5). A modified GenBank file is generated by RiPPER that annotates these putative short ORFs within the putative BGC (Figure S1), and these are ranked alongside annotated short genes based on their Prodigal score. RiPPER then retrieves the top three scoring ORFs within ±8 kb of the RTE, plus any additional high scoring ORFs over a specified score threshold that represent probable genes. These are then assessed for Pfam domains³⁷ and data associated with each peptide is tabulated for further processing.

Figure 2

RiPPER identification of putative precursor peptides. (A) Schematic of RiPPER workflow where a cluster is identified based on a putative RiPP tailoring enzyme (RTE). (B) The 30 largest peptide similarity networks identified using RiPPER for peptides associated with tfuA-like genes in Actinobacteria. Red numbers indicate networks predicted to comprise of authentic precursor peptides (see Table S1 and Figures S6-S19) and triangular nodes indicate peptides encoded on the opposite strand to the RTE gene.

To validate this approach, we used RTE accession numbers that had previously been used to identify lasso peptide (RODEO¹³), microviridin³⁸ and thiopeptide (RODEO¹⁴) gene clusters. In each case, class-specific rules had been used to identify associated precursor peptides. These RiPP classes are well-suited to method validation as they have diverse gene cluster features and precursor peptide sequences, and span multiple bacterial taxa. In addition, the genes encoding these small peptides are often not annotated in genome sequences.¹³ We therefore used RiPPER with the same protein accessions as those previous studies to retrieve BGCs and associated precursor peptides. Comparison of the RiPPER outputs with these studies revealed that lasso peptide and microviridin precursor identification was highly reliable. 1056 out of 1122 (94.1%) and 279 out of 288 (96.7%) peptides identified by those prior mining studies were identified by RiPPER (Table 1, Supplementary Datasets 1-2).

View this table:

Table 1

Comparison of RiPPER with prior studies on the identification of RiPP precursor peptides.

In contrast, RiPPER only retrieved 439 of the 591 (74.3%) thiopeptide precursors previously identified (Table 1, Supplementary Dataset 3). This was possibly due to the comparatively large size of thiopeptide BGCs, which meant that the ±8 kb search window was not suited to a subset of these BGCs. Widening the unbiased search reduced specificity of the retrieval, so an additional targeted search step was introduced. All short peptides across the entire gene cluster region (default = 35 kb) that were not retrieved by the first search were analyzed for precursor peptide domains using hidden Markov models (HMMs) recently built by Haft et al.³⁹ Any peptides containing a domain were therefore also retrieved. This provided a minor improvement to RiPPER retrieval of lasso precursor peptides but significantly improved thiopeptide precursor peptide retrieval to 543 out of 591 (91.9%) peptides identified by RODEO.¹⁴

This data demonstrated that the RiPPER methodology was applicable to multiple diverse classes of RiPP, but the unbiased nature of retrieval meant that only between a half and a quarter (depending on RiPP class) of total retrieved peptides were likely to be precursor peptides (Table 1). We therefore generated peptide similarity networks⁴⁰ using peptides retrieved from each RiPPER analysis, where peptides with at least 40% identity were connected to each other. Despite the large sequence variance within each RiPP class, this was highly effective at filtering the peptides into networks of likely precursor peptides. For each RiPPER analysis, the largest network (“network 1”) contained the majority of precursor peptides identified by previous studies (Table 1, Figures S2-S4). Unexpectedly, network 1 of the lasso peptide dataset also contained PqqD domain proteins, a conserved feature of lasso peptide pathways that function as RiPP precursor peptide recognition elements (RREs).^41,42 These peptides could be easily filtered by Pfam analysis, as would a higher identity cut-off. In addition, network 2 comprises of 56 Burkholderia peptides that are precursors to capistruin lasso peptides (all identified by RODEO). Notably, for each RiPPER analysis, network 1 contained peptides with the expected precursor peptide domain that were not retrieved by either RODEO^13,14 or the bespoke microviridin analysis.³⁸ In total, this provided over 200 new candidate precursor peptides (Table 1), as well as additional networked peptides with no known domains that could feasibly be authentic precursor peptides. The ability of RiPPER to correctly identify a comparable number of precursor peptides to prior targeted methods demonstrates that the combination of rational ORF identification and scoring, Pfam analysis, and peptide similarity networking can identify RiPP precursor peptides with a high degree of accuracy and coverage without any prior knowledge of the RiPP class.

Identification of thioamidated RiPP BGCs using RiPPER

As a backbone modification, thioamidation potentially has no requirement for specific amino acid side chains, which means that there may be no conserved sequence motifs within precursor peptide substrates. To guide our identification of thioamidated RiPP BGCs, we identified a curated set of 229 TfuA-like proteins in Actinobacteria whose putative BGCs were retrieved using RiPPER, which showed that each TfuA protein was encoded alongside a YcaO protein but their associated gene clusters could be highly variable. RiPPER retrieved 743 peptides (Supplementary Dataset 4) and peptide similarity networking (40% identity cut-off) yielded 74 distinct networks of peptides, where 30 of these networks featured four or more peptides (Figure 2B, Figure S5, Supplementary Table 1). MultiGeneBlast⁴³ was then employed to compare the BGCs corresponding to each network.

As an initial proof of concept, this correctly grouped all thioviridamide-like precursor peptides into a single network (Figure 3A). Surprisingly, these precursor peptides were connected with four additional peptides encoded in putative BGCs that are extremely different to thioviridamide-like BGCs; three of these peptides were not previously annotated as genes. These peptides feature extensive sequence similarities with the thioviridamide-like precursor peptides (Figure S6), but the BGCs themselves are extremely different, where the only common features with the thioviridamide-like BGCs are the YcaO, TfuA and precursor peptide genes (Figure 3B). More generally, peptide networking guided the identification of a wide variety of probable tfuA-containing RiPP BGCs (Figures S6-S19). For example, many mycobacteria encode a YcaO-TfuA protein pair, and the largest network of putative precursor peptides is associated with this mycobacterial BGC (Figure 2B, Network 1) where they are usually encoded near a Type III polyketide synthase (PKS) and a sulfotransferase (Figure S7). Network 2 consists of 25 related Streptomyces peptides that possess high Prodigal scores and are encoded at the start of a conserved biosynthetic operon (Figure S8). This is a strong candidate as an authentic RiPP BGC family, yet only 6 of these 25 short peptides were originally annotated.

Figure 3

Thioviridamide-like precursor peptides. (A) The precursor peptide network that includes both thioviridamide-like precursor peptides (red nodes) and a related but uncharacterized family of precursor peptides from BGCs that are highly different to thioviridamide-like BGCs (blue nodes). Characterized compounds are listed with their respective nodess. (B) Comparative analysis of thioviridamide-like and non-thioviridamide-like BGCs from this network where related genes share the same color. See Figure S6 for full BGC details.

Thioamidated RiPPs are a largely unexplored area of the natural products landscape

To investigate whether BGC families correlate with the evolutionary relationships of the TfuA proteins, a maximum likelihood tree was constructed from standalone TfuA domain proteins and the peptide networks were mapped to this tree (Figure 4, Supplementary Dataset 5). This showed strong correlations between TfuA phylogeny and precursor peptide similarity. Despite the significant differences between their gene clusters, the thioviridamide-like and non-thioviridamide-like peptides of Network 5 are all associated with closely related TfuA proteins. Unsurprisingly, some TfuA domain proteins are associated with multiple peptide networks due to the abundance of small peptides that are unlikely to be precursor peptides, such as regulatory proteins and RREs.⁴² For example, almost all peptides from Networks 9, 11 and 18 are associated with the same set of TfuA domain proteins, but Pfam analysis indicates that Networks 11 and 18 consist of acyl carrier proteins and ThiS-like proteins,⁴⁴ respectively.

Figure 4

Examples of putative RiPP BGCs and associated TfuA phylogeny. A maximum likelihood tree (branch lengths removed) of TfuA-like proteins is color-coded to indicate the relationship between TfuA-like proteins and the associated networks of putative precursor peptides. Representative BGCs are also shown, where grey genes indicate genetic features that are conserved across multiple BGCs within that family. Fully annotated BGCs are shown in Figures S6-S19.

Therefore, the Network 9 peptides, which are encoded at the beginning of each BGC and feature no conserved domains, are likely precursor peptides for this BGC family (Figure 4). In contrast, Pfam analysis indicated that all precursor peptides in Network 7 feature nitrile hydratase domains, which is a common feature amongst precursor peptides across diverse RiPP families.^8,45 In total, at least 15 distinct predicted RiPP families were predicted from the top 30 peptide networks (Supplementary Dataset 4, Table S1, Figures S6-S19), while many smaller networks and singletons are also likely to be authentic precursor peptides, based on their Prodigal scores and positions within BGCs. A comparative analysis with the source GenBank entries indicated that over half of the peptides encoded in these BGCs were not previously annotated (Supplementary Dataset 4); on average, unannotated peptides identified by RiPPER were significantly shorter than annotated peptides (Figure S20).

Characterization of a novel family of TfuA-YcaO BGCs

To determine whether the newly identified YcaO-TfuA BGCs actually produce thioamidated RiPPs, we focused on Network 22 (Figure 5A), a group of five orphan BGCs with multiple unusual features (Figure 5B). Most notably, the predicted precursor peptides feature a series of imperfect repeats that could reflect a repeating core peptide (Figure 5C), where the family varies from a non-repeating precursor peptide (Asanoa ishikariensis) to five repeats (Streptomyces varsoviensis). In addition, the Nocardiopsis and Streptomyces BGCs encode two additional conserved proteins, an amidinotransferase (AmT) and an ATP-grasp ligase, which are homologous to proteins in the pheganomycin pathway,⁴⁶ and are adjacent to genes encoding non-ribosomal peptide synthetases (NRPSs) or PKSs (Figure 5B). Efforts to genetically manipulate S. varsoviensis and Nocardiopsis baichengensis were unsuccessful and we were unsure of the gene cluster boundaries, so transformation-associated recombination (TAR) cloning^47,48 was employed to capture a 31.7 kb DNA fragment comprising 25 genes (Table S2) centered around the ycaO-tfuA core of the S. varsoviensis BGC. Two independent positive TAR clones were conjugated into three different host strains: Streptomyces lividans TK24 and Streptomyces coelicolor M1146 and M1152⁴⁹ and the resulting TARvar exconjugants were fermented in a variety of media. Liquid chromatography-mass spectrometry (LC-MS) analysis revealed two major compounds (m/z 399.18 and m/z 401.20), and two minor compounds (m/z 385.16 and m/z 387.18) not present in the negative control strains (Figure 5D). Small amounts of these compounds could be detected when S. varsoviensis was fermented for 10 days (Figure 6, Figure S21).

Figure 5

Identification of the thiovarsolin family of RiPPs. (A) The associated precursor peptide network. (B) BGCs associated with each precursor peptide. The protein product of each var gene is listed at the top (HO = heme oxygenase; AmT = amidinotransferase; MFS = major facilitator superfamily) and genes common to multiple BGCs are color-coded by the predicted function of the protein product (see Figure S16 for full details). (C) Putative repeating precursor peptides identified by similarity networking. The predicted leader peptide is aligned, while the repeat regions are highlighted. Underlined text indicates the partially conserved core peptide that the thiovarsolins derive from, and bold text indicates equivalent residues in the other precursor peptides. (D) Analysis of thiovarsolin production by S. coelicolor M1146 TARvar, which contains a 31.7 kb DNA fragment centered on the S. varsoviensis BGC. Base peak chromatograms of crude extracts of S. coelicolor M1146 TARvar and an empty vector negative control (pCAP03) are shown, with peaks corresponding to thiovarsolins A-D indicated. Thioamidation and dehydrogenation post-translational modifications are highlighted on the thiovarsolin structures.

Figure 6

Mutational analysis of thiovarsolin biosynthesis. Extracted ion chromatograms (EICs) are shown for each thiovarsolin (A = m/z 399.18, B = m/z 401.20, C = m/z 385.16, D = m/z 387.18). M1146 pCAP03 indicates the empty plasmid control, while each ∆var mutation was made in the TARvar construct and expressed in S. coelicolor M1146. See text and Figure S35 for details of varA*.

To associate the production of these new compounds to the cloned DNA fragment, PCR-targeting mutagenesis⁵⁰ was employed to generate a series of deletion mutants on the putative BGC. A progressive trimming process determined that a cluster of seven genes that are conserved across the Nocardiopsis and Streptomyces BGCs was sufficient for compound production: varA (encoding the predicted repeating precursor peptide), varY (the YcaO protein), varT (the TfuA protein), varO (a heme oxygenase-like protein⁵¹), varL (an ATP-grasp ligase), varP (a major facilitator superfamily transporter) and varS (an amidinotransferase). The deletion of varA, varY and varT completely abolished the production of the four new compounds, while the ∆varO mutant produced only m/z 401.20 and m/z 387.18, suggesting that VarO may function as a dehydrogenase (Figure 6). Deletion of varL, varP and varS did not affect production, despite their conservation in related BGCs (Figure 5B). ∆varY, ∆varT and ∆varO mutants were successfully complemented by expressing these genes under the control of the ermE* promoter, whereas complementation of ∆varA required its native promoter. As expected, expression of a 3.7 kb DNA fragment including only varA, varY and varT in S. coelicolor M1146 led to the production of m/z 401.20 and m/z 387.18 (Figure 6, varAYT). Collectively, this data show that varAYTO are the only genes required for the biosynthesis of this new group of RiPPs, thiovarsolins A-D (observed m/z 399.1818, 401.1968, 385.1652 and 387.1808, respectively, Table S5).

The thiovarsolins are thioamidated peptides that derive from the repetitive core of the precursor peptide

The structures of thiovarsolins A and B were determined by NMR (¹H, ¹³C, COSY, HSQC and HMBC; Figures S22-S33, Table S6) following large scale fermentation and purification of each compound. This analysis showed that thiovarsolins A and B are N-acetylated APR tripeptides in which the amide bond between Pro and Arg is substituted by a thioamide (δ_C 200 ppm) (Figure 5D). This was supported by accurate mass data (Table S5) and an absorbance maximum at ~270 nm for both molecules, which is characteristic of a thioamide group.⁵² Additionally, a trans double bond is present between Cβ and Cγ of the arginine side chain in thiovarsolin A. This peptide backbone is fully compatible with an APR sequence within the repeats of VarA (Figure 5C). The name thiovarsolin corresponds to linear thioamidated peptides made by S. varsoviensis.

Tandem MS (MS²) analysis of the thiovarsolins (Figure S34) revealed a clear structural relationship between thiovarsolins A (m/z 399.18) and C (m/z 385.16), as well as between thiovarsolins B (m/z 401.20) and D (m/z 387.18), which suggested that each 14 Da mass difference could be due to one methyl group. Interestingly, the first repetition of the putative modular core peptide features a GPR motif instead of APR, which could potentially explain this 14 Da mass difference, as well as their observed abundances in relation to thiovarsolins A and B. To test this hypothesis, a mutated version of varA was constructed (varA*, Figure S35) in which the Ala residue in each repeat was substituted by Gly. This was expressed in M1146 TARvar ∆varA using a pGP9-based expression plasmid.⁵³ The resulting strain was only able to produce thiovarsolins C and D (Figure 6, varA*), confirming that these two minor compounds derive from a GPR core peptide. Such an extensively repeating precursor peptide is rare, but is comparable to the variable repeats found in precursor peptides for some cyanobactins⁵⁴ and the fungal RiPP phomopsin.⁵⁵

Our genetic and chemical analysis of the var BGC strongly suggests that the YcaO (VarY) and TfuA (VarT) proteins cooperate to introduce a thioamide bond. Given the absence of a specific protease in the gene cluster, it is plausible that endogenous peptidases are responsible for the liberation of the non-degradable thioamidated APR and GPR tripeptides, which later undergo an N- terminal acetylation catalyzed by an endogenous N-acetyltransferase, as previously reported for other metabolites containing primary amines.^56,57 The timing of VarO-catalyzed dehydrogenation is unclear and could happen directly on the precursor peptide or after proteolysis. Small amounts of thiovarsolins A and B are produced by S. varsoviensis, but the lack of a function for varS and varL suggests that the described thiovarsolins might not be the final products of these pathways. However, no further thiovarsolin-related metabolites could be detected In either S. varsoviensis or S. coelicolor M1146 TARvar when analyzed by comparative metabolomics and by assessment of MS² data for losses of H₂S (m/z 33.99), which is a fragmentation profile that is characteristic of thioamides.⁶

CONCLUSION

The discovery of the thiovarsolins supports the existence of an unexplored array of thioamidated RiPPs in Actinobacteria. The discovery that a minimal gene set of varA (precursor peptide), varY (YcaO protein) and varT (TfuA protein) is sufficient for the biosynthesis of thiovarsolin B (Figure 6) provides strong evidence that the YcaO-TfuA protein pair catalyze peptide thioamidation in bacteria, which is supported by a parallel study by Mitchell and colleagues on thiopeptide thioamidation.¹⁴ It was previously determined that a distantly related pair of homologs catalyze thioamidation of methyl-coenzyme M reductase in archaea.^32,33 The relatively simple thiovarsolin pathway therefore represents a promising system for future biochemical studies of this reaction in the context of RiPP biosynthesis. Unexpectedly, genes conserved across multiple homologous var-like pathways (varS, varP and varL, Figure 5B) were not required for thiovarsolin biosynthesis. Along with N-terminal acetylation, this suggests that the identified thiovarsolins may be shunt products, although the production of thiovarsolins by S. varsoviensis indicates that they are made naturally, so production is not simply a consequence of heterologous pathway expression. The introduction of a double bond in the arginine residue side chain of the thiovarsolins by VarO would represent new RiPP biochemistry, as heme oxygenases have never been associated with RiPP biosynthesis. This shows that the breadth and diversity of RiPP post-translational modifications is still expanding, which has also been highlighted by recent discoveries of radical SAM enzyme-catalyzed epimerization,⁴⁵ decarboxylation⁵⁸ and β-amino acid formation⁵⁹ in RiPP pathways.

RiPPER is a flexible prediction tool that can be applied to any class of predicted RiPP tailoring enzyme to aid in the discovery of this metabolic dark matter. This more general approach complements existing genome-mining tools such as BAGEL,¹⁰ RODEO,^13,14 PRISM⁶⁰ and antiSMASH,¹² which all provide in-depth analyses and product predictions for established RiPP families. The de novo identification of precursors to lasso peptides, microviridins and thiopeptides highlights the scope of RiPPER, which was achieved without any specific rules for these RiPP families. The methodology proved to be highly adept at identifying previously overlooked precursor peptide genes, and the method parameters can be easily adapted based on prior knowledge of a given RiPP family (min/max gene length, max distance from RTE, same strand score and peptide score threshold, for example). In our TfuA analysis, peptide networking proved to be a highly effective method to prioritize related precursor peptides and their associated BGCs for further analysis, where it highlighted the existence of likely RiPP families as opposed to the coincidental presence of a small ORF near a putative BGC. The diversity of TfuA-associated precursor peptides identified in Actinobacteria highlights the utility of an unbiased precursor peptide identification tool and provides the basis for investigating the breadth of this RiPP family. It will be fascinating to determine both the structure and function of these cryptic metabolites.

AVAILABILITY

RiPPER is available at:

https://github.com/streptomyces/ripper and https://hub.docker.com/r/streptomyces/ripdock/

SUPPLEMENTARY DATA

Experimental methods, Figures S1-S35 and Tables S1-S6 (PDF)

Supplementary Dataset 1: RiPPER analysis of lasso precursor peptides (XLSX)

Supplementary Dataset 2: RiPPER analysis of microviridin precursor peptides (XLSX)

Supplementary Dataset 3: RiPPER analysis of thiopeptide precursor peptides (XLSX)

Supplementary Dataset 4: RiPPER analysis of YcaO-TfuA precursor peptides (XLSX)

Supplementary Dataset 5: Phylogenetic tree of TfuA proteins and their association with peptides networks (PDF)

FUNDING

This work was supported by a Royal Society University Research Fellowship to A.W.T., a Biotechnology and Biological Sciences Research Council (BBSRC) grant (BB/M003140/1) to A.W.T., the Erasmus Programme (L.F.), and BBSRC Institute Strategic Programme Grants to the John Innes Centre (BB/J004561/1 and BB/P012523/1).

CONFLICT OF INTEREST

The authors declare no conflict of interest.

ACKNOWLEDGEMENTS

We thank Bradley Moore (Scripps Institution of Oceanography, University of California San Diego, U.S.A.) for pCAP03, Vladimir Larionov (National Cancer Institute, NIH, U.S.A.) for S. cerevisiae VL6-48N, Mervyn Bibb (John Innes Centre, U.K.) for S. coelicolor strains, and Daniel Haft (NCBI/NIH, U.S.A.) for providing the precursor peptide HMMs. We thank Lionel Hill, Paul Brett and Gerhard Saalbach (John Innes Centre, Norwich, UK) for assistance with LC-MS, and Gwenaelle Le Gall and Ian Colquhoun (Quadrum Institute, Norwich, UK) for assistance with NMR.

REFERENCES

1.↵
Baltz, R.H. (2017) Gifted microbes for genome mining and natural product discovery. J. Ind. Microbiol. Biotechnol., 44, 573–588.
OpenUrl CrossRef
2.↵
Bentley, S.D., Chater, K.F., Cerdeño-Tárraga, A.-M., Challis, G.L., Thomson, N.R., James, K.D., Harris, D.E., Quail, M.A., Kieser, H., Harper, D., et al. (2002) Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature, 417, 141–147.
OpenUrl CrossRef PubMed Web of Science
3.↵
Ikeda, H., Ishikawa, J., Hanamoto, A., Shinose, M., Kikuchi, H., Shiba, T., Sakaki, Y., Hattori, M. and Ōmura, S. (2003) Complete genome sequence and comparative analysis of the industrial microorganism Streptomyces avermitilis. Nat. Biotechnol., 21, 526–531.
OpenUrl CrossRef PubMed Web of Science
4.↵
Arnison, P.G., Bibb, M.J., Bierbaum, G., Bowers, A.A., Bugni, T.S., Bulaj, G., Camarero, J.A., Campopiano, D.J., Challis, G.L., Clardy, J., et al. (2013) Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. Nat. Prod. Rep., 30, 108–160.
OpenUrl CrossRef PubMed
5.↵
Goto, Y., Li, B., Claesen, J., Shi, Y., Bibb, M.J. and van der Donk, W.A. (2010) Discovery of unique lanthionine synthetases reveals new mechanistic and evolutionary insights. PLOS Biol., 8, e1000339.
OpenUrl CrossRef PubMed
6.↵
Frattaruolo, L., Lacret, R., Cappello, A.R. and Truman, A.W. (2017) A Genomics-Based Approach Identifies a Thioviridamide-Like Compound with Selective Anticancer Activity. ACS Chem. Biol., 12, 2815–2822.
OpenUrl
7.↵
Haft, D.H. and Basu, M.K. (2011) Biological Systems Discovery In Silico: Radical S-Adenosylmethionine Protein Families and Their Target Peptides for Posttranslational Modification. J. Bacteriol., 193, 2745–2755.
OpenUrl Abstract/FREE Full Text
8.↵
Haft, D.H., Basu, M.K. and Mitchell, D.A. (2010) Expansion of ribosomally produced natural products: a nitrile hydratase-and Nif11-related precursor family. BMC Biol., 8, 70.
OpenUrl CrossRef PubMed
9.↵
Cox, C.L., Doroghazi, J.R. and Mitchell, D.A. (2015) The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles. BMC Genomics, 16, 778.
OpenUrl CrossRef PubMed
10.↵
van Heel, A.J., de Jong, A., Montalbán-López, M., Kok, J. and Kuipers, O.P. (2013) BAGEL3: Automated identification of genes encoding bacteriocins and (non-)bactericidal posttranslationally modified peptides. Nucleic Acids Res., 41, W448–W453.
OpenUrl CrossRef PubMed Web of Science
11.↵
Skinnider, M.A., Johnston, C.W., Edgar, R.E., Dejong, C.A., Merwin, N.J., Rees, P.N. and Magarvey, N.A. (2016) Genomic charting of ribosomally synthesized natural product chemical space facilitates targeted mining. Proc. Natl. Acad. Sci. U.S.A., 113, E6343–E6351.
OpenUrl Abstract/FREE Full Text
12.↵
Blin, K., Wolf, T., Chevrette, M.G., Lu, X., Schwalen, C.J., Kautsar, S.A., Suarez Duran, H.G., de Los Santos, E.L.C., Kim, H.U., Nave, M., et al. (2017) antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification. Nucleic Acids Res., 45, W36–W41.
OpenUrl CrossRef PubMed
13.↵
Tietz, J.I., Schwalen, C.J., Patel, P.S., Maxson, T., Blair, P.M., Tai, H.-C., Zakai, U.I. and Mitchell, D.A. (2017) A new genome-mining tool redefines the lasso peptide biosynthetic landscape. Nat. Chem. Biol., 13, 470–478.
OpenUrl CrossRef PubMed
14.↵
Schwalen, C.J., Hudson, G.A., Kille, B. and Mitchell, D.A. (2018) Bioinformatic Expansion and Discovery of Thiopeptide Antibiotics. J. Am. Chem. Soc., 140, 9494–9501.
OpenUrl
15.↵
Burkhart, B.J., Schwalen, C.J., Mann, G., Naismith, J.H. and Mitchell, D.A. (2017) YcaO-Dependent Posttranslational Amide Activation: Biosynthesis, Structure, and Function. Chem. Rev., 117, 5389–5456.
OpenUrl
16.↵
Dunbar, K.L., Melby, J.O. and Mitchell, D.A. (2012) YcaO domains use ATP to activate amide backbones during peptide cyclodehydrations. Nat. Chem. Biol., 8, 569–575.
OpenUrl CrossRef PubMed
17.↵
Crone, W.J.K., Vior, N.M., Santos-Aberturas, J., Schmitz, L.G., Leeper, F.J. and Truman, A.W. (2016) Dissecting Bottromycin Biosynthesis Using Comparative Untargeted Metabolomics. Angew. Chem. Int. Ed., 55, 9639–9643.
OpenUrl CrossRef
18.
Franz, L., Adam, S., Santos-Aberturas, J., Truman, A.W. and Koehnke, J. (2017) Macroamidine Formation in Bottromycins Is Catalyzed by a Divergent YcaO Enzyme. J. Am. Chem. Soc., 139, 18158–18161.
OpenUrl
19.↵
Schwalen, C.J., Hudson, G.A., Kosol, S., Mahanta, N., Challis, G.L. and Mitchell, D.A. (2017) In Vitro Biosynthetic Studies of Bottromycin Expand the Enzymatic Capabilities of the YcaO Superfamily. J. Am. Chem. Soc., 139, 18154–18157.
OpenUrl
20.↵
Izawa, M., Kawasaki, T. and Hayakawa, Y. (2013) Cloning and heterologous expression of the thioviridamide biosynthesis gene cluster from Streptomyces olivoviridis. Appl. Environ. Microbiol., 79, 7110–7113.
OpenUrl Abstract/FREE Full Text
21.
Izawa, M., Nagamine, S., Aoki, H. and Hayakawa, Y. (2018) Identification of essential biosynthetic genes and a true biosynthetic product for thioviridamide. J. Gen. Appl. Microbiol., 64, 50–53.
OpenUrl
22.
Kawahara, T., Izumikawa, M., Kozone, I., Hashimoto, J., Kagaya, N., Koiwai, H., Komatsu, M., Fujie, M., Sato, N., Ikeda, H., et al. (2018) Neothioviridamide, a Polythioamide Compound Produced by Heterologous Expression of a Streptomyces sp. Cryptic RiPP Biosynthetic Gene Cluster. J. Nat. Prod., 81, 264–269.
OpenUrl
23.
Hayakawa, Y., Sasaki, K., Nagai, K., Shin-ya, K. and Furihata, K. (2006) Structure of thioviridamide, a novel apoptosis inducer from Streptomyces olivoviridis. J. Antibiot., 59, 6–10.
OpenUrl CrossRef PubMed
24.↵
Kjaerulff, L., Sikandar, A., Zaburannyi, N., Adam, S., Herrmann, J., Koehnke, J. and Müller, R. (2017) Thioholgamides: Thioamide-Containing Cytotoxic RiPP Natural Products. ACS Chem. Biol., 12, 2837–2841.
OpenUrl
25.↵
Feistner, G. and Staub, C.M. (1986) 6-Thioguanine from Erwinia amylovora. Curr. Microbiol., 13, 95–101.
OpenUrl
26.
Kim, H.J., Graham, D.W., Dispirito, A.A., Alterman, M.A., Galeva, N., Larive, C.K., Asunskis, D. and Sherwood, P.M.A. (2004) Methanobactin, a copper-acquisition compound from methane-oxidizing bacteria. Science, 305, 1612–1615.
OpenUrl Abstract/FREE Full Text
27.
Pan, M., Mabry, T.J., Beale, J.M. and Mamiya, B.M. (1997) Nonprotein amino acids from Cycas revoluta. Phytochemistry, 45, 517–519.
OpenUrl PubMed
28.
Lincke, T., Behnken, S., Ishida, K., Roth, M. and Hertweck, C. (2010) Closthioamide: an unprecedented polythioamide antibiotic from the strictly anaerobic bacterium Clostridium cellulolyticum. Angew. Chem. Int. Ed., 49, 2011–2013.
OpenUrl CrossRef PubMed
29.
Banala, S. and Süssmuth, R.D. (2010) Thioamides in nature: in search of secondary metabolites in anaerobic microorganisms. ChemBioChem, 11, 1335–1337.
OpenUrl CrossRef PubMed
30.
Dunbar, K.L., Scharf, D.H., Litomska, A. and Hertweck, C. (2017) Enzymatic Carbon-Sulfur Bond Formation in Natural Product Biosynthesis. Chem. Rev., 117, 5521–5577.
OpenUrl
31.↵
Litomska, A., Ishida, K., Dunbar, K.L., Boettger, M., Coyne, S. and Hertweck, C. (2018) Enzymatic Thioamide Formation in a Bacterial Antimetabolite Pathway. Angew. Chem. Int. Ed., 57, 11574–11578.
OpenUrl
32.↵
Nayak, D.D., Mahanta, N., Mitchell, D.A. and Metcalf, W.W. (2017) Post-translational thioamidation of methyl-coenzyme M reductase, a key enzyme in methanogenic and methanotrophic Archaea. eLife, 6, e29218.
OpenUrl CrossRef PubMed
33.↵
Mahanta, N., Liu, A., Dong, S., Nair, S.K. and Mitchell, D.A. (2018) Enzymatic reconstitution of ribosomal peptide backbone thioamidation. Proc. Natl. Acad. Sci. U.S.A., 115, 3030–3035.
OpenUrl Abstract/FREE Full Text
34.↵
Haft, D.H. (2011) Bioinformatic evidence for a widely distributed, ribosomally produced electron carrier precursor, its maturation proteins, and its nicotinoprotein redox partners. BMC Genomics, 12, 21.
OpenUrl CrossRef PubMed
35.↵
Morinaka, B.I., Verest, M., Freeman, M.F., Gugger, M. and Piel, J. (2017) An Orthogonal D₂O-Based Induction System that Provides Insights into D-Amino Acid Pattern Formation by Radical S-Adenosylmethionine Peptide Epimerases. Angew. Chem. Int. Ed., 56, 762–766.
OpenUrl CrossRef PubMed
36.↵
Hyatt, D., Chen, G.-L., Locascio, P.F., Land, M.L., Larimer, F.W. and Hauser, L.J. (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics, 11, 119.
OpenUrl CrossRef PubMed
37.↵
Finn, R.D., Coggill, P., Eberhardt, R.Y., Eddy, S.R., Mistry, J., Mitchell, A.L., Potter, S.C., Punta, M., Qureshi, M., Sangrador-Vegas, A., et al. (2016) The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res., 44, D279–D285.
OpenUrl CrossRef PubMed
38.↵
Ahmed, M.N., Reyna-González, E., Schmid, B., Wiebach, V., Süssmuth, R.D., Dittmann, E. and Fewer, D.P. (2017) Phylogenomic Analysis of the Microviridin Biosynthetic Pathway Coupled with Targeted Chemo-Enzymatic Synthesis Yields Potent Protease Inhibitors. ACS Chem. Biol., 12, 1538–1546.
OpenUrl CrossRef
39.↵
Haft, D.H., DiCuccio, M., Badretdin, A., Brover, V., Chetvernin, V., O’Neill, K., Li, W., Chitsaz, F., Derbyshire, M.K., Gonzales, N.R., et al. (2018) RefSeq: an update on prokaryotic genome annotation and curation. Nucleic Acids Res., 46, D851–D860.
OpenUrl CrossRef PubMed
40.↵
Halary, S., McInerney, J.O., Lopez, P. and Bapteste, E. (2013) EGN: a wizard for construction of gene and genome similarity networks. BMC Evol. Biol., 13, 146.
OpenUrl CrossRef PubMed
41.↵
Latham, J.A., Iavarone, A.T., Barr, I., Juthani, P.V. and Klinman, J.P. (2015) PqqD is a novel peptide chaperone that forms a ternary complex with the radical S-adenosylmethionine protein PqqE in the pyrroloquinoline quinone biosynthetic pathway. J. Biol. Chem., 290, 12908–12918.
OpenUrl Abstract/FREE Full Text
42.↵
Burkhart, B.J., Hudson, G.A., Dunbar, K.L. and Mitchell, D.A. (2015) A prevalent peptide-binding domain guides ribosomal natural product biosynthesis. Nat. Chem. Biol., 11, 564–570.
OpenUrl CrossRef PubMed
43.↵
Medema, M.H., Takano, E. and Breitling, R. (2013) Detecting sequence homology at the gene cluster level with MultiGeneBlast. Mol. Biol. Evol., 30, 1218–1223.
OpenUrl CrossRef PubMed Web of Science
44.↵
Dorrestein, P.C., Zhai, H., McLafferty, F.W. and Begley, T.P. (2004) The biosynthesis of the thiazole phosphate moiety of thiamin: the sulfur transfer mediated by the sulfur carrier protein ThiS. Chem. Biol., 11, 1373–1381.
OpenUrl PubMed
45.↵
Fuchs, S.W., Lackner, G., Morinaka, B.I., Morishita, Y., Asai, T., Riniker, S. and Piel, J. (2016) A Lanthipeptide-like N-Terminal Leader Region Guides Peptide Epimerization by Radical SAM Epimerases: Implications for RiPP Evolution. Angew. Chem. Int. Ed., 55, 12330–12333.
OpenUrl
46.↵
Noike, M., Matsui, T., Ooya, K., Sasaki, I., Ohtaki, S., Hamano, Y., Maruyama, C., Ishikawa, J., Satoh, Y., Ito, H., et al. (2015) A peptide ligase and the ribosome cooperate to synthesize the peptide pheganomycin. Nat. Chem. Biol., 11, 71–76.
OpenUrl CrossRef PubMed
47.↵
Tang, X., Li, J., Millán-Aguiñaga, N., Zhang, J.J., O’Neill, E.C., Ugalde, J.A., Jensen, P.R., Mantovani, S.M. and Moore, B.S. (2015) Identification of Thiotetronic Acid Antibiotic Biosynthetic Pathways by Target-directed Genome Mining. ACS Chem. Biol., 10, 2841–2849.
OpenUrl CrossRef
48.↵
Yamanaka, K., Reynolds, K.A., Kersten, R.D., Ryan, K.S., Gonzalez, D.J., Nizet, V., Dorrestein, P.C. and Moore, B.S. (2014) Direct cloning and refactoring of a silent lipopeptide biosynthetic gene cluster yields the antibiotic taromycin A. Proc. Natl. Acad. Sci. U.S.A., 111, 1957–1962.
OpenUrl Abstract/FREE Full Text
49.↵
Gomez-Escribano, J.P. and Bibb, M.J. (2011) Engineering Streptomyces coelicolor for heterologous expression of secondary metabolite gene clusters. Microb. Biotechnol., 4, 207–215.
OpenUrl CrossRef PubMed
50.↵
Gust, B., Challis, G.L., Fowler, K., Kieser, T. and Chater, K.F. (2003) PCR-targeted Streptomyces gene replacement identifies a protein domain needed for biosynthesis of the sesquiterpene soil odor geosmin. Proc. Natl. Acad. Sci. U.S.A., 100, 1541–1546.
OpenUrl Abstract/FREE Full Text
51.↵
Kikuchi, G., Yoshida, T. and Noguchi, M. (2005) Heme oxygenase and heme degradation. Biochem. Biophys. Res. Commun., 338, 558–567.
OpenUrl CrossRef PubMed Web of Science
52.↵
Judge, R.H., Moule, D.C. and Goddard, J.D. (1987) Thioamide spectroscopy: long path length absorption and quantum chemical studies of thioformamide vapour, CHSNH2/CHSND2. Can. J. Chem., 65, 2100–2105.
OpenUrl
53.↵
Kuščer, E., Coates, N., Challis, I., Gregory, M., Wilkinson, B., Sheridan, R. and Petkovic, H. (2007) Roles of rapH and rapG in positive regulation of rapamycin biosynthesis in Streptomyces hygroscopicus. J. Bacteriol., 189, 4756–4763.
OpenUrl Abstract/FREE Full Text
54.↵
Mclntosh, J.A., Lin, Z., Tianero, M.D.B. and Schmidt, E.W. (2013) Aestuaramides, a natural library of cyanobactin cyclic peptides resulting from isoprene-derived Claisen rearrangements. ACS Chem. Biol., 8, 877–883.
OpenUrl CrossRef PubMed
55.↵
Ding, W., Liu, W.-Q., Jia, Y., Li, Y., van der Donk, W.A. and Zhang, Q. (2016) Biosynthetic investigation of phomopsins reveals a widespread pathway for ribosomal natural products in Ascomycetes. Proc. Natl. Acad. Sci. U.S.A., 113, 3521–3526.
OpenUrl Abstract/FREE Full Text
56.↵
García, I., Vior, N.M., González-Sabin, J., Braña, A.F., Rohr, J., Moris, F., Méndez, C. and Salas, J.A. (2013) Engineering the biosynthesis of the polyketide-nonribosomal peptide collismycin A for generation of analogs with neuroprotective activity. Chem. Biol., 20, 1022–1032.
OpenUrl CrossRef PubMed
57.↵
Ye, S., Molloy, B., Braña, A.F., Zabala, D., Olano, C., CortEs, J., Moris, F., Salas, J.A. and Méndez, C. (2017) Identification by Genome Mining of a Type I Polyketide Gene Cluster from Streptomyces argillaceus Involved in the Biosynthesis of Pyridine and Piperidine Alkaloids Argimycins P. Front Microbiol., 8, 194.
OpenUrl
58.↵
Khaliullin, B., Ayikpoe, R., Tuttle, M. and Latham, J.A. (2017) Mechanistic elucidation of the mycofactocin-biosynthetic radical S-adenosylmethionine protein, MftC. J. Biol. Chem., 292, 13022–13033.
OpenUrl Abstract/FREE Full Text
59.↵
Morinaka, B.I., Lakis, E., Verest, M., Helf, M.J., Scalvenzi, T., Vagstad, A.L., Sims, J., Sunagawa, S., Gugger, M. and Piel, J. (2018) Natural noncanonical protein splicing yields products with diverse β-amino acid residues. Science, 359, 779–782.
OpenUrl Abstract/FREE Full Text
60.↵
Skinnider, M.A., Merwin, N.J., Johnston, C.W. and Magarvey, N.A. (2017) PRISM 3: expanded prediction of natural product chemical structures from microbial genomes. Nucleic Acids Res., 45, W49–W54.
OpenUrl

View the discussion thread.

Posted December 13, 2018.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Microbiology

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11715)
Bioengineering (8723)
Bioinformatics (29128)
Biophysics (14935)
Cancer Biology (12049)
Cell Biology (17359)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14144)
Epidemiology (2067)
Evolutionary Biology (18268)
Genetics (12221)
Genomics (16767)
Immunology (11843)
Microbiology (28014)
Molecular Biology (11560)
Neuroscience (60810)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10384)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] 1.↵
Baltz, R.H. (2017) Gifted microbes for genome mining and natural product discovery. J. Ind. Microbiol. Biotechnol., 44, 573–588.
OpenUrl CrossRef

[2] 2.↵
Bentley, S.D., Chater, K.F., Cerdeño-Tárraga, A.-M., Challis, G.L., Thomson, N.R., James, K.D., Harris, D.E., Quail, M.A., Kieser, H., Harper, D., et al. (2002) Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature, 417, 141–147.
OpenUrl CrossRef PubMed Web of Science

[3] 3.↵
Ikeda, H., Ishikawa, J., Hanamoto, A., Shinose, M., Kikuchi, H., Shiba, T., Sakaki, Y., Hattori, M. and Ōmura, S. (2003) Complete genome sequence and comparative analysis of the industrial microorganism Streptomyces avermitilis. Nat. Biotechnol., 21, 526–531.
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Arnison, P.G., Bibb, M.J., Bierbaum, G., Bowers, A.A., Bugni, T.S., Bulaj, G., Camarero, J.A., Campopiano, D.J., Challis, G.L., Clardy, J., et al. (2013) Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. Nat. Prod. Rep., 30, 108–160.
OpenUrl CrossRef PubMed

[5] 5.↵
Goto, Y., Li, B., Claesen, J., Shi, Y., Bibb, M.J. and van der Donk, W.A. (2010) Discovery of unique lanthionine synthetases reveals new mechanistic and evolutionary insights. PLOS Biol., 8, e1000339.
OpenUrl CrossRef PubMed

[6] 6.↵
Frattaruolo, L., Lacret, R., Cappello, A.R. and Truman, A.W. (2017) A Genomics-Based Approach Identifies a Thioviridamide-Like Compound with Selective Anticancer Activity. ACS Chem. Biol., 12, 2815–2822.
OpenUrl

[7] 7.↵
Haft, D.H. and Basu, M.K. (2011) Biological Systems Discovery In Silico: Radical S-Adenosylmethionine Protein Families and Their Target Peptides for Posttranslational Modification. J. Bacteriol., 193, 2745–2755.
OpenUrl Abstract/FREE Full Text

[8] 8.↵
Haft, D.H., Basu, M.K. and Mitchell, D.A. (2010) Expansion of ribosomally produced natural products: a nitrile hydratase-and Nif11-related precursor family. BMC Biol., 8, 70.
OpenUrl CrossRef PubMed

[9] 9.↵
Cox, C.L., Doroghazi, J.R. and Mitchell, D.A. (2015) The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles. BMC Genomics, 16, 778.
OpenUrl CrossRef PubMed

[10] 10.↵
van Heel, A.J., de Jong, A., Montalbán-López, M., Kok, J. and Kuipers, O.P. (2013) BAGEL3: Automated identification of genes encoding bacteriocins and (non-)bactericidal posttranslationally modified peptides. Nucleic Acids Res., 41, W448–W453.
OpenUrl CrossRef PubMed Web of Science

[11] 11.↵
Skinnider, M.A., Johnston, C.W., Edgar, R.E., Dejong, C.A., Merwin, N.J., Rees, P.N. and Magarvey, N.A. (2016) Genomic charting of ribosomally synthesized natural product chemical space facilitates targeted mining. Proc. Natl. Acad. Sci. U.S.A., 113, E6343–E6351.
OpenUrl Abstract/FREE Full Text

[12] 12.↵
Blin, K., Wolf, T., Chevrette, M.G., Lu, X., Schwalen, C.J., Kautsar, S.A., Suarez Duran, H.G., de Los Santos, E.L.C., Kim, H.U., Nave, M., et al. (2017) antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification. Nucleic Acids Res., 45, W36–W41.
OpenUrl CrossRef PubMed

[13] 13.↵
Tietz, J.I., Schwalen, C.J., Patel, P.S., Maxson, T., Blair, P.M., Tai, H.-C., Zakai, U.I. and Mitchell, D.A. (2017) A new genome-mining tool redefines the lasso peptide biosynthetic landscape. Nat. Chem. Biol., 13, 470–478.
OpenUrl CrossRef PubMed

[14] 14.↵
Schwalen, C.J., Hudson, G.A., Kille, B. and Mitchell, D.A. (2018) Bioinformatic Expansion and Discovery of Thiopeptide Antibiotics. J. Am. Chem. Soc., 140, 9494–9501.
OpenUrl

[15] 15.↵
Burkhart, B.J., Schwalen, C.J., Mann, G., Naismith, J.H. and Mitchell, D.A. (2017) YcaO-Dependent Posttranslational Amide Activation: Biosynthesis, Structure, and Function. Chem. Rev., 117, 5389–5456.
OpenUrl

[16] 16.↵
Dunbar, K.L., Melby, J.O. and Mitchell, D.A. (2012) YcaO domains use ATP to activate amide backbones during peptide cyclodehydrations. Nat. Chem. Biol., 8, 569–575.
OpenUrl CrossRef PubMed

[17] 17.↵
Crone, W.J.K., Vior, N.M., Santos-Aberturas, J., Schmitz, L.G., Leeper, F.J. and Truman, A.W. (2016) Dissecting Bottromycin Biosynthesis Using Comparative Untargeted Metabolomics. Angew. Chem. Int. Ed., 55, 9639–9643.
OpenUrl CrossRef

[18] 18.
Franz, L., Adam, S., Santos-Aberturas, J., Truman, A.W. and Koehnke, J. (2017) Macroamidine Formation in Bottromycins Is Catalyzed by a Divergent YcaO Enzyme. J. Am. Chem. Soc., 139, 18158–18161.
OpenUrl

[19] 19.↵
Schwalen, C.J., Hudson, G.A., Kosol, S., Mahanta, N., Challis, G.L. and Mitchell, D.A. (2017) In Vitro Biosynthetic Studies of Bottromycin Expand the Enzymatic Capabilities of the YcaO Superfamily. J. Am. Chem. Soc., 139, 18154–18157.
OpenUrl

[20] 20.↵
Izawa, M., Kawasaki, T. and Hayakawa, Y. (2013) Cloning and heterologous expression of the thioviridamide biosynthesis gene cluster from Streptomyces olivoviridis. Appl. Environ. Microbiol., 79, 7110–7113.
OpenUrl Abstract/FREE Full Text

[21] 21.
Izawa, M., Nagamine, S., Aoki, H. and Hayakawa, Y. (2018) Identification of essential biosynthetic genes and a true biosynthetic product for thioviridamide. J. Gen. Appl. Microbiol., 64, 50–53.
OpenUrl

[22] 22.
Kawahara, T., Izumikawa, M., Kozone, I., Hashimoto, J., Kagaya, N., Koiwai, H., Komatsu, M., Fujie, M., Sato, N., Ikeda, H., et al. (2018) Neothioviridamide, a Polythioamide Compound Produced by Heterologous Expression of a Streptomyces sp. Cryptic RiPP Biosynthetic Gene Cluster. J. Nat. Prod., 81, 264–269.
OpenUrl

[23] 23.
Hayakawa, Y., Sasaki, K., Nagai, K., Shin-ya, K. and Furihata, K. (2006) Structure of thioviridamide, a novel apoptosis inducer from Streptomyces olivoviridis. J. Antibiot., 59, 6–10.
OpenUrl CrossRef PubMed

[24] 24.↵
Kjaerulff, L., Sikandar, A., Zaburannyi, N., Adam, S., Herrmann, J., Koehnke, J. and Müller, R. (2017) Thioholgamides: Thioamide-Containing Cytotoxic RiPP Natural Products. ACS Chem. Biol., 12, 2837–2841.
OpenUrl

[25] 25.↵
Feistner, G. and Staub, C.M. (1986) 6-Thioguanine from Erwinia amylovora. Curr. Microbiol., 13, 95–101.
OpenUrl

[26] 26.
Kim, H.J., Graham, D.W., Dispirito, A.A., Alterman, M.A., Galeva, N., Larive, C.K., Asunskis, D. and Sherwood, P.M.A. (2004) Methanobactin, a copper-acquisition compound from methane-oxidizing bacteria. Science, 305, 1612–1615.
OpenUrl Abstract/FREE Full Text

[27] 27.
Pan, M., Mabry, T.J., Beale, J.M. and Mamiya, B.M. (1997) Nonprotein amino acids from Cycas revoluta. Phytochemistry, 45, 517–519.
OpenUrl PubMed

[28] 28.
Lincke, T., Behnken, S., Ishida, K., Roth, M. and Hertweck, C. (2010) Closthioamide: an unprecedented polythioamide antibiotic from the strictly anaerobic bacterium Clostridium cellulolyticum. Angew. Chem. Int. Ed., 49, 2011–2013.
OpenUrl CrossRef PubMed

[29] 29.
Banala, S. and Süssmuth, R.D. (2010) Thioamides in nature: in search of secondary metabolites in anaerobic microorganisms. ChemBioChem, 11, 1335–1337.
OpenUrl CrossRef PubMed

[30] 30.
Dunbar, K.L., Scharf, D.H., Litomska, A. and Hertweck, C. (2017) Enzymatic Carbon-Sulfur Bond Formation in Natural Product Biosynthesis. Chem. Rev., 117, 5521–5577.
OpenUrl

[31] 31.↵
Litomska, A., Ishida, K., Dunbar, K.L., Boettger, M., Coyne, S. and Hertweck, C. (2018) Enzymatic Thioamide Formation in a Bacterial Antimetabolite Pathway. Angew. Chem. Int. Ed., 57, 11574–11578.
OpenUrl

[32] 32.↵
Nayak, D.D., Mahanta, N., Mitchell, D.A. and Metcalf, W.W. (2017) Post-translational thioamidation of methyl-coenzyme M reductase, a key enzyme in methanogenic and methanotrophic Archaea. eLife, 6, e29218.
OpenUrl CrossRef PubMed

[33] 33.↵
Mahanta, N., Liu, A., Dong, S., Nair, S.K. and Mitchell, D.A. (2018) Enzymatic reconstitution of ribosomal peptide backbone thioamidation. Proc. Natl. Acad. Sci. U.S.A., 115, 3030–3035.
OpenUrl Abstract/FREE Full Text

[34] 34.↵
Haft, D.H. (2011) Bioinformatic evidence for a widely distributed, ribosomally produced electron carrier precursor, its maturation proteins, and its nicotinoprotein redox partners. BMC Genomics, 12, 21.
OpenUrl CrossRef PubMed

[35] 35.↵
Morinaka, B.I., Verest, M., Freeman, M.F., Gugger, M. and Piel, J. (2017) An Orthogonal D₂O-Based Induction System that Provides Insights into D-Amino Acid Pattern Formation by Radical S-Adenosylmethionine Peptide Epimerases. Angew. Chem. Int. Ed., 56, 762–766.
OpenUrl CrossRef PubMed

[36] 36.↵
Hyatt, D., Chen, G.-L., Locascio, P.F., Land, M.L., Larimer, F.W. and Hauser, L.J. (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics, 11, 119.
OpenUrl CrossRef PubMed

[37] 37.↵
Finn, R.D., Coggill, P., Eberhardt, R.Y., Eddy, S.R., Mistry, J., Mitchell, A.L., Potter, S.C., Punta, M., Qureshi, M., Sangrador-Vegas, A., et al. (2016) The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res., 44, D279–D285.
OpenUrl CrossRef PubMed

[38] 38.↵
Ahmed, M.N., Reyna-González, E., Schmid, B., Wiebach, V., Süssmuth, R.D., Dittmann, E. and Fewer, D.P. (2017) Phylogenomic Analysis of the Microviridin Biosynthetic Pathway Coupled with Targeted Chemo-Enzymatic Synthesis Yields Potent Protease Inhibitors. ACS Chem. Biol., 12, 1538–1546.
OpenUrl CrossRef

[39] 39.↵
Haft, D.H., DiCuccio, M., Badretdin, A., Brover, V., Chetvernin, V., O’Neill, K., Li, W., Chitsaz, F., Derbyshire, M.K., Gonzales, N.R., et al. (2018) RefSeq: an update on prokaryotic genome annotation and curation. Nucleic Acids Res., 46, D851–D860.
OpenUrl CrossRef PubMed

[40] 40.↵
Halary, S., McInerney, J.O., Lopez, P. and Bapteste, E. (2013) EGN: a wizard for construction of gene and genome similarity networks. BMC Evol. Biol., 13, 146.
OpenUrl CrossRef PubMed

[41] 41.↵
Latham, J.A., Iavarone, A.T., Barr, I., Juthani, P.V. and Klinman, J.P. (2015) PqqD is a novel peptide chaperone that forms a ternary complex with the radical S-adenosylmethionine protein PqqE in the pyrroloquinoline quinone biosynthetic pathway. J. Biol. Chem., 290, 12908–12918.
OpenUrl Abstract/FREE Full Text

[42] 42.↵
Burkhart, B.J., Hudson, G.A., Dunbar, K.L. and Mitchell, D.A. (2015) A prevalent peptide-binding domain guides ribosomal natural product biosynthesis. Nat. Chem. Biol., 11, 564–570.
OpenUrl CrossRef PubMed

[43] 43.↵
Medema, M.H., Takano, E. and Breitling, R. (2013) Detecting sequence homology at the gene cluster level with MultiGeneBlast. Mol. Biol. Evol., 30, 1218–1223.
OpenUrl CrossRef PubMed Web of Science

[44] 44.↵
Dorrestein, P.C., Zhai, H., McLafferty, F.W. and Begley, T.P. (2004) The biosynthesis of the thiazole phosphate moiety of thiamin: the sulfur transfer mediated by the sulfur carrier protein ThiS. Chem. Biol., 11, 1373–1381.
OpenUrl PubMed

[45] 45.↵
Fuchs, S.W., Lackner, G., Morinaka, B.I., Morishita, Y., Asai, T., Riniker, S. and Piel, J. (2016) A Lanthipeptide-like N-Terminal Leader Region Guides Peptide Epimerization by Radical SAM Epimerases: Implications for RiPP Evolution. Angew. Chem. Int. Ed., 55, 12330–12333.
OpenUrl

[46] 46.↵
Noike, M., Matsui, T., Ooya, K., Sasaki, I., Ohtaki, S., Hamano, Y., Maruyama, C., Ishikawa, J., Satoh, Y., Ito, H., et al. (2015) A peptide ligase and the ribosome cooperate to synthesize the peptide pheganomycin. Nat. Chem. Biol., 11, 71–76.
OpenUrl CrossRef PubMed

[47] 47.↵
Tang, X., Li, J., Millán-Aguiñaga, N., Zhang, J.J., O’Neill, E.C., Ugalde, J.A., Jensen, P.R., Mantovani, S.M. and Moore, B.S. (2015) Identification of Thiotetronic Acid Antibiotic Biosynthetic Pathways by Target-directed Genome Mining. ACS Chem. Biol., 10, 2841–2849.
OpenUrl CrossRef

[48] 48.↵
Yamanaka, K., Reynolds, K.A., Kersten, R.D., Ryan, K.S., Gonzalez, D.J., Nizet, V., Dorrestein, P.C. and Moore, B.S. (2014) Direct cloning and refactoring of a silent lipopeptide biosynthetic gene cluster yields the antibiotic taromycin A. Proc. Natl. Acad. Sci. U.S.A., 111, 1957–1962.
OpenUrl Abstract/FREE Full Text

[49] 49.↵
Gomez-Escribano, J.P. and Bibb, M.J. (2011) Engineering Streptomyces coelicolor for heterologous expression of secondary metabolite gene clusters. Microb. Biotechnol., 4, 207–215.
OpenUrl CrossRef PubMed

[50] 50.↵
Gust, B., Challis, G.L., Fowler, K., Kieser, T. and Chater, K.F. (2003) PCR-targeted Streptomyces gene replacement identifies a protein domain needed for biosynthesis of the sesquiterpene soil odor geosmin. Proc. Natl. Acad. Sci. U.S.A., 100, 1541–1546.
OpenUrl Abstract/FREE Full Text

[51] 51.↵
Kikuchi, G., Yoshida, T. and Noguchi, M. (2005) Heme oxygenase and heme degradation. Biochem. Biophys. Res. Commun., 338, 558–567.
OpenUrl CrossRef PubMed Web of Science

[52] 52.↵
Judge, R.H., Moule, D.C. and Goddard, J.D. (1987) Thioamide spectroscopy: long path length absorption and quantum chemical studies of thioformamide vapour, CHSNH2/CHSND2. Can. J. Chem., 65, 2100–2105.
OpenUrl

[53] 53.↵
Kuščer, E., Coates, N., Challis, I., Gregory, M., Wilkinson, B., Sheridan, R. and Petkovic, H. (2007) Roles of rapH and rapG in positive regulation of rapamycin biosynthesis in Streptomyces hygroscopicus. J. Bacteriol., 189, 4756–4763.
OpenUrl Abstract/FREE Full Text

[54] 54.↵
Mclntosh, J.A., Lin, Z., Tianero, M.D.B. and Schmidt, E.W. (2013) Aestuaramides, a natural library of cyanobactin cyclic peptides resulting from isoprene-derived Claisen rearrangements. ACS Chem. Biol., 8, 877–883.
OpenUrl CrossRef PubMed

[55] 55.↵
Ding, W., Liu, W.-Q., Jia, Y., Li, Y., van der Donk, W.A. and Zhang, Q. (2016) Biosynthetic investigation of phomopsins reveals a widespread pathway for ribosomal natural products in Ascomycetes. Proc. Natl. Acad. Sci. U.S.A., 113, 3521–3526.
OpenUrl Abstract/FREE Full Text

[56] 56.↵
García, I., Vior, N.M., González-Sabin, J., Braña, A.F., Rohr, J., Moris, F., Méndez, C. and Salas, J.A. (2013) Engineering the biosynthesis of the polyketide-nonribosomal peptide collismycin A for generation of analogs with neuroprotective activity. Chem. Biol., 20, 1022–1032.
OpenUrl CrossRef PubMed

[57] 57.↵
Ye, S., Molloy, B., Braña, A.F., Zabala, D., Olano, C., CortEs, J., Moris, F., Salas, J.A. and Méndez, C. (2017) Identification by Genome Mining of a Type I Polyketide Gene Cluster from Streptomyces argillaceus Involved in the Biosynthesis of Pyridine and Piperidine Alkaloids Argimycins P. Front Microbiol., 8, 194.
OpenUrl

[58] 58.↵
Khaliullin, B., Ayikpoe, R., Tuttle, M. and Latham, J.A. (2017) Mechanistic elucidation of the mycofactocin-biosynthetic radical S-adenosylmethionine protein, MftC. J. Biol. Chem., 292, 13022–13033.
OpenUrl Abstract/FREE Full Text

[59] 59.↵
Morinaka, B.I., Lakis, E., Verest, M., Helf, M.J., Scalvenzi, T., Vagstad, A.L., Sims, J., Sunagawa, S., Gugger, M. and Piel, J. (2018) Natural noncanonical protein splicing yields products with diverse β-amino acid residues. Science, 359, 779–782.
OpenUrl Abstract/FREE Full Text

[60] 60.↵
Skinnider, M.A., Merwin, N.J., Johnston, C.W. and Magarvey, N.A. (2017) PRISM 3: expanded prediction of natural product chemical structures from microbial genomes. Nucleic Acids Res., 45, W49–W54.
OpenUrl