RT Journal Article SR Electronic T1 The G-box transcriptional regulatory code in Arabidopsis JF bioRxiv FD Cold Spring Harbor Laboratory SP 128371 DO 10.1101/128371 A1 Daphne Ezer A1 Samuel JK Shepherd A1 Anna Brestovitsky A1 Patrick Dickinson A1 Sandra Cortijo A1 Varodom Charoensawan A1 Mathew S. Box A1 Surojit Biswas A1 Philip Wigge YR 2017 UL http://biorxiv.org/content/early/2017/04/18/128371.abstract AB Plants have significantly more transcription factor (TF) families than animals and fungi, and plant TF families tend to contain more genes—these expansions are linked to adaptation to environmental stressors (1, 2). Many TF family members bind to similar or identical sequence motifs, such as G-boxes (CACGTG), so it is difficult to predict regulatory relationships. We determine that the flanking sequences near G-boxes help determine in vitro specificity, but that this is insufficient to predict the transcription pattern of genes near G-boxes. Therefore, we construct a gene regulatory network that identifies the set of bZIPs and bHLHs that are most predictive of the gene expression of genes downstream of perfect G-boxes. This network accurately predicts transcriptional patterns and reconstructs known regulatory subnetworks. Finally, we present Ara-BOX-cis (araboxcis.org), a website that provides interactive visualisations of the G-box regulatory network, a useful resource for generating predictions for gene regulatory relations.