TY - JOUR T1 - Sequence based prediction of novel domains in the cellulosome of <em>Ruminiclostridium thermocellum</em> JF - bioRxiv DO - 10.1101/066357 SP - 066357 AU - Zarrin Basharat AU - Azra Yasmin Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/07/27/066357.abstract N2 - Ruminiclostridium thermocellum strain ATCC 27405 is valuable with reference to the next generation biofuel production being a degrader of crystalline cellulose. The completion of its genome sequence has revealed that this organism carries 3,376 genes with more than hundred genes encoding for enzymes involved in cellulysis. Novel protein domain discovery in the cellulose degrading enzyme complex of this strain has been attempted to understand this organism at molecular level. Streamlined automated methods were employed to generate possibly unreported or new domains. A set of 12 novel Pfam-B domains was developed after detailed analysis. This finding will enhance our understanding of this bacterium and its molecular processes involved in the degradation of cellulose. This approach of in silico analysis prior to experimentation facilitates in lab study. Previously uncorrelated data has been utilized for rapid generation of new biological information in this study.Note This research was conducted in 2014 for Clostridium thermocellum ATCC 27405. The bacterium was later reannotated as Ruminiclostridium thermocellum. See NCBI nonredundant RefSeq protein annotation details at http://www.ncbi.nlm.nih.gov/refseq/about/prokaryotes/reannotation/. The study utilizes Pfam-B database, which was discontinued with effect from release 28.0 (5/2015). Availability of new information, reannotation/modification in accession numbers might impact some of the analyzed values although effort has been made to provide latest accession numbers and reference strain parameters. The preprint version may contain grammatical and proofreading mistakes. Errors and omissions excepted. ER -