User profiles for R. Durbin
Richard DurbinDept of Genetics, University of Cambridge Verified email at cam.ac.uk Cited by 306443 |
InterPro, progress and status in 2005
InterPro, an integrated documentation resource of protein families, domains and functional
sites, was created to integrate the major protein signature databases. Currently, it includes …
sites, was created to integrate the major protein signature databases. Currently, it includes …
A large genome center's improvements to the Illumina sequencing system
The Wellcome Trust Sanger Institute is one of the world's largest genome centers, and a
substantial amount of our sequencing is performed with 'next-generation' massively parallel …
substantial amount of our sequencing is performed with 'next-generation' massively parallel …
An overview of Ensembl
…, KC Woodwark, G Cameron, R Durbin… - Genome …, 2004 - genome.cshlp.org
Ensembl ( http://www.ensembl.org/ ) is a bioinformatics project to organize biological
information around the sequences of large genomes. It is a comprehensive source of stable …
information around the sequences of large genomes. It is a comprehensive source of stable …
[BOOK][B] Biological sequence analysis: probabilistic models of proteins and nucleic acids
Probabilistic models are becoming increasingly important in analysing the huge amount of
data being produced by large-scale DNA-sequencing efforts such as the Human Genome …
data being produced by large-scale DNA-sequencing efforts such as the Human Genome …
The complete sequence of a human genome
Since its initial release in 2000, the human reference genome has covered only the
euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. …
euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. …
The variant call format and VCFtools
The variant call format (VCF) is a generic format for storing DNA polymorphism data such as
SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is …
SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is …
The Pfam protein families database
Pfam is a large collection of protein families and domains. Over the past 2 years the number
of families in Pfam has doubled and now stands at 6190 (version 10.0). Methodology …
of families in Pfam has doubled and now stands at 6190 (version 10.0). Methodology …
[HTML][HTML] Towards complete and error-free genome assemblies of all vertebrate species
High-quality and complete reference genome assemblies are fundamental for the application
of genomics to biology, disease, and biodiversity conservation. However, such assemblies …
of genomics to biology, disease, and biodiversity conservation. However, such assemblies …
Systematic functional analysis of the Caenorhabditis elegans genome using RNAi
A principal challenge currently facing biologists is how to connect the complete DNA sequence
of an organism to its development and behaviour. Large-scale targeted-deletions have …
of an organism to its development and behaviour. Large-scale targeted-deletions have …
Pfam: clans, web tools and services
Pfam is a database of protein families that currently contains 7973 entries (release 18.0). A
recent development in Pfam has enabled the grouping of related families into clans. Pfam …
recent development in Pfam has enabled the grouping of related families into clans. Pfam …