User profiles for R. Chikhi
Rayan ChikhiInstitut Pasteur & CNRS Verified email at ens-cachan.org Cited by 7000 |
Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species
…, S Boisvert, JA Chapman, G Chapuis, R Chikhi… - …, 2013 - academic.oup.com
Background The process of generating raw genome sequence data continues to become
cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished …
cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished …
[HTML][HTML] Critical assessment of metagenome interpretation—a benchmark of metagenomics software
Methods for assembly, taxonomic profiling and binning are key to interpreting metagenome
data, but a lack of consensus about benchmarking complicates performance assessment. …
data, but a lack of consensus about benchmarking complicates performance assessment. …
Data structures based on k-mers for querying large collections of sequencing data sets
High-throughput sequencing data sets are usually deposited in public repositories (eg, the
European Nucleotide Archive) to ensure reproducibility. As the amount of data has reached …
European Nucleotide Archive) to ensure reproducibility. As the amount of data has reached …
Assemblathon 1: a competitive assessment of de novo short read assembly methods
Low-cost short read sequencing technology has revolutionized genomics, though it is only
just becoming practical for the high-quality de novo assembly of a novel large genome. We …
just becoming practical for the high-quality de novo assembly of a novel large genome. We …
Informed and automated k-mer size selection for genome assembly
R Chikhi, P Medvedev - Bioinformatics, 2014 - academic.oup.com
… (2010), we do a maximum-likelihood estimation of the parameters using the optim function
in R (BFGS algorithm). Let d be the number of distinct k-mers present in the histogram. Let …
in R (BFGS algorithm). Let d be the number of distinct k-mers present in the histogram. Let …
[HTML][HTML] Petabase-scale sequence alignment catalyses viral discovery
Public databases contain a planetary collection of nucleic acid sequences, but their systematic
exploration has been inhibited by a lack of efficient methods for searching this corpus, …
exploration has been inhibited by a lack of efficient methods for searching this corpus, …
[HTML][HTML] Space-efficient and exact de Bruijn graph representation based on a Bloom filter
… where r = m/n is the number of bits per element. For a fixed ratio r, minimizing Equation 1
yields the optimal number of hash functions h≈0.7r, for which F is approximately 0.6185 r . …
yields the optimal number of hash functions h≈0.7r, for which F is approximately 0.6185 r . …
DSK: k-mer counting with very low memory usage
Counting all the k-mers (substrings of length k) in DNA/RNA sequencing reads is the
preliminary step of many bioinformatics applications. However, state of the art k-mer counting …
preliminary step of many bioinformatics applications. However, state of the art k-mer counting …
Data Structures to Represent a Set of k-long DNA Sequences
R Chikhi, J Holub, P Medvedev - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
… An intermediate model was also considered by Chikhi et al. [2014], where S is parametrized
by the number of maximal unitigs … The type of index varies: The FM-index is used by Chikhi …
by the number of maximal unitigs … The type of index varies: The FM-index is used by Chikhi …
[HTML][HTML] Hybrids of RNA viruses and viroid-like elements replicate in fungi
… parasitica ACP43 were incubated in RNAse R 1X reaction buffer, 1 U of RNAse R for 5,
15 or 30 minutes at 37 C. Negative controls were obtained mixing 2 µg of total RNA with the …
15 or 30 minutes at 37 C. Negative controls were obtained mixing 2 µg of total RNA with the …