This function creates a count table summarizing the number of occurrences of each observed sequence in a set of fastQ files. Input files are read from the given input directory and selected by the given fastQ suffix.
createCountTableFromFastQs(in_dir, fq_suffix = ".fq.gz", pheno_info)
in_dir | Input directory containing fastQ files to be analyzed |
---|---|
fq_suffix | Suffix of the fastQ files to select; default is ".fq.gz" |
pheno_info | A data frame with sample names as row names and file names in column 'sample'. The sample names will be used as headers in the count table. If not given, the file names of the fastQ files will be used as the headers of the count table. |
A data frame with unique sequences as row names and the corresponding number of occurrences for each sample in the columns.
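The layout expected for pheno_info is easiest to see in a small example. The sketch below builds such a data frame and passes it to the function using the signature shown above. The directory path, file names, and sample names are hypothetical placeholders, and the call is guarded so that it only runs when the function is loaded and the directory exists.

```r
# Hypothetical input directory; replace with the location of your fastQ files.
in_dir <- "path/to/fastq_dir"

# Sample names go in the row names, fastQ file names in the column 'sample'.
# The file and sample names here are made up for illustration.
pheno_info <- data.frame(
  sample = c("ctrl_rep1.fq.gz", "treat_rep1.fq.gz"),
  row.names = c("control_1", "treated_1"),
  stringsAsFactors = FALSE
)

# Run the call only if the function is available and the directory exists.
if (exists("createCountTableFromFastQs") && dir.exists(in_dir)) {
  count_table <- createCountTableFromFastQs(in_dir,
                                            fq_suffix = ".fq.gz",
                                            pheno_info = pheno_info)
  # Inspect the first rows of the resulting count table.
  print(head(count_table))
}
```

With this pheno_info, the documentation above implies that the count table columns would be headed control_1 and treated_1 rather than the file names.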