Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
8 changes: 4 additions & 4 deletions docs/04-genomic-file-formats/02-FASTQ-files.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,9 +34,9 @@ by newlines.
Random access within FASTQ files is not typical—generally, FASTQs are used
solely as input to some alignment process, which then produces a BAM file—so
they are gzipped (not bgzipped) to save space. Conventionally, FASTQ file names
indicate which read the files contains (e.g. Sample.fastq.gz for single-end
sequencing or Sample_R1.fastq.gz and Sample_R2.fastq.gz in paired-end sequencing
where _R1 stands for "read one(s)" and _R2 stands for "read two(s)").
indicate which read the files contains (e.g. `Sample.fastq.gz` for single-end
sequencing or `Sample_R1.fastq.gz` and `Sample_R2.fastq.gz` in paired-end sequencing
where `_R1` stands for "read one(s)" and `_R2` stands for "read two(s)").

Note that in the case of paired-end sequencing, it is crucial that each of the FASTQ files list their reads in same order. If even one read is deleted from either file, the entire read pairing will be off, which will have disastrous results during the alignment phase. To catch common formatting errors in single-end or paired-end FASTQ files, consider using [fqlib](https://github.com/stjude/fqlib) (specifically, the lint subcommand).

Expand Down Expand Up @@ -82,4 +82,4 @@ zcat Sample_R2.fastq.gz | head -n 7 | gzip -c > Sample_R2.bad.fastq.gz
fq lint Sample_R1.fastq.gz Sample_R2.bad.fastq.gz

# Sample_R2.bad.fastq.gz:8:1: [S004] CompleteValidator: Incomplete record: quality is empty
```
```