<< All versions
Skill v1.0.1
currentAutomated scan100/100majiayu000/claude-skill-registry-data/quality-reports
3 files
──Details
PublishedMay 15, 2026 at 06:23 PM
Content Hashsha256:46bbdaaf139d785e...
Git SHA01042ae58061
Bump Typepatch
──Files
Files (1 file, 4.6 KB)
SKILL.md4.6 KBactive
SKILL.md · 192 lines · 4.6 KB
version: "1.0.1" name: bio-read-qc-quality-reports description: Generate and interpret quality reports from FASTQ files using FastQC and MultiQC. Assess per-base quality, adapter content, GC bias, duplication levels, and overrepresented sequences. Use when performing initial QC on raw sequencing data or validating preprocessing results. tool_type: cli primary_tool: fastqc
Quality Reports
Generate quality reports for FASTQ files using FastQC and aggregate multiple reports with MultiQC.
FastQC - Single Sample Reports
Basic Usage
bash
# Single filefastqc sample.fastq.gz# Multiple filesfastqc *.fastq.gz# Specify output directoryfastqc -o qc_reports/ sample_R1.fastq.gz sample_R2.fastq.gz# Set threadsfastqc -t 4 *.fastq.gz
Output Files
FastQC produces two files per input:
sample_fastqc.html- Interactive HTML reportsample_fastqc.zip- Data files and images
Key Modules
| Module | What It Shows | Warning Signs | |
|---|---|---|---|
| Per base sequence quality | Quality scores across read | Drop below Q20 at 3' end | |
| Per sequence quality | Quality score distribution | Bimodal distribution | |
| Per base sequence content | Nucleotide composition | Imbalance at start (normal) | |
| Per sequence GC content | GC distribution | Secondary peak (contamination) | |
| Per base N content | Unknown bases | High N content | |
| Sequence length distribution | Read lengths | Unexpected variation | |
| Sequence duplication | Duplicate reads | High duplication (PCR) | |
| Overrepresented sequences | Common sequences | Adapter contamination | |
| Adapter content | Adapter sequences | Visible adapter curves |
Extract Data from ZIP
bash
# Unzip to access raw dataunzip sample_fastqc.zip# View summarycat sample_fastqc/summary.txt# Get per-base qualitycat sample_fastqc/fastqc_data.txt | grep -A 50 ">>Per base sequence quality"
MultiQC - Aggregate Reports
Basic Usage
bash
# Aggregate all FastQC reports in current directorymultiqc .# Specify input and outputmultiqc qc_reports/ -o multiqc_output/# Custom report namemultiqc . -n my_project_qc# Force overwritemultiqc . -f
Common Options
bash
# Flat directory (no sample subdirs)multiqc --flat .# Export data as TSVmultiqc . --export# Only specific modulesmultiqc . -m fastqc# Exclude patternsmultiqc . --ignore '*_trimmed*'# Include patternsmultiqc . --ignore-samples '*negative*'
Output Files
multiqc_report.html- Interactive HTML reportmultiqc_data/- Directory with data tablesmultiqc_fastqc.txt- FastQC metricsmultiqc_general_stats.txt- Summary statisticsmultiqc_sources.txt- Source files used
Extract Data Programmatically
python
import pandas as pdgeneral_stats = pd.read_csv('multiqc_data/multiqc_general_stats.txt', sep='\t')print(general_stats.columns)fastqc_data = pd.read_csv('multiqc_data/multiqc_fastqc.txt', sep='\t')
Batch Processing
Process Multiple Samples
bash
# All FASTQ files in parallelfastqc -t 8 -o qc_reports/ raw_data/*.fastq.gz# Then aggregatemultiqc qc_reports/ -o multiqc_output/
Before and After Trimming
bash
# Create separate directoriesmkdir -p qc_reports/raw qc_reports/trimmed# QC raw readsfastqc -o qc_reports/raw/ raw_data/*.fastq.gz# After trimming (using fastp, cutadapt, etc.)fastqc -o qc_reports/trimmed/ trimmed_data/*.fastq.gz# Compare with MultiQCmultiqc qc_reports/ -o qc_comparison/
Interpretation Guide
Quality Scores
| Phred Score | Error Rate | Interpretation | |
|---|---|---|---|
| Q40 | 0.0001 | Excellent | |
| Q30 | 0.001 | Good (Illumina target) | |
| Q20 | 0.01 | Acceptable | |
| Q10 | 0.1 | Poor |
Common Issues
| Issue | Likely Cause | Action | |
|---|---|---|---|
| Low quality at 3' end | Normal degradation | Trim 3' end | |
| Adapter contamination | Short inserts | Trim adapters | |
| GC bias | Library prep | Consider correction | |
| High duplication | Low complexity, PCR | Mark/remove duplicates | |
| Overrepresented seqs | Adapters, primers | Check sequences |
Configuration
Custom Adapters
Create ~/.fastqc/Configuration/adapter_list.txt:
Custom_Adapter_Name ACGTACGTACGT
Custom Limits
Create ~/.fastqc/Configuration/limits.txt to customize thresholds:
# Warn if mean quality below 25quality_sequence warn 25quality_sequence error 20
Related Skills
- adapter-trimming - Remove adapters detected by FastQC
- fastp-workflow - All-in-one QC and trimming
- sequence-io/read-sequences - FASTQ file reading/writing