nf-core/nanoseq
Nanopore demultiplexing, QC and alignment pipeline
3.0.0
). The latest
stable release is
3.1.0
.
Define where the pipeline should find input data and save output data.
Path to comma-separated file containing information about the samples in the experiment.
string
./samplesheet.csv
^\S+\.csv$
You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row. See usage docs.
Path to comma-separated file containing information about the samples in the experiment.
string
You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row. See usage docs.
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config
) then you don't need to specify this on the command line for every run.
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
Options required to basecall and demultiplex samples.
Path to Nanopore run directory files (e.g. 'fastq_pass/*') or a basecalled fastq file that requires demultiplexing. The latter can only be provided in conjunction with the '--skip_basecalling' parameter.
string
Flowcell used to perform the sequencing e.g. 'FLO-MIN106'. Not required if '--guppy_config' is specified.
string
Kit used to perform the sequencing e.g. 'SQK-LSK109'. Not required if '--guppy_config' is specified.
string
Barcode kit used to perform the sequencing e.g. 'SQK-PBK004'.
string
If you would like to skip the basecalling (--skip_basecalling
) but still perform the demultiplexing please specify a barcode kit that can be recognised by qcat:
| qcat
barcode kit specifications | description |
|-----------------------------------|-------------------------------------------------------------------------------|
| Auto
| Auto detect barcoding kit |
| RBK001
| Rapid barcoding kit |
| RBK004
| Rapid barcoding kit v4 |
| NBD103/NBD104
| Native barcoding kit with barcodes 1-12 |
| NBD114
| Native barcoding kit with barcodes 13-24 |
| NBD104/NBD114
| Native barcoding kit with barcodes 1-24 |
| PBC001
| PCR barcoding kits with 12 barcodes |
| PBC096
| PCR barcoding kits with 96 barcodes |
| RPB004/RLB001
| Rapid PCR Barcoding Kit (SQK-RPB004) and Rapid Low Input by PCR Barcoding Kit |
| RPB004/LWB001
| Low Input by PCR Barcoding Kit |
| RAB204
| 16S Rapid Amplicon Barcoding Kit with 12 Barcodes |
| VMK001
| Voltrax Barcoding Kit with 4 barcodes |
Require barcode on both ends for Guppy basecaller.
boolean
Whether to trim the barcodes from the output sequences in the FastQ files from Guppy basecaller.
boolean
Config file used for basecalling that will be passed to Guppy via the '--config' parameter.
string
Cannot be used in conjunction with --flowcell
and --kit
. This can be a local file (e.g. /your/dir/guppy_conf.cfg
) or a string specifying a configuration stored in the /opt/ont/guppy/data/
directory of Guppy.
Custom basecalling model file in json
format that will be passed to Guppy via the '--model' parameter.
string
Custom basecalling models can be trained with software such as Taiyaki. This can also be a string specifying a model stored in the /opt/ont/guppy/data
directory of Guppy.
Whether to demultiplex with Guppy in GPU mode.
boolean
Number of '--gpu_runners_per_device' used for Guppy when using '--guppy_gpu'.
integer
6
Number of '--cpu_threads_per_caller' used for Guppy when using '--guppy_gpu'.
integer
1
Basecalling device specified to Guppy in GPU mode using '--device'.
string
auto
Cluster options required to use GPU resources (e.g. '--part=gpu --gres=gpu:1').
string
Demultiplex fast5 files with demux_fast5.
boolean
Specify the minimum quality score for qcat in the range 0-100.
integer
60
Search for adapters in the whole read by applying the '--detect-middle' parameter in qcat.
boolean
Skip basecalling with Guppy.
boolean
Skip demultiplexing with Guppy/qcat.
boolean
Filter reads from FastQ files using NanoLyse
boolean
Fasta file to be filtered against using NanoLyse
string
Options to adjust parameters and filtering criteria for read alignments.
Specifies the aligner to use i.e. 'minimap2' or 'graphmap2'.
string
minimap2
Specifies if the data is strand-specific. Automatically activated when using '--protocol directRNA'.
boolean
When using --protocol
/--stranded
the following command-line arguments will be set for minimap2
and graphmap2
:
| nanoseq
input | minimap2
presets | graphmap2
presets |
|------------------------------|---------------------|---------------------|
| --protocol DNA
| -ax map-ont | no presets |
| --protocol cDNA
| -ax splice | -x rnaseq |
| --protocol directRNA
| -ax splice -uf -k14 | -x rnaseq |
| --protocol cDNA --stranded
| -ax splice -uf | -x rnaseq |
Save the '.sam' files from the alignment step - not done by default.
boolean
Skip alignment and downstream processes.
boolean
Options to adjust pameters for DNA varinat calling and structural variant calling.
Specifies if variant calling will executed.
boolean
Specifies the variant caller that will used to call small variants (available are: medaka, deepvariant, and pepper_margin_deepvariant). Variant calling is only available if '--call_variants' is set and the protocol is set to DNA
. Please note deepvariant
and pepper_margin_deepvariant
are only avaible if using singularity or docker.
string
medaka
Specifies the variant caller that will be used to call structural variants (available are: sniffles and cutesv). Structural variant calling is only available if '--call_variants' is set and the protocol is set to DNA
.
string
sniffles
Specifies if MNPs will be split into SNPs when using medaka.
boolean
Specifies whether to call variants with pepper_margin_deepvariant in GPU mode.
boolean
Specifies if vcf will be phased when using medaka.
boolean
Skip variant calling.
boolean
Skip structural variant calling.
boolean
Options to adjust quantification and differential analysis
Specifies the transcript quantification method to use (available are: bambu or stringtie2). Only available when protocol is cDNA or directRNA.
string
bambu
Skip transcript quantification and differential analysis.
boolean
Skip differential analysis with DESeq2 and DEXSeq.
boolean
Options to adjust the RNA fusion analysis
Specifies the reference directory for JAFFAL.
string
for_jaffal
Skip differential analysis with DESeq2 and DEXSeq.
boolean
Options to adjust the RNA modification analysis
Skip RNA modification analysis.
boolean
Skip differential modification analysis with xpore.
boolean
Skip m6A detection with m6anet.
boolean
Options to skip various steps within the workflow.
Skip BigBed file generation.
boolean
Skip BigWig file generation.
boolean
Skip pycoQC.
boolean
Skip NanoPlot.
boolean
Skip FastQC.
boolean
Skip MultiQC.
boolean
Skip all QC steps apart from MultiQC.
boolean
Reference genome related files and options required for the workflow.
Directory / URL base for iGenomes references.
string
s3://ngi-igenomes/igenomes
Do not load the iGenomes reference config.
boolean
Do not load igenomes.config
when running the pipeline. You may choose this option if you observe clashes between custom parameters and those supplied in igenomes.config
.
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
If you're running offline, Nextflow will not be able to fetch the institutional config files from the internet. If you don't need them, then this is not a problem. If you do need them, you should download the files from the repo and tell Nextflow where to find them with this parameter.
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Set the top limit for requested resources for any single job.
Maximum number of CPUs that can be requested for any single job.
integer
16
Use to set an upper-limit for the CPU requirement for each process. Should be an integer e.g. --max_cpus 1
Maximum amount of memory that can be requested for any single job.
string
128.GB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Use to set an upper-limit for the memory requirement for each process. Should be a string in the format integer-unit e.g. --max_memory '8.GB'
Maximum amount of time that can be requested for any single job.
string
240.h
^(\d+\.?\s*(s|m|h|day)\s*)+$
Use to set an upper-limit for the time requirement for each process. Should be a string in the format integer-unit e.g. --max_time '2.h'
Less common options for the pipeline, typically set in a config file.
Display help text.
boolean
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
An email address to send a summary email to when the pipeline is completed - ONLY sent if the pipeline does not exit successfully.
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Do not use coloured log outputs.
boolean
Custom config file to supply to MultiQC.
string
Directory to keep pipeline Nextflow logs and reports.
string
${params.outdir}/pipeline_info
Boolean whether to validate parameters against the schema at runtime
boolean
true
Show all params when using --help
boolean
Run this workflow with Conda. You can also use '-profile conda' instead of providing this parameter.
boolean