man subread-align (1): an accurate and efficient aligner for mapping both genomic DNA-seq reads and RNA-seq reads (for the purpose of expression analysis)

USAGE

subread-align [options] -i <index_name> -r <input> -o <output> -t <type>

Required arguments:

-i <string>: Base name of the index.
-r <string>: Name of the input file. Input formats including gzipped fastq, fastq, and fasta can be automatically detected. If paired-end, this should give the name of file including first reads.
-t <int>: Type of input sequencing data. Its values include 0: RNA-seq data 1: genomic DNA-seq data.

Optional arguments:

-o <string>: Name of the output file. By default, the output is in BAM format.
-n <int>: Number of selected subreads, 10 by default.
-m <int>: Consensus threshold for reporting a hit (minimal number of subreads that map in consensus) . If paired-end, this gives the consensus threshold for the anchor read. 3 by default
-M <int>: Specify the maximum number of mis-matched bases allowed in the alignment. 3 by default. Mis-matches found in softclipped bases are not counted.
-T <int>: Number of CPU threads used, 1 by default.
-I <int>: Maximum length (in bp) of indels that can be detected. 5 by default. The program can detect indels of up to 200bp long.
-B <int>: Maximal number of equally-best mapping locations to be reported. 1 by default. Note that -u option takes precedence over -B.
-P <3:6>: Format of Phred scores in input files, '3' for phred+33 and '6' for phred+64. '3' by default.
-u: Report uniquely mapped reads only. Number of matched bases ( for RNA-seq) or mis-matched bases(for genomic DNA-seq) is used to break the tie.
-b: Convert color-space read bases to base-space read bases in the mapping output. Note that read mapping is performed at color-space.
--sv: Detect structural variants (eg. long indel, inversion, duplication and translocation) and report breakpoints. Refer to Users Guide for breakpoint reporting.
--SAMinput: Input reads are in SAM format.
--BAMinput: Input reads are in BAM format.
--SAMoutput: Save mapping result in SAM format.
--trim5 <int>: Trim off <int> number of bases from 5' end of each read. 0 by default.
--trim3 <int>: Trim off <int> number of bases from 3' end of each read. 0 by default.
--rg-id <string>: Add read group ID to the output.
--rg <string>: Add <tag:value> to the read group (RG) header in the output.
--DPGapOpen <int> Penalty for gap opening in short indel detection. -1 by: default.
--DPGapExt <int>: Penalty for gap extension in short indel detection. 0 by default.
--DPMismatch <int> Penalty for mismatches in short indel detection. 0 by: default.
--DPMatch <int>: Score for matched bases in short indel detection. 2 by default.
--complexIndels: Detect multiple short indels that occur concurrently in a small genomic region (these indels could be as close as 1bp apart).
-v: Output version of the program.

Optional arguments for paired-end reads:

-R <string>: Name of the file including second reads.
-p <int>: Consensus threshold for the non-anchor read (receiving less votes than the anchor read from the same pair). 1 by default.
-d <int>: Minimum fragment/insert length, 50bp by default.
-D <int>: Maximum fragment/insert length, 600bp by default.
-S <ff:fr:rf>: Orientation of first and second reads, 'fr' by default ( forward/reverse).

Refer to Users Manual for detailed description to the arguments.

USAGE

LAST SEARCHED