prank(1) Computes probabilistic multiple sequence alignments

SYNOPSIS

prank sequence_file

prank [optional parameters] -d=sequence_file [optional parameters]

DESCRIPTION

The Probabilistic Alignment Kit (PRANK) is a probabilistic multiple alignment program for DNA, codon and amino-acid sequences. It's based on a novel algorithm that treats insertions correctly and avoids over-estimation of the number of deletion events.

In addition, PRANK borrows ideas from maximum likelihood methods used in phylogenetics and correctly takes into account the evolutionary distances between sequences. Lastly, PRANK allows for defining a potential structure for sequences to be aligned and then, simultaneously with the alignment, predicts the locations of structural units in the sequences.

OPTIONS

INPUT/OUTPUT PARAMETERS

-d=sequence_file
The input sequence file in FASTA format.
-t=tree_file
The tree file to use. If unset, an appriximated NJ tree is generated.
-o=output_file
Set the name of the output file. If unset, output_file is set to output.
-f=output_format
Set the output format. output_format can be one of fasta (default), phylipi, phylips, paml, or nexus.
-m=model_file
The model file to use. If unset, model_file is set to HKY2/WAG.
-support
Compute posterior support.
-showxml
Output alignment xml-file.
-showtree
Output alignment guidetree.
-showanc
Output ancestral sequences.
-showall
Output all of these.
-noanchors
Do not use Exonerate anchoring. (Exonerate to be installed separately.)
-nomafft
Do not use MAFFT for guide tree. (MAFFT to be installed separately.)
-njtree
Estimate tree from an input alignment (and realign).
-shortnames
Truncate names at first space character.
-quiet
Reduce output.

ALIGNMENT MERGE

-d1=alignment_file
The first input alignment file in FASTA format.
-d2=alignment_file
The second input alignment file in FASTA format.
-t1=tree_file
The tree file for the first alignment. If unset, an appriximated NJ tree is generated.
-t2=tree_file
The tree file for the second alignment. If unset, an appriximated NJ tree is generated.

MODEL PARAMETERS

-F, +F
Force insertions to be always skipped.
-gaprate=#
Set the gap opening rate. The default is 0.025 for DNA and 0.005 for proteins.
-gapext=#
Set the gap extension probability. The default is 0.75 for DNA and 0.5 for proteins.
-codon
Use empirical codon model for coding DNA.
-DNA, -protein
Use DNA or protein model, respectively. Disables auto-detection of model.
-termgap
Penalise terminal gaps normally.
-nomissing
No missing data. Use -F for terminal gaps.
-keep
Do not remove gaps from pre-aligned sequences.

OTHER PARAMETERS

-iterate=#
Rounds of re-alignment iteration; by default, iterate five times and keep the best result.
-once
Run only once. Same as -iterate=1.
-prunetree
Prune guide tree branches with no sequence data.
-prunedata
Prune sequence data with no guide tree leaves.
-uselogs
Slower but should work for a greater number of sequences.
-translate
Translate input data to protein sequences.
-mttranslate
Translate input data to protein sequencess using mt table.
-convert
Do not align, just convert to a different format.
-dna=dna_sequence_file
DNA sequence file for backtranslation of protein alignment.
-help
Show an extended help page with more options.
-version
Show version and check for updates.

AUTHORS

prank was written by Ari Loytynoja.

This manual page was originally written by Manuel Prinz <[email protected]> for the Debian project (and may be used by others).