samtoh5(1)
convert a SAM file to cmp.h5 format
SYNOPSIS
samtoh5
in.sam
reference.fasta
out.cmp.h5
[options]
OPTIONS
- in.sam
-
Input SAM file.
- reference.fasta
-
Reference used to generate reads.
- out.cmp.h5
-
Output cmp.h5 file.
- -smrtTitle
-
Use this option when converting alignments generated from reads
produced by the
pls2fasta(1)
from bas.h5 files by parsing read
coordinates from the SMRT read title. The title is in the
format ,/name/hole/coordinates/, where coordinates are in the
format \d+_\d+, and represent the interval of the read that was
aligned.
- -readType value
-
Set the read type: 'standard', 'strobe', 'CCS', or 'cDNA'
- -verbosity value
-
Set desired verbosity.
- -useShortRefName
-
Use abbreviated reference names obtained from file.sam instead
of using full names from reference.fasta.
- -copyQVs
-
Copy all QVs available in the SAM file into the cmp.h5 file.
This includes things like InsertionQV and DeletionTag.
NOTES
Because SAM has optional tags that have different meanings in different
programs, careful usage is required in order to have proper output. The
"xs" tag in bwa-sw is used to show the suboptimal score, but in PacBio SAM
(
blasr(1))
it is defined as the start in the query sequence of the alignment.
When
-smrtTitle is specified, the xs tag is ignored, but when it is not
specified, the coordinates given by the xs and xe tags are used to define
the interval of a read that is aligned. The CIGAR string is relative to
this interval.