man shntool (1): a multi-purpose WAVE data processing and reporting utility

SYNOPSIS

shntool mode ...
shntool [CORE OPTION]

DESCRIPTION

shntool is a command-line utility to view and/or modify WAVE data and properties. It runs in several different operating modes, and supports various lossless audio formats.

shntool is comprised of three parts - its core, mode modules, and format modules. This helps to make the code easier to maintain, as well as aid other programmers in developing new functionality. The distribution archive contains a file named 'modules.howto' that describes how to create a new mode or format module, for those so inclined.

Mode modules

shntool performs various functions on WAVE data through the use of mode modules. The core of shntool is simply a wrapper around the mode modules. In fact, when shntool is run with a valid mode as its first argument, it essentially runs the main procedure for the specified mode, and quits. shntool comes with several built-in modes, described below:

len: Displays length, size and properties of PCM WAVE data
fix: Fixes sector-boundary problems with CD-quality PCM WAVE data
hash: Computes the MD5 or SHA1 fingerprint of PCM WAVE data
pad: Pads CD(hyquality files not aligned on sector boundaries with silence
join: Joins PCM WAVE data from multiple files into one
split: Splits PCM WAVE data from one file into multiple files
cat: Writes PCM WAVE data from one or more files to the terminal
cmp: Compares PCM WAVE data in two files
cue: Generates a CUE sheet or split points from a set of files
conv: Converts files from one format to another
info: Displays detailed information about PCM WAVE data
strip: Strips extra RIFF chunks and/or writes canonical headers
gen: Generates CD-quality PCM WAVE data files containing silence
trim: Trims PCM WAVE silence from the ends of files

For more information on the meaning of the various command-line options for each mode, see the MODE-SPECIFIC OPTIONS section below.

For convenience, each mode can specify an alternate name or alias that will invoke it (this feature is currently only available on systems that support symbolic or hard linking). In particular, each mode is aliased to 'shn<mode>'. For instance, running shnlen is equivalent to running shntool len - thus saving a few keystrokes.

Format modules

File formats are abstracted from shntool through the use of format modules. They provide a means for shntool to tranparently read and/or write different file formats. This abstraction allows shntool to concentrate on its job without worrying about the details of each file format.

The following formats are currently supported:

wav: RIFF WAVE file format
aiff: Audio Interchange File Format (AIFF and uncompressed/sowt AIFF-C only) (via 'sox'):
<http://sox.sourceforge.net/>
shn: Shorten low complexity waveform coder (via 'shorten'):
<http://www.softsound.com/Shorten.html>
<http://www.etree.org/shnutils/shorten/>
flac: Free Lossless Audio Codec (via 'flac'):
<http://flac.sourceforge.net/>
ape: Monkey's Audio Compressor (via 'mac'):
<http://www.monkeysaudio.com/>
<http://supermmx.org/linux/mac/>
alac: Apple Lossless Audio Codec (via 'alac'):
<http://craz.net/programs/itunes/alac.html>
tak: (T)om's lossless (A)udio (K)ompressor (via 'takc'):
<http://www.thbeck.de/Tak/Tak.html>
ofr: OptimFROG Lossless WAVE Audio Coder (via 'ofr'):
<http://www.losslessaudio.org/>
tta: TTA Lossless Audio Codec (via 'ttaenc'):
<http://tta.sourceforge.net/>
als: MPEG-4 Audio Lossless Coding (via 'mp4als'):
<http://www.nue.tu-berlin.de/forschung/projekte/ -
lossless/mp4als.html>
wv: WavPack Hybrid Lossless Audio Compression (via 'wavpack' and 'wvunpack'):
<http://www.wavpack.com/>
lpac: Lossless Predictive Audio Compression (via 'lpac'):
<http://www.nue.tu-berlin.de/wer/liebchen/lpac.html>
la: Lossless Audio (via 'la'):
<http://www.lossless-audio.com/>
bonk: Bonk lossy/lossless audio compressor (via 'bonk'):
<http://www.logarithmic.net/pfh/bonk>
kxs: Kexis lossless WAV file compressor (via 'kexis'):
<http://www.sourceforge.net/projects/kexis/>
mkw: MKW Audio Compression format (via 'mkwcon'):
<http://www.etree.org/shnutils/mkwcon/>
cust: Custom output format module (output only, useful for encoding to a format that shntool does not yet support)
term: sends output to the terminal
null: sends output to /dev/null (output only, useful for dry-runs in several modes, such as fix mode or strip mode)

When reading files for input, shntool automatically discovers which, if any, format module handles each file. In modes where files are created as output, you can specify what the output format should be - otherwise, shntool decides for you by selecting the first format module it finds that supports output (in a default installation, this will be the wav format).

CORE OPTIONS

Modeless

When run without a mode, shntool takes these options:

-m: Show detailed mode module information
-f: Show detailed format module information
-a: Show default format module arguments
-v: Show version information
-h: Show a help screen

GLOBAL OPTIONS

All modes

All modes support the following options:

-D

Print debugging information

-F file

Specify a file containing a list of filenames to process. This overrides any files specified on the command line or on the terminal.

NOTE: Most modes will accept input filenames from a single source, according to the following order of precedence: file specified by the -F option, otherwise filenames on the command line, otherwise filenames read from the terminal.

-H

Print times in h:mm:ss.{ff,nnn} format, instead of m:ss.{ff,nnn}

-P type

Specify progress indicator type. type is one of: {pct, dot, spin, face, none}. pct shows the completion percentage of each operation. dot shows the progress of each operation by displaying a '.' after each 10% step toward completion. spin shows a spinning progress indicator. face shows the progress of each operation by displaying six emoticons that become increasingly happy as the operation nears completion. none prevents any progress completion information from being displayed. The default is pct.

-h

Show the help screen for this mode

-i fmt

Specify input file format decoder and/or arguments. The format is: "fmt decoder [arg1 ... argN]", and must be surrounded by quotes. If arguments are given, then one of them must contain "%f", which will be replaced with the input filename. Examples:

: -i 'shn shorten-2.3b' (use official shorten-2.3b instead of later versions; leave default arguments untouched)
-i 'shn shorten -x -d 2048 %f -' (force shorten to skip the first 2048 bytes of each file)

-q

Suppress non-critical output (quiet mode). Output that normally goes to stderr will not be displayed, other than errors or debugging information (if specified).

-r val

Reorder input files? val is one of: {ask, ascii, natural, none}. The default is natural.

-v

Show version information

-w

Suppress warnings

--

Indicates that everything following it is a filename

Output modes

Additionally, any mode that creates output files supports the the following options:

-O val

Overwrite existing files? val is one of: {ask, always, never}. The default is ask.

-a str

Prefix str to base part of output filenames

-d dir

Specify output directory

-o str

Specify output file format extension, encoder and/or arguments. Format is: "fmt [ext=abc] [encoder [arg1 ... argN (%f = filename)]]", and must be surrounded by quotes. If arguments are given, then one of them must contain "%f", which will be replaced with the output filename. Examples:

-o 'shn shorten -v2 - %f' (create shorten files without seek tables)

-o 'flac flake - %f' (use alternate flac encoder)

-o 'aiff ext=aif' (override default aiff extension of 'aiff' with 'aif')

-o 'cust ext=mp3 lame --quiet - %f' (create mp3 files using lame)

-z str

Postfix str to base part of output filenames

MODE-SPECIFIC OPTIONS

len mode options

-U unit

Specifies the unit in which the totals will be printed. unit is one of: {b, kb, mb, gb, tb}. The default is b.

-c

Do not show column names

-t

Do not show totals line

-u unit

Specifies the unit in which each file will be printed. unit is one of: {b, kb, mb, gb, tb}. The default is b.

len mode output

The output of len mode may seem cryptic at first, because it attempts to convey a lot of information in just a little bit of space. But it is quite easy to read once you know what the columns represent; and in certain columns, what each character in the column means. Each column is explained below.

length: Shows the length of the WAVE data, in m:ss.nnn (millisecond) format. If the data is CD-quality, then m:ss.ff is shown instead, where ff is a number from 00 to 74 that best approximates the number of frames (2352-byte blocks) remaining after m:ss. If all files are CD-quality, the total length will be shown in m:ss.ff format; otherwise it will be in m:ss.nnn format. NOTE: CD-quality files are rounded to the nearest frame; all other files are rounded to the nearest millisecond.
expanded size: Shows the total size of all WAVE chunks within the file (header, data and any extra RIFF chunks). Essentially this is the size that the file would be if it were converted to .wav format, e.g. with shntool conv.
NOTE: Do not rely on this field for audio size! If you simply want to know how many bytes of audio are in a file, run it through info mode, and look at the "data size" field in its output.
cdr: Shows properties related to CD-quality files. A 'c' in the first slot indicates that the WAVE data is not [C]D-quality. A 'b' in the second slot indicates that the CD-quality WAVE data is not cut on a sector [b]oundary. An 's' in the third slot indicates that the CD-quality WAVE data is too [s]hort to be burned.
A '-' in any of these slots indicates that the particular property is OK or normal. An 'x' in any of these slots indicates that the particular property does not apply to this file, or cannot be determined.
WAVE: Shows properties of the WAVE data. An 'h' in the first slot indicates that the WAVE [h]eader is not canonical. An 'e' in the second slot indicates that the WAVE file contains [e]xtra RIFF chunks.
A '-' in any of these slots indicates that the particular property is OK or normal. An 'x' in any of these slots indicates that the particular property does not apply to this file, or cannot be determined.
problems: Shows problems detected with the WAVE header, WAVE data, or the file itself. A '3' in the first slot indicates that the file contains an ID[3]v2 header. An 'a' in the second slot indicates that the audio data is not block-[a]ligned. An 'i' in the third slot indicates that the WAVE header is [i]nconsistent about data size and/or file size. A 't' in the fourth slot indicates that the WAVE file seems to be [t]runcated. A 'j' in the fifth slot indicates that the WAVE file seems to have [j]unk appended to it.
A '-' in any of these slots indicates that the particular problem was not detected. An 'x' in any of these slots indicates that the particular problem does not apply to this file, or cannot be determined.
fmt: Shows which file format handled this file.
ratio: Shows the compression ratio for this file.
filename: Shows the name of the file that's being inspected.

fix mode options

NOTE: file names for files created in fix mode will be based on the input file name with the string '-fixed' appended to it, and the extension will be the default extension of the output file format. For example, with an output file format of shn the file 'foo.wav' would become 'foo-fixed.shn'. This can be overridden with the -a and/or -z global options described above.

-b: Shift track breaks backward to the previous sector boundary. This is the default.
-c: Check whether fixing is needed, without actually fixing anything. shntool will exit with status 0 if fixing is needed, and status 1 otherwise. This can be useful in shell scripts, e.g.: "if shntool fix -c *; then shntool fix *; else ...; fi"
-f: Shift track breaks forward to the next sector boundary.
-k: Specifies that all files should be processed, even if the first several of them wouldn't be altered, aside from a possible file format change. The default is to skip the first N files that wouldn't be changed from a WAVE data perspective in order to avoid unnecessary work.
-n: Specifies that the last file created should not be padded with silence to make its WAVE data size a multiple of 2352 bytes. The default is to pad the last file.
-u: Round track breaks to the nearest sector boundary.

hash mode options

-c: Specifies that the composite fingerprint for all input files should be generated, instead of the default of one fingerprint per file. The composite fingerprint is simply the fingerprint of the WAVE data from all input files taken as a whole in the order given, and is identical to the one that would be generated from the joined file if the same files were joined into one large file, with no padding added. This option can be used to fingerprint file sets, or to identify file sets in which track breaks have been moved around, but no audio has been modified in any way (e.g. no padding added, no resampling done, etc.).
-m: Generate MD5 fingerprints. This is the default.
-s: Generate SHA1 fingerprints.

pad mode options

NOTE: file names for files created in pad mode will be based on the input file name with the string '-prepadded' or '-postpadded' appended to it, and the extension will be the default extension of the output file format. For example, with an output file format of shn and pre-padding specified on the command line, the file 'foo.wav' would become 'foo-prepadded.shn'. This can be overridden with the -a and/or -z global options described above.

Be aware that some output format encoders (e.g. flac, ape) automatically strip headers and/or extra RIFF chunks.

-b: Specifies that the file created should be padded at the beginning with silence to make its WAVE data size a multiple of 2352 bytes.
-e: Specifies that the file created should be padded at the end with silence to make its WAVE data size a multiple of 2352 bytes. This is the default action.

join mode options

NOTE: file names for files created in join mode will be prefixed with 'joined.', and the extension will be the default extension of the output file format. For example, with an output file format of wav the files 'files*.wav' would become 'joined.wav'. This can be overridden with the -a and/or -z global options described above.

-b: Specifies that the file created should be padded at the beginning with silence to make its WAVE data size a multiple of 2352 bytes. Note that this option does not apply if the input files are not CD-quality, since padding is undefined in that case.
-e: Specifies that the file created should be padded at the end with silence to make its WAVE data size a multiple of 2352 bytes. This is the default action. Note that this option does not apply if the input files are not CD-quality, since padding is undefined in that case.
-n: Specifies that the file created should not be padded with silence to make its WAVE data size a multiple of 2352 bytes. Note that this option does not apply if the input files are not CD-quality, since padding is undefined in that case.

split mode options

NOTE: file names for files created in split mode are of the form prefixNNN.ext, where NNN is the output file number, and 'ext' is the default extension of the output file format. If an output file format of 'wav' is used, and the prefix is not altered via the -n switch described below, then the output file names will be "split-track01.wav", "split-track02.wav", etc. This can be overridden with the -a and/or -z global options described above.

For information on specifying split points, see the Specifying split points section below.

-c num

Specifies the number to start counting from when naming output files. The default is 1.

-e len

Prefix each track with len amount of lead-in taken from the previous track. len must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.

-f file

Specifies a file from which to read split point data. If not given, then split points are read from the terminal.

-l len

Specifies that the input file should be split into smaller files based on multiples of the len time interval. len must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.

-m str

Specifies a character manipulation string for filenames generated from CUE sheets. These characters, taken one-by-one, represent from/to character translation. They must always be in pairs. Some examples:

:-: Translate all instances of ':' to '-'
:-/-: Translate both ':' and '/' to '-'
:-/_*x: Translate ':' to '-', '/' to '_', and '*' to 'x'

-n fmt

Specifies the file count output format. The default is %02d, which gives two-digit zero-padded numbers (01, 02, 03, ...).

-t fmt

Name output files in user-specified format based on CUE sheet fields. The following formatting strings are recognized:

%p: Performer
%a: Album
%t: Track title
%n: Track number

-u len

Postfix each track with len amount of lead-out taken from the next track. len must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.

-x list

Only extract tracks in list (comma separated, may contain ranges). Examples include:

7: Only extract track 7
3-5: Only extract tracks 3 through 5
2-6,9,11-13: Only extract tracks 2 through 6, 9, and 11 through 13

Specifying split points

Split points simply mark places within the WAVE data of the input file where tracks will be split. They can be specified in any combination of the following formats:

bytes: where bytes is a specific byte offset
m:ss: where m = minutes and ss = seconds
m:ss.ff: where m = minutes, ss = seconds and ff = frames (75 per second, so ff ranges from 00 to 74)
m:ss.nnn: where m = minutes, ss = seconds and nnn = milliseconds (will be rounded to closest sector boundary, or the first sector boundary if the closest one happens to be the beginning of the file)
CUE sheet: - a simple CUE sheet, in which each "INDEX 01 m:ss:ff" line is converted to a m:ss.ff split point

Split points must be given in increasing order, and must appear one per line. If the byte offset calculated from the final split point equals the input file's WAVE data size, then it is ignored. Since split points specify locations within the input file where tracks will be split, N split points will create N+1 output files. All m:ss formats will create splits on sector boundaries whenever the input file is CD-quality; to force non-sector-aligned splits, use the exact byte format.

cat mode options

-c: Specifies that extra RIFF chunks should be suppressed from the output. The default is to write the extra RIFF chunks.
-d: Specifies that the WAVE data should be suppressed from the output. The default is to write the data.
-e: Specifies that the WAVE header should be suppressed from the output. The default is to write the header.
-n: Specifies that the NULL pad byte at end of odd-sized data chunks should be suppressed from the output, if present. The default is to write the NULL pad byte. This option only applies when WAVE data is also written, otherwise it is ignored.

cmp mode options

-c secs: Sets the number of seconds of audio to use for the byte-shift comparison buffer. This option only makes sense with the -s option. The default is 3 seconds.
-f fuzz: Sets the "fuzz factor" for determining whether byte-shifted data is identical. fuzz is a positive integer that represents the maximum number of allowable byte mismatches between the two files in the area searched by the -s option. This allows one to check for differing bytes between to files that (a) are byte-shifted and (b) contain at least one error in the area searched by the -s option. The higher the fuzz factor, the longer the search takes, so set it low to begin with (8 or so), and increase it in small steps if needed. NOTE: this switch can only be used with the -s switch.
-l: List offsets and values of all differing bytes. Output is similar to 'cmp -l'; in particular, offsets are 1-based. Can be used with the -s switch.
-s: Check to see whether the WAVE data contained in the input files are identical modulo a byte-shift. Currently, this will only detect differences up to the first 529200 bytes (equal to 3 seconds of CD-quality data). This can be used to compare WAVE data within a pre-burned file to WAVE data in the corresponding track ripped from the burned CD, which is useful if the ripped track came from a CD burned TAO, and thus might have a 2-second gap of silence at the beginning. This option can also help identify a CD burner/CD reader combined read/write offset.

cue mode options

-c: Specifies that a simple CUE sheet should be output. This is the default action. NOTE: all input files must be CD-quality for CUE sheets to be valid.
-s: Specifies that split points in explicit byte-offset format should be output.

conv mode options

NOTE: file names for files created in conv mode will be named based on the input file name. Specifically, if the input file name ends with the default file extension for that file's format, then the default extension for the desired output format will replace it; otherwise, it will be appended to it. For example, for an output format of shn and a wav input file named 'file.wav', the converted file will be named 'file.shn', since '.wav' is the default extension for the wav format. On the other hand, given the same situation above, but with an input file named 'file.wave', the converted file will be named 'file.wave.shn', since '.wave' does not match '.wav'. This can be overridden with the -a and/or -z global options described above.

Be aware that some output format encoders (e.g. flac, ape) automatically strip headers and/or extra RIFF chunks, while others (e.g. sox) might adjust WAVE data sizes in rare instances in order to align the audio on a block boundary.

-t: Read WAVE data from the terminal.

info mode options

This mode doesn't support any additional options.

strip mode options

NOTE: file names for files created in strip mode will be based on the input file name with the string '-stripped' appended to it, and the extension will be the default extension of the output file format. For example, with an output file format of wav the file 'bar.shn' would become 'bar-stripped.wav'. This can be overridden with the -a and/or -z global options described above.

-c: Specifies that extra RIFF chunks should not be stripped. The default is to remove everything that appears after the first data chunk.
-e: Specifies that WAVE headers should not be made canonical. The default is to canonicalize headers.

gen mode options

NOTE: file names for files created in gen mode will be prefixed with 'silence.', and the extension will be the default extension of the output file format. For example, with an output file format of wav the generated file would become 'silence.wav'. This can be overridden with the -a and/or -z global options described above.

-l len: Generate files containing len amount of silence. len must be given in bytes, m:ss, m:ss.ff or m:ss.nnn format.

trim mode options

NOTE: file names for files created in trim mode will be based on the input file name with the string '-trimmed' appended to it, and the extension will be the default extension of the output file format. For example, with an output file format of shn the file 'foo.wav' would become 'foo-trimmed.shn'. This can be overridden with the -a and/or -z global options described above.

-b: Only trim silence from the beginning of files
-e: Only trim silence from the end of files

ENVIRONMENT VARIABLES

ST_DEBUG

If set, shntool will print debugging information. This is analogous to the -D global option, with the exception that debugging is enabled immediately, instead of when the command-line is parsed.

ST_<FORMAT>_DEC

Specify input file format decoder and/or arguments. Replace <FORMAT> with the format you wish to modify, e.g. ST_SHN_DEC. The format of this variable is analagous to the -i global option, except that the initial format is not included. Examples:

: ST_SHN_DEC='shorten-2.3b'
ST_SHN_DEC='shorten -x -d 2048 %f -'

ST_<FORMAT>_ENC

Specify output file format extension, encoder and/or arguments. Replace <FORMAT> with the format you wish to modify, e.g. ST_SHN_ENC. The format of this variable is analagous to the -o global option, except that the initial format is not included. Examples:

ST_SHN_ENC='shorten -v2 - %f'

ST_FLAC_ENC='flake - %f'

ST_AIFF_ENC='ext=aif'

ST_CUST_ENC='ext=mp3 lame --quiet - %f'

Note that command-line options take precedence over any of these environment variables.

EXIT STATUS

Generally speaking, shntool will exit with status 0 upon success, and status 1 if it encounters an error. The only exception is when the 'quit' option is selected from within the interactive file reordering menu, in which case the exist status will be 255.

NOTES

shntool is a misnomer, since it processes WAVE data, not shorten data. The name is a holdover from its early days as 'shnlen', a program created specifically to extract information about WAVE data stored within .shn files.

Aliases for shntool are prefixed with 'shn' instead of 'wav' to avoid possible collisions with existing programs.

AUTHOR

Jason Jordan <shnutils at freeshell dot org>

Please send all bug reports to the above address.

The latest version of shntool can always be found at <http://www.etree.org/shnutils/> or <http://shnutils.freeshell.org/>.

COPYRIGHT

This is free software. You may redistribute copies of it under the terms of the GNU General Public License <http://www.gnu.org/licenses/gpl.html>. There is NO WARRANTY, to the extent permitted by law.

REVISION

$Id: shntool.1,v 1.140 2009/03/30 05:59:25 jason Exp $