apertium-deshtml(1)
This application is part of (
SYNOPSIS
apertium-deshtml
[ -h ] [ -i ] [ -n ]
[ <input file> [ <output file> ] ]
DESCRIPTION
apertium-deshtml
is an HTML format processor. Data should be passed through this
processor before being piped to lt-proc. The program takes input
in the form of an HTML document and produces output suitable for
processing with lt-proc. HTML tags and other format information are
enclosed in brackets so that lt-proc treats them as whitespace between
words.
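The bracketing idea can be illustrated with a rough sketch. The helper below, wrap_tags, is hypothetical and is not part of Apertium; it only mimics the general principle of setting tags apart in square brackets, using sed. The real apertium-deshtml does more (for example, sentence-terminator handling, as described under OPTIONS), so its actual output will likely differ.

```shell
# Hypothetical helper, not part of Apertium: wraps every HTML tag in
# square brackets so that a later stage could skip over it.
# This is a simplified imitation of the idea, not the real deformatter.
wrap_tags() {
    sed -E 's/<[^>]+>/[&]/g'
}

# Text outside tags is left untouched; each tag becomes a bracketed block.
printf '%s\n' '<b>gener</b>' | wrap_tags
# prints: [<b>]gener[</b>]
```

With the tags bracketed this way, only the word "gener" remains as ordinary text for a later analysis stage.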
OPTIONS
-h, --help
Display this help.
-i
Makes the addition of a trailing sentence terminator (".") unconditional,
which often leads to duplicate terminators.
-n
Suppresses the addition of a trailing sentence terminator.
EXAMPLE
You could write the following to show how the word "gener" is analysed:

echo "<b>gener</b>" | apertium-deshtml | lt-proc ca-es.automorf.bin
BUGS
Lots of...lurking in the dark and waiting for you!
AUTHOR
Copyright (c) 2005, 2006 Universitat d'Alacant / Universidad de Alicante.
This is free software. You may redistribute copies of it under the terms
of the GNU General Public License <http://www.gnu.org/licenses/gpl.html>.