vcheck(1) latest program version checker and auto-downloader

SYNOPSIS


vcheck [options]

vcheck is a tool for checking for latest versions of programs at HTTP and FTP locations given a list of URLs and (Perl-style) regular expressions to match, and to optionally download them automatically.

For a complete list of command line options, run

  $ vcheck --help

DESCRIPTION

vcheck's behavior can be influenced by both command line options and a configuration file, which at the same time serves as its data file, holding records of programs to check for. This config file is, by default (see ``FILES''), ~/.vcheck. It is structured according to a syntax which is printed in detail when run as

  $ vcheck --grammar

Details about both the grammar in general and the meaning of involved keyword can be found in ``GRAMMAR''. An example of what a config file looks like in principle can be found in ``EXAMPLES''.

The basic purpose of vcheck is to check for new versions of programs listed in its config file. The script is able to cope with all kinds of common version numbers, including words like ``pre'' or ``alpha'', etc. When a new version was found, the config file is updated accordingly.

Furthermore, vcheck can be used to download files automatically if a new version is/was found, and even delete obsolete versions found locally automatically. A special field in each program's record in the config file tells it which version has last been downloaded. Where necessary or desired, the download can be disabled for specific programs, or disabled in general and allowed in special cases. Besides, you can specify preferences (both in general and on a per-program basis) as to what kind of files to download if new versions are available, say, in different formats.

vcheck also has features to limit the scope of programs to check or download to a subset, such as defined by:

  • a regular expression names have to match
  • a minimal urgency (which can be defined on a per-program basis, as levels of high, medium, and low)
  • those programs which haven't yet been downloaded since a new latest version was found
  • those programs which previous queries failed for (optionally, a certain minimum number of times)
  • a conjunctive combination of several of these conditions

In case you're behind a firewall, an HTTP(-based) proxy can be defined in a number of ways (precendence in this order):

  • a specific HTTP or FTP proxy, respectively, defined in the config file
  • a common HTTP+FTP proxy defined in the config file
  • a specific HTTP or FTP proxy, defined via the environment variables $http_proxy/$HTTP_PROXY and $ftp_proxy/$FTP_PROXY, respectively (each in this order of precendence)

vcheck uses ANSI escape sequences to visually enhance its output. Success messages are usually printed in green, error messages are yellow or red (signalling severity). This feature can be disabled temporarily by using the corresponding command line switch, or permanently, via a setting in the configuration file (see ``CONFIGURATION SECION'').

GRAMMAR

When run as

  vcheck --grammar

vcheck will print its config file's grammar, i.e., the formal structure of the entries therein. The individual fields' names are printed along with short descriptions; details on their meaning and usage can be found below in this section.

Per default (i.e., if the script's name has not been changed (see ``FILES'') and if not overridden via "--file"), vcheck reads its configuration from ~/.vcheck. This file will also be rewritten regularly whenever version information etc. about a program is updated. In the course of such rewrites, entries will be sorted in a definable fashion, and a hard-coded order of keywords and indentation scheme will be applied.

Basically, the config file may contain two types of records: a configuration section and any number of program sections. A record (or section---these terms are used synonymously in this documentation) consists of a keyword marking its beginning and a name (this only goes for program sections), followed by an equal sign (`=') and a pair of curly braces (``{}''), between which the section's data is put.

Section data is a sequence of settings, or fields, of a number of types, some of which are obligatory while others are optional, separated by white space (typically, line feeds, to keep things readable). There are the following types of fields:

Boolean
Keywords of this type set a property based on their mere presence. An example of this is the config section field dldefaultno:

  config = {
    dldefaultno
  }
string
String fields consist of a keyword followed by an equal sign (`=') and a string representing the field's value. If the string value contains white space or (double) quotation marks, it needs to be surrounded by (double) quotation marks (`"'). In this case, both quotation marks inside the string and backslashes need to be escaped by backslashes (`\'). Note that string values may not span several lines but have to be contained on a single one, and there may be validation rules as to what the value may be like. Besides, string fields are typically required to be of non-zero length.

An example of this type of field is the prog section field comment:

  prog foo = {
    [...]
    comment = Hello!
    comment = "Comment with white space and \"quotes\"!"
    [...]
  }
string enumeration
String enumerations are basically string fields with but a limited set of allowed values. An example of this is the prog section field dl, whose value must be either ``yes'' or ``no'', if present:

  prog foo = {
    [...]
    dl = yes
    [...]
  }

CONFIGURATION SECTION

The configuration section is optional and, if present, contains settings globally affecting vcheck's default behavior. The configuration section is unique per file (although multiple occurrences with non-conflicting settings are allowed, but these will be joined into a single section once the file is rewritten).

The keyword introducing a configuration section is config. Thus, a config section's principal layout looks like this:

  config = {
    [...]
  }

The keywords allowed inside (``[...]'') the config section are explained in detail below (listed in alphabetical order):

defaulturgency (enumeration: high, medium, low)
Specifies the checking urgency level to assume, unless specified otherwise in a program's record via prog.urgency. Urgencies allow for a crude selection of programs to check for via the "--urgency" command line parameter. In absence of this option, the default urgency is medium.
deleteold (Boolean)
If included in the config section, causes the script to automatically look for and delete versions of a program obsoleted by a new download. May be overridden by prog.deleteold. See the latter for details.

Special note: Use at your own risk!

dldefaultno (Boolean)
By default, don't download. This causes the script to download only those programs whose dl option is explicitly set to yes when run with the "--download" parameter.
dldir (string: absolute directory path)
This option specifies an absolute path (i.e., relative to the root directory) of a directory where to put downloaded files. If the download directory isn't set via this or even more explicitly via a prog.dldir option, downloads will end up in that directory in which the script is executing.
dlexec (string)
Specifies a command to be executed after any successful download (unless overridden for a particular program via prog.dlexec). A successful download in this context is one whose file type has been recognized and whose integrity could be confirmed. In unizoid environments, the command is executed under whatever shell the environment variable $SHELL defines.

The command string is subject to expansion of the following placeholders (see ``PLACEHOLDERS'' for their meaning): "__DLURL__", "__FILE__", "__NEWVER__", "__PROG__", "__RAWVER__", "__URL__". Additionally, `~/' will be replaced by the user's home directory.

config.dlexec may prove useful to, e.g., automatically convert, say, gzipped to bzipped files using a helper script, or to log downloads (see ``HINTS'').

dlprefs (string)
A semicolon- (`;'-) separated list of Perl-style regular expressions defining download preferences. Each of the regular expressions is supposed to match a particular file type that's possible or likely to be encountered. The order in which the expressions occur defines their precedence (the first matching expression will determine which of a set of available file types of a given program version will be selected for download). This value is the default in effect unless specific preferences are defined on a per-program basis using prog.dlprefs. If neither config.dlprefs nor prog.dlprefs is set, the file to be downloaded is chosen pseudo-randomly, if multiple pattern matches occur.

For these download preferences to make any sense, file- and version-matching expressions need to be sufficiently non-restrictive to match several possible extensions. For example, "foo-("__VER__")\\.t`` will match both ''.tar.gz`` and ''.tar.bz2" files, and setting dlprefs to ``\\.tar\\.bz2$;\\.tar\\.gz$'' will cause the script to preferrably download ``.tar.bz2'' files.

dlretry (string: non-negative integer number)
The number of times to retry downloading after a failed download. If this option isn't specified, the number of retries defaults to 0. A retry is considered to have failed if either the connection failed, the retrieved document was empty, or the file type has been recognized and its integrity verified.
eagerquote (Boolean)
If this option is set, all string parameters of configuration file options will be surrounded by double quotes. The default is to use quotes only where necessary (e.g., for string parameters containing white space).
echoexec (Boolean)
If this option is set, commands executed thanks to newverexec or dlexec options will be echoed prior to execution.
ftpproxy (string: HTTP URL or "server:port")
This option specifies a proxy to use for retrieving documents from FTP locations. It specifies either the complete URL or the server and port (as ``server:port'') of the proxy, and the proxy has to be a HTTP-based FTP proxy. This option takes precedence over config.proxy, if specified. If neither config.ftpproxy nor config.proxy is set, the script uses the value the environment variables $ftp_proxy or $FTP_PROXY (in this order of precedence) are set to, or no FTP proxy at all.
httpproxy (string: HTTP URL or "server:port")
This option specifies a proxy to use for retrieving documents from HTTP locations. It specifies either the complete URL or the server and port (as ``server:port'') of the proxy. This option takes precedence over config.proxy, if specified. If neither config.httpproxy nor config.proxy is set, the script uses the value the environment variables $http_proxy or $HTTP_PROXY (in this order of precedence) are set to, or no HTTP proxy at all.
lastcheck (string: date formatted as "YYYY-MM-DD HH:MM")
The date and time the script was last run updating the configuration file. This value is generated and updated automatically.
newverexec (string)
A command to be executed whenever a new version of a program is found, unless overridden on a per-program basis via prog.newverexec. The command is executed under whatever shell the environment variable $SHELL defines.

The command string is subject to expansion of the following placeholders (see ``PLACEHOLDERS'' for their meaning): "__NEWVER__", "__PROG__", "__RAWVER__", "__URL__". Additionally, `~/' will be replaced by the user's home directory.

nocache (Boolean)
This option conserves some memory by not caching retrieved documents (those fetched from prog.url locations). By default, the script caches retrieved document so that program records referring to the same web page won't result in (unnecessary) multiple retrievals during the same session.
plain (Boolean)
This option causes the script to generate plain (as opposed to ANSI-enhanced) output by default. The option may be overridden by specifying "--noplain" on the command line.
proxy (string: HTTP URL or "server:port")
This option specifies a proxy to use for retrieving documents from both HTTP and FTP locations. It specifies either the complete URL or the server and port (as ``server:port'') of the proxy. The proxy set via this option may be overridden via config.ftpproxy and/or config.httpproxy.
sortby (enumeration: name, url)
This option specifies whether to sort prog entries by program name (prog section identifier) or URL when rewriting the configuration file. The default is to sort by name.
xfersum (Boolean)
Corresponds to the command line option "--xfersum". If set, the script will print a total of the amount of data that has been received at exit. Can be overridden via the command line switch "--noxfersum".
timeout (string: non-negative integer number)
The time (in seconds) after which attempted remote retrievals should be aborted. The default is 90 seconds.
verbose (Boolean)
If this option is set, the script will also print version numbers that haven't been obsoleted. The default is to print only new versions (and error messages). This setting can be overridden via the command line switch "--noverbose".

PROGRAM SECTIONS

Program sections each define for a single program (package, ...) an HTTP or FTP URL based on which the latest version of that program available can be determined by vcheck using an additionally-defined regular expression. There can (hypothetically) be any number of program sections in a config file.

The keyword introducing a program section is prog. Each prog section is identified by a unique identifier (there may not be multiple prog sections with the same identifier). Thus, a config section's principal layout looks like this:

  prog Foo = {
    [...]
  }

The keywords allowed inside (``[...]'') a prog section are explained in detail below (listing in alphabetical order). All fields are optional and allowed but once per prog section, unless explicitly stated otherwise.

comment (string; multiple allowed)
An arbitrary comment string. If multiple such entries exist for a single program record, their relative order will be maintained when rewriting the configuration file.
deleteold (enumeration: yes, no)
This option defines whether the script should look for and delete any obsolete versions of a program located in its download directory after each 'Usuccessful' download of a new version of that program. A successful download in this context is any download of a file of a known type whose integrity could be verified. Overrides config.deleteold; the deletion of obsolete versions is disabled by default and only activated by config.deleteold or prog.deleteold.

Any occurrence of prog.dlexplicit or prog.dlintermediate in a program's record inhibits application of deleteold for that program.

Special note: Use at your own risk!

disabled (Boolean)
This option causes the program record in question to be ignored (except when the command line switch "--force" is used).
dl (enumeration: yes, no)
This option specifies whether to download the program in question when the script is run with the "--download" option. By default, a program will be downloaded when a new version is found and the script is run with said parameter, unless config.dldefaultno is set. prog.dl overrides the latter option.
dldir (string)
This option specifies a download directory on a per-program basis. If the directory is absolute (i.e., relative to the root directory, as indicated by a leading slash), it will be treated as an absolute path, otherwise it will be considered relative to either config.dldir, if specified, or the directory the script is executing in.

The dldir string is subject to expansion of the following placeholders (see ``PLACEHOLDERS'' for their meaning): "__NEWVER__".

dldirlast (string)
This option specifies the download directory of the last downloaded version of a program. It does not contains ``PLACEHOLDERS'' unlike "prog.dldir". If the directory is absolute (i.e., relative to the root directory, as indicated by a leading slash), it will be treated as an absolute path, otherwise it will be considered relative to either config.dldir, if specified, or the directory the script is executing in.

This option is only used to store ``PLACEHOLDER'' free dldir, it is overwriten at each new download.

dlexec (string, may be zero-length)
Specifies a command to be executed after any successful download of the program, overriding config.dlexec (if set). A successful download in this context is one whose file type has been recognized and whose integrity could be confirmed. The command is executed under whatever shell the environment variable $SHELL is set to.

The command string is subject to expansion of the following placeholders (see ``PLACEHOLDERS'' for their meaning): "__DLURL__", "__FILE__", "__NEWVER__", "__PROG__", "__RAWVER__", "__URL__". Additionally, `~/' will be replaced by the user's home directory.

dlexplicit (string: HTTP or FTP URL; multiple allowed)
Specifies an explicit download URL. Whenever a new version of the program in question is found, the URL specified via this option will be downloaded (if requested) instead of the one deduced from prog.url and prog.regex.

The command string is subject to expansion of the following placeholders (see ``PLACEHOLDERS'' for their meaning): "__NEWVER__", "__RAWVER__".

This option can also be used to, e.g., download multiple packages on detection of a new version, provided that their names can be specified. For an example of this, see ``EXAMPLES''.

dlintermediate (Boolean)
If this option is set, intermediate versions (i.e., version referenced at url newer than dlversion but older than the most recent version available) will be downloaded as well if any are encountered when a new version of the program is found. This option is useful for downloading patches and suchlike, which depend on each other consecutively. The default is to ignore intermediate versions.
dlprefs (string)
A semicolon- (`;'-) separated list of Perl-style regular expressions defining download preferences. Each of the regular expressions is supposed to match a particular file type that's possible or likely to be encountered. The order in which the expressions occur defines their precedence (the first matching expression will determine which of a set of available file types of a given program version will be selected for download). This value overrides default preferences possibly defined via config.dlprefs. If neither config.dlprefs nor prog.dlprefs is set, the file to be downloaded is chosen pseudo-randomly, if multiple pattern matches occur.

For these download preferences to make any sense, file- and version-matching expressions need to be sufficiently non-restrictive to match several possible extensions. For example, "foo-("__VER__")\\.t`` will match both ''.tar.gz`` and ''.tar.bz2`` files, and setting `dlprefs' to ''\\.tar\\.bz2$;\\.tar\\.gz$`` will cause the script to preferrably download ''.tar.bz2" files.

dlreferrer (string, may be zero-length)
Specifies an HTTP referrer to use when downloading a program package. By default, the version-determining document (i.e., the last url value, with placeholders expanded) is used.
dlversion (string)
This parameter stores the last downloaded version of the program in question and is updated whenever a new version is found (except when running in read-only mode). If prog.transform is set, the stored version will have been transformed from the one matched by prog.regex.
errors (string: non-negative integer number)
This field stores the number of errors during version checks and is reset once a check succeeds. A high value of this field is indicative of an outdated URL or file name matching regular expression and will be remarked upon by the script. Additionally, it is possible to limit the scope of an operation to erroneous records via the "--errors" command line parameter.
lastcheck (string: date in format "YYYY-MM-DD HH:MM")
This field stores the date and time the program in question was last checked (no matter whether successfully or unsuccessfully).
newverexec (string, may be zero-length)
A command to be executed whenever a new version of a program is found, overriding a possible definition via config.newverexec. The command is executed under whatever shell the environment variable $SHELL defines.

The command string is subject to expansion of the following placeholders (see ``PLACEHOLDERS'' for their meaning): "__NEWVER__", "__PROG__", "__RAWVER__", "__URL__". Additionally, `~/' will be replaced by the user's home directory.

regex (string; required; multiple allowed)
This required field is supposed to contain a Perl-style regular expression matching desired versions of the program in question given the document at prog.url as input. Note that the regexp needn't match the complete file name---when considering a download, the script will auto-expand the match as seen fit.

Regular expressions for matching programs' version numbers have to be written in such a way that the $1 part (see the ``perlre'' man page), if the entire expression matches, is exactly the version number. The option is subject to placeholder expansion: "__VER__" will be replaced by a pre-manufactured (non-greedy) regular expression matching version numbers compliant with any of a number of common schemes. Note that in order to yield a $1 match as required, "__VER__" still needs to be put in parentheses. For examples of prog.regex values, see ``EXAMPLES''.

In order to cope with particularly complex remote scenarios (such as version-dependent directory hierarchies), multiple url, regex, and transform fields may be specified per program. In this case, the script will match urls and regexes starting with the first and continuously proceeding to the next field of each type (in sync, as long as both of them are available, or using the last one available otherwise) and match the regexp against the corresponding document. In order for this to be of any use, the second (and each potential later) url will have to contain a "__NEWVER__" or "__RAWVER__" placeholder (see ``PLACEHOLDERS'') which will be replaced by the previously matched latest [transformed] version (the same substitution is done for regex). The version that will finally be considered the latest for the program in question will be the one determined by matching the last regex against the last urls document. For an example of how this can be used in practice, see ``EXAMPLES''.

Possible multiple transform fields will be processed in sync with the respective url and regex fields as long as additional transform fields are specified. If there are more url and/or regex fields than transform fields, the last-specified transform expression will be used for further iterations. If, on the other hand, there are more transform than url/regex fields, further retrievals/matches will be done based on the last url/regex. The author has, however, no idea how this could be of any use.

When the config file is rewritten, multiple url, regex, and/or transform fields will be interleaved to facilitate comprehension and retain their relative order.

tranform (string; multiple allowed)
A Perl expression transforming a version number in $_ (obtained by a prog.regex match) in some way the user deems adequate. For examples of how this might come in handy, see ``EXAMPLES''. The return value of the code fragment, i.e., the value of its last expression, is used as the transformed version and will henceforth be the basis for version comparison for the program in question.
urgency (enumeration: high, medium, low)
Defines the urgency with which to check for the specified program. Urgencies allow for a crude selection of programs to check for via the "--urgency" command line parameter. If there is no urgency defined, it defaults to either config.defaulturgency (if set) or medium.
url (string: HTTP or FTP URL; required; multiple allowed)
This required field defines the HTTP or FTP URL to retrieve as the document to scan for in order to detect the availability of new program versions by matching against prog.regex. Note that if the URL is a directory (especially, an FTP directory which is supposed to be listed), the URL needs to end in a slash (`/'). If the target document is an HTML page, its source code will be matched against prog.regex, aiming at links embedded in the document. An alternate download URL can be specified via prog.dlexplicit.

In order to cope with particularly complex remote scenarios (such as version-dependent directory hierarchies), multiple url, regex, and transform fields may be specified per program. In this case, the script will match urls and regexes starting with the first and continuously proceeding to the next field of each type (in sync, as long as both of them are available, or using the last one available otherwise) and match the regexp against the corresponding document. In order for this to be of any use, the second (and each potential later) url will have to contain a "__NEWVER__" or "__RAWVER__" placeholder (see ``PLACEHOLDERS'') which will be replaced by the previously matched latest [transformed] version (the same substitution is done for regex). The version that will finally be considered the latest for the program in question will be the one determined by matching the last regex against the last urls document. For an example of how this can be used in practice, see ``EXAMPLES''.

Possible multiple transform fields will be processed in sync with the respective url and regex fields as long as additional transform fields are specified. If there are more url and/or regex fields than transform fields, the last-specified transform expression will be used for further iterations. If, on the other hand, there are more transform than url/regex fields, further retrievals/matches will be done based on the last url/regex. The author has, however, no idea how this could be of any use.

When the config file is rewritten, multiple url, regex, and/or transform fields will be interleaved and retain their relative order.

version (string)
Stores the latest known version of the program. In contrast to prog.dlversion, this is the latest version detected, not the latest version downloaded. If a prog.transform option is set, the stored version will have been transformed from the one matched by prog.regex.

PLACEHOLDERS

In a number of string fields, certain placeholders are subject to substitution by run-time values. These placeholders are (in alphabetical order):
"__DLURL__"
The (file) URL from which the latest version of the respective program was downloaded.
"__FILE__"
The local path to the respective latest-version download.
"__NEWVER__"; "__NEWVER1__", "__NEWVER2__", ...
"__NEWVER__" is replaced by the latest transformed (or untransformed, if no transform expression is in effect) version available as determined by the script.

When using multiple url/regex/transform fields in order to cope with more complex remote site hierarchies, "__NEWVER1__", "__NEWVER2__", ... give access to intermediately-determined versions. In this case, "__NEWVER1__" is replaced by the version matched by the first url/regex/transform tuple, "__NEWVER2__" matches the version matched by the second url/regex/transform tuple, and so on.

"__PROG__"
The name (identifier) of the respective prog section.
"__RAWVER__"; "__RAWVER1__", "__RAWVER2__", ...
"__RAWVER__" is replaced by the latest version available as determined by the script.

When using multiple url/regex fields in order to cope with more complex remote site hierarchies, "__RAWVER1__", "__RAWVER2__", ... give access to intermediately-determined versions. In this case, "__RAWVER1__" is replaced by the version matched by the first url/regex pair, "__RAWVER2__" matches the version matched by the second url/regex pair, and so on.

"__URL__"
The (last and expanded) URL used in order to determine the latest program version.
"__VER__"
A pre-manufactured (non-greedy) regular expression matching version numbers compliant with any of a number of common schemes.

HINTS

  • If you use Vim (version 5 or higher) as your editor, you can tell vcheck to create a Vim syntax file providing syntax highlighting within the editor by running the script as

      vcheck --create-vim-syntax-file
    

    If you wish to have Vim apply the syntax rules automatically when editing ``~/.vcheck'', add this line:

      au BufEnter */.vcheck   so $VIM/syntax/vcheck.vim
    

    or, alternatively, one with an explicit path:

      au BufEnter */.vcheck   so /path/to/syntax/vcheck.vim
    

    to your ``~/.vimrc'' and substitute an appropriate path. Of course you need to as well be sure to copy the file into the designated directory.

  • It's no problem to just check for new versions by default and run vcheck again afterwards to download updated packages. Running the script as

      vcheck -dc
    

    or

      vcheck --download --catch-up
    

    respectively, will try to download only those files whose latest downloaded version has been knowingly obsoleted, without checking again for new versions of all other programs.

  • To check only those program locations that failed during the latest attempt(s), run

      vcheck -e
    

    or

      vcheck --errors
    

    respectively.

  • If you add a line

      dlretry = NUMBER
    

    to your config file's config section, vcheck will retry to download a file up to NUMBER times if it detects that it was received incompletely. This will be the case if:

  • the file has zero size
  • the downloaded file's extension was recognized, and a check by the respective decompressor etc. resulted in errors
  • vcheck caches data retrieved from URLs (unless nocache is set in the config file), so if you specify exactly the same URL for different programs, this won't result in multiple retrievals, thus improving efficiency.
  • If you're curious to know how many program records have actually been accumulated in your config file over time, run vcheck as

      vcheck --syntax
    

    This will check the config file's syntax and, as a side-effect, print the number of programs registered.

  • Even if you know from some other source that there is a new version of a program vcheck is configured for, you can still use that to download the package. Just use its matching capabilities, e.g.:

      vcheck -dm foo
    
  • If one of your records points to patches of some program, and you want to make sure you won't miss an intermediate one when downloading (and suppose you don't run vcheck in download mode too frequently), you can add the boolean field dlintermediate to the respective program's section in the config file, and vcheck will try to download all versions newer than dlversion. Note that in those circumstances, dlversion is set to the latest (intermediate) version the download attempt succeeded for (which means that, if, say, versions 1 through 3 are to be downloaded and all downloads except that of version 1 succeed, dlversion will nevertheless be set to 3). A useful example for this:

      prog Linux/patches = {
        dlintermediate
        dlprefs   = \.bz2$;\.gz$
        dlversion = 2.3.6
        regex     = patch-(__VER__)\.[bg]z
        url       = ftp://ftp.kernel.org/pub/linux/kernel/v2.3/
        version   = 2.3.9
      }
    

    Supposing that 2.3.9 still is the latest version, running this in download mode will retrieve Linux kernel patches 2.3.7 through 2.3.9, *.bz2 preferred to *.gz (but accepting the latter if the former is missing, rather than skipping the download entirely).

  • There may be complex remote site structures, involving version-dependent directory hierarchies, such as the layout used by the server for the AC series of Linux kernel patches. The principal layout of that site looks (or used to look, anyway) like this:

      ...
      .../linux-2.4/2.4.8/patch-2.4.8-ac1.gz
      .../linux-2.4/2.4.8/patch-2.4.8-ac2.gz
      ...
      .../linux-2.4/2.4.9/patch-2.4.9-ac1.gz
      .../linux-2.4/2.4.9/patch-2.4.9-ac2.gz
      ...
    

    The problem here is that the bottom-level directory's name varies depending on the regular Linux version an AC patch is based on. The way to deal with this most conveniently in vcheck looks like this:

      prog Linux/patch/AC = {
        dlintermediate
        url       = http://www.kernel.org/.../linux-2.4/
        regex     = (\d+\.\d+\.\d+)
        url       = http://www.kernel.org/.../linux-2.4/__NEWVER__/
        regex     = patch-(__VER__-ac\d+)\.gz
      }
    

    (Note that the URLs have been abbreviated for the sake of readability.) This kind of configuration will cause vcheck to start by retrieving the first url field's document and match the first regex against it. It will then proceed with the second url field's document, matching it against the second regex, replacing its __NEWVER__ placeholder by the latest version previously matched. The version finally determined as the current version for the program record is the one determined by the last match.

    On a side note, version numbers determined during matches further back than the previous one can be accessed via delimiters of the format __NEWVER#__, where `#' is a number indicating the number (1..) of the url/regex pair's version match it should be replaced by. For more details on the mechanism, see the descriptions of url and regex in ``PROGRAM SECTIONS'', and ``PLACEHOLDERS''.

    Regarding the example, it is left to the user to figure out how to extend the record to even automatically cope with changes to the Linux kernel's minor version. ":-)"

    Here's another example of a three-level hierarchy, which used to fit the GIMP's site layout at one point in time:

      prog GIMP/devel/patch = {
        comment   = "Will download complete package if no patch available."
        dlprefs   = patch-.*?\.bz2$;patch-.*?\.gz$;gimp-.*?\.bz2$;gimp-.*?\.gz$
        url       = ftp://ftp.gimp.org/pub/gimp/
        regex     = (?<!\w)v(__VER__)
        url       = ftp://ftp.gimp.org/pub/gimp/v__NEWVER__/
        regex     = (?<!\w)v(__VER__)
        url       = ftp://ftp.gimp.org/pub/gimp/v__NEWVER1__/v__NEWVER__/
        regex     = (?:patch|gimp)-(__VER__)\.[bgt]
      }
    
  • If you want to retrieve some program whose version is, say, a date in format ``dd-mm-yyyy'', this will be misinterpreted by the version comparator because the most significant sub-``version'' isn't the initial one. You can work around this by specifying some Perl expression transforming the original version in the respective program's section, such as:

      transform = "s/(\d+)-(\d+)-(\d+)/$3-$2-$1/; $_"
    

    This piece of code is given the respective version in $_, and after its evaluation, vcheck replaces the original value by what the eval() returns. Alternatively, this would achieve the same:

      transform = "join '-', reverse split /-/, $_"
    
  • Some sites use redirection scripts for download URLs. Consider a situation where a downloads page lists available packages of a program, with links pointing to some server-side script referring your browser to some URL which is in turn redirected by HTTP means to a final file URL (the PHP site, for example, used to make use of this obscure scheme at one time). The way to cope with this in vcheck consists in using a dlexplicit field like this:

      prog PHP = {
        dlexplicit = http://www.php.net/distributions/php-__VER__.tar.gz
        regex      = php-(__VER__)\.t
        url        = http://www.php.net/downloads.php
      }
    

    Effectively, this will use the actual url field only to determine the current version and then paste it into a pattern of the corresponding download URL, thus bypassing the redirections. The obvious disadvantage of this feature consists in its increased dependency on server-side access structures.

  • Suppose you're interested in some program distributed via more than one package (such as Vim, which is split into a source and a run-time package). The means vcheck provides to cope with this once again is the dlexplicit option:

      prog Vim = {
        dlexplicit = ftp://ftp.vim.org/pub/editors/vim/unix/vim-__VER__-src.tar.gz
        dlexplicit = ftp://ftp.vim.org/pub/editors/vim/unix/vim-__VER__-rt.tar.gz
        regex      = vim-(__VER__)(-src)?\.tar
        url        = ftp://ftp.vim.org/pub/editors/vim/unix/
      }
    
  • In order to have vcheck keep track of what has been downloaded (and when), you might add something like this to your config file:

      config = {
        dlexec = "echo `date +%Y-%m-%d` '__PROG__' '__NEWVER__' >>~/.vchecklog"
      }
    

    Note however that program-specific dlexec will take precedence over this setting.

  • With a little creativity, vcheck can be used to check not only for latest versions of programs or packages, but also web site updates and the like. Also, the newverexec (see ``GRAMMAR'') field can be used to pass a link to an external download tool if for some reason vcheck's abilities prove insufficient for a particular scenario.

EXAMPLES

Please make sure to read what's printed by vcheck when run as

  vcheck --help --grammar

as well as ``GRAMMAR'' before reading this section, to learn about command line parameters and the configuration file's grammar. Done so? Then read on...

Suppose there's a config file ~/.vcheck with the following contents:

  config = {
    dlprefs   = \.tar\.bz2$;\.(tar\.|t)gz$;\.zip$
    lastcheck = "1999-06-21 08:15"
  }
  prog Foo = {
    dl      = no
    errors  = 2
    regex   = foo-(__VER__)\.tar
    urgency = high
    url     = http://www.foo.org/pub/foo/
  }
  prog Bar = {
    dlversion = 0.01beta
    regex     = (?i:bar-(__VER__)\.tar)
    url       = http://www.bar.org/bar/index.html
    version   = 0.01
  }
  prog Baz = {
    regex   = baz-(\d+)\.tar
    urgency = low
    url     = ftp://ftp.baz.net/pub/source/
    version = 123
  }

First of all, you can deduce from this what date and time vcheck was last run at with this config file. Trying to check for Foo resulted in errors of some kind during the last 2 attempts, and since there's no version field, it has presumably never been queried successfully. Foo is never to be downloaded. Bar's latest version as determined during one of the last checks was 0.01, but it wasn't downloaded (0.01beta is the version of the last download). Finally, Baz has never been downloaded (according to the config file, anyway). As for downloads in general, *.tar.bz2 is preferred to *.tar.gz and *.tgz, which in turn are more desirable than *.zip files. If no target matching any of these extensions case-insensitively is found, nothing will be downloaded.

Assume furthermore that the following references are currently mentioned at the respective URLs of each program:

Now let's discuss what some specific calls to vcheck, each based on the above configuration, will result in. Again, for a complete list of command line options (all short options have an equivalent long one), see "`vcheck --help`".

- "$ vcheck -n"
This will check for all programs without updating the config file. It'll report Foo 3.14 as new version (not 4.0, as regex doesn't match this), as well as Baz 124.
- "$ vcheck -d"
This will check for all programs, report as above and try to download the following file:

  http://www.bar.org/bar/BAR-0.01.tar.bz2

Note that Baz 124 isn't among, because there wasn't a link conforming to dlprefs, and downloads of Foo have been disabled explicitly. The errors field of Foo is removed since the check succeeded.

- "$ vcheck -c"
This will set dlversion = version for Bar and Baz, without checking for the availability of new versions. Effectively, this will prevent future calls to vcheck with parameter ""-d"" from downloading these files.
- "$ vcheck -dc"
This will step through all programs that downloads haven't been disabled for in principle and whose dlversion is lower than version (i.e., Bar and Baz in our example). For these, vcheck will requery the respective sites to determine a download URL, and try to download

  http://www.bar.org/bar/BAR-0.01.tar.bz2

as in the above example.

- "$ vcheck -m \!foo"
will check for new versions of Bar and Baz. Note that you may have to quote the leading exclamation mark as well as some characters used in regular expressions specified on the command line, in order to prevent your shell from interpreting them.
- "$ check -u medium -m b"
will check only for Bar, as it is the only program whose urgency is at least medium and whose name contains a `b'.
- "$ vcheck -e"
will check only for Foo, since checking for that failed previously.

NOTES

  • First of all, vcheck isn't perfect, and it won't do in all kinds of situations. Yet I think it is able to cope with most of them, and if there's indeed some site which vcheck isn't able to determine download URLs from, or some version numbering scheme its heuristics choke on, you'll just have to deal with that manually. But for the majority of cases, vcheck should facilitate keeping your setup up-to-date.
  • Don't run multiple instances in non-read-only mode with the same configuration file, or else one will cause the changes made by the others to get lost. vcheck will prevent this situation from arising by employing a lock file, provided that your Perl setup supports it.

    Don't edit the config file while vcheck is running, either, or your changes will be overwritten when the script rewrites the file.

FILES

  • vcheck, the script itself
  • ~/.vcheck, its configuration file

    In fact, vcheck doesn't look for a config file ~/.vcheck, but for one of the same name as the script (with a possible extension stripped off). So if you rename the script to, say, foo.pl and run it, it'll try to open ~/.foo.

  • ~/.vcheck.lock, lock file created when not running in read-only mode

    Actually, the file's name is that of the config file with an extension of .lock added.

ENVIRONMENT VARIABLES

  • $http_proxy/$HTTP_PROXY and $ftp_proxy/$FTP_PROXY, each in this order of precedence, specify the HTTP and/or (HTTP-based) FTP proxy to use, unless overridden. The format is either ``server:port'' or a complete URL.
  • $HOME, the current user's home directory
  • $SHELL, used by Perl in unizoid environments when executing helper applications

TO DO

  • add option to config section allowing for the dlexec entry to be ``inherited'' from the config section rather than be overridden by per-program dlexecs (also define order of execution!)
  • check behavior if an HTTP url's download link references a different target base directory
  • make ``--list'' not rewrite the config file, thus allowing for it to be run in parallel to another instance
  • add an option to re-download the latest version (if local file doesn't exist)?
  • code clean-up: array used for download specifications -> hash
  • determine and describe way of reliably matching directories consisting of but a version number on an FTP server independently of whether the page in question is received by proxy or without one
  • scenario: link description contains version, download link entirely differently named
  • separate "--force" options for overriding disabled, dl?
  • max download size command line parameter?
  • extend Vim syntax file generation to highlight placeholders in string variables' values?
  • follow HTTP redirections
  • evaluate HTTP headers after retrievals
  • resume downloads?
  • XMLize the config file format???
  • make it multi-threaded???

RESTRICTIONS

  • All output is currently printed on STDOUT.
  • Placeholders are used but no way of escaping literal occurrences of those strings is provided.
  • dlprefs uses semicolons as delimiters, but there's no way of escaping them if they are meant as a part of one of the regular expressions.
  • There's presumably little to do in order to get vcheck to run in Microsoft Windows. One issue worth noting is that directories (such as dldir values) are expecedted to use unizoid delimiters (i.e., slashes (`/'))--this should perhaps be revised to be portable.

BUGS

Mail bug reports to the author.

AUTHOR

vcheck is copyright (c) 1999-2001 by Marco Go.tze, <[email protected]>. It is distributed under the terms of the Artistic License, a copy of which is included with the script's distribution. Use at your own risk.