rbldnsd(8) DNS daemon suitable for running DNS-based blocklists

SYNOPSIS

rbldnsd options zone:dataset...

DESCRIPTION

rbldnsd is a small DNS-protocol daemon which is designed to handle queries to DNS-based IP-listing or NAME-listing services. Such services are a simple way to share/publish a list of IP addresses or (domain) names which are "listed" for for some reason, for example in order to be able to refuse a service to a client which is "listed" in some blocklist.

rbldnsd is not a general-purpose nameserver. It will answer to A and TXT (and SOA and NS if such RRs are specified) queries, and has limited ability to answer to some other types of queries.

rbldnsd tries to handle data from two different perspectives: given a set (or several) of "listed entries" (e.g. IP address ranges or domain names), it builds and serves a DNS zone. Note the two are not the same: list of spammer's IPs is NOT a DNS zone, but may be represented and used as such, provided that some additional information necessary to build complete DNS zone (e.g. NS and SOA records, maybe A records necessary for http to work) is available. In this context, rbldnsd is very different from other general-purpose nameservers such as BIND or NSD: rbldnsd operates with datasets (sets of entries - IP addresses or domain names, logically grouped together), while other general-purpose nameservers operates with zones. The way how rbldnsd operates may be somewhat confusing to BIND experts.

For rbldnsd, a building block is a dataset: e.g., set of insecure/abuseable hosts (IP addresses), set of network ranges that belongs to various spam operations (IP ranges), domain names that belong to spammers (RHSBL) and so on. Usually, different kind of information is placed into separate file, for easy maintenance. From a number of such datasets, rbldnsd constructs a number of DNS zones as specified on command line. A single dataset may be used for several zones, and a single zone may be constructed from several datasets.

rbldnsd will answer queries to DNS zones specified on the command line as a set of zone specifications. Each zone specification consists of zone basename, dataset type and a comma-separated list of files that forms a given dataset: zone:type:file,file,...

Several zones may be specified in command line, so that rbldnsd will answer queries to any of them. Also, a single zone may be specified several times with different datasets, so it is possible to form a zone from a combination of several different dataset. The same dataset may be reused for several zones too (and in this case, it will be read into memory only once).

There are several dataset formats available, each is suitable and optimized (in terms of memory, speed and ease of use) for a specific task. Available dataset types may be grouped into the following categories:

lists of IP addresses. When a query is done to a zone with such data, query is interpreted as an IP address in a reverse form (similar to in-addr.arpa zone). If the address is found in dataset data, rbldnsd will return A and TXT records specified in data for that IP. This is a classical IP-based blocklist.
lists of domain names. Similar to list of IP addresses, but with generic domain names instead of IPs (wildcards allowed). This type of data may be used to form a blocklist of e.g. sender domain names.
generic list of various types of records, as an auxilary data to form a complete nameserver. This format is similar to bind-style datafiles, but very simplified. One may specify A, TXT, NS and MX records here.
combined set, different datasets from the list above combined in the single (set of) source files, for easy maintenance.
acl, or Access Control List. This is a pseudo dataset, that works by overweriting query results based on the requestor (peer) IP address.

OPTIONS

The following options may be specified:

-u user[:group]
rbldnsd will change its userid to the specified user, which defaults to rbldns, and group, which by default is the primary group of a user. rbldnsd will refuse to run as the root user, since this is insecure.

-r rootdir
rbldnsd will chroot to rootdir if specified. Data files should be available inside rootdir.

-w workdir
rbldnsd will change its working directory to workdir (after chrooting to rootdir if -r option is also specified). May be used to shorten filename paths.

-b address/port
This option is required. rbldnsd will bind to specified address and port (port defaults to port 53, domain). Either numeric IP address or a hostname may be specified, and either port number or service name is accepted. It is possible to specify several addresses to listen on this way, by repeating -b option. Additionally, if there are several addresses listed for a hostname, rbldnsd will listen on all of them. Note that rbldnsd will work slightly faster if only one listening address is specified. Note the delimiter between host and port is a slash (/), not a colon, to be able to correctly handle IPv6 addresses.

-4
Use IPv4 listening socket/transport, do not attempt to use IPv6 (ignored if rbldnsd was built without IPv6 support).

-6
Use IPv6 listening socket/transport, do not attempt to use IPv4 (this option will be reported as error if IPv6 support was not compiled in).

-t defttl:minttl:maxttl
Set default reply time-to-live (TTL) value to be defttl, and set constraints for TTL to minttl and maxttl. Default applies when there's no TTL defined in a given scope (in data file), and constraints are applied when such value provided in data. Any of the values may be omitted, including trailing colon (:) characters, e.g. "-t 30" set default TTL to be 30 secound, and "-t ::120" or "-t ::2m" sets maximum allowed TTL to 2 minutes. All 3 values are in time units, with optional suffix: s (secounds, default), m (minutes), h (hours), d (days) or w (weeks). Zero minttl or maxttl means no corresponding constraint will be enforced. Default defttl is 35m.

-c check
Set interval between checking for zone file changes to be check, default is 1m (one minute). rbldnsd will check zone file's last modification time every so often, and if it detects a change, zone will be automatically reloaded. Setting this value to 0 disables automatic zone change detection. This procedure may also be triggered by sending a SIGHUP signal to rbldnsd (see SIGNALS section below).

-e
Allow non-network addresses to be used in CIDR ranges. Normally, rbldnsd rejects addresses such as 127.2.3.4/24, where prefix is not within the network mask derived from bit number (in this case, correct form is 127.2.3.0/24, note the trailing zero in prefix). This is done in order to catch possible typos in zones (rbldnsd will warn about a problem and will ignore such entry). This option disables checking whether the CIDR prefix fits within the network mask.

-q
Quick and quiet start. Normally, rbldnsd does socket initialization and zone load in foreground, writing progress and statistic to standard output, and aborts in case of any errors. With this flag, nothing will be printed and first zone load will be done in background (unless -n option is also given), and zone load errors will be non-fatal.

-p pidfile
Write rbldnsd's pid to the specified pidfile, so it will be easily findable. This file gets written before entering a chroot jail (if specified) and before changing userid, so it's ok to specify e.g. /var/run/rbldnsd.pid here.

-l logfile
Specifies a file to which log all requests made. This file is created after entering a chroot jail and becoming a user. Logfiles may be quite large, esp. on busy sites (rbldnsd will log every recognized request if this option is specified). This option is mainly intended for debugging purposes. Upon receiption of SIGHUP signal, rbldnsd reopens its logfile. If logfile prefixed with a plus sign (+), logging will not be buffered (i.e. each line will be flushed to disk); by default, logging is buffered to reduce system load. Specify a single hyphen (-) as a filename to log to standard output (filedescriptor 1), either buffered by default, or line-buffered if specified as `+-' (standard output will not be "reopened" upon receiving SIGHUP signal, but will be flushed in case logging is buffered).

-s statsfile
Specifies a file where rbldnsd will write a line with short statistic summary of queries made per zone, every check (-c) interval. Format of each line is:
  timestamp zone:qtot:qok:qnxd:bin:bout zone:...
where timestamp is unix time (secounds since epoch), zone is the name of the base zone, qtot is the total number of queries received, qok - number of positive replies, qnxd - number of NXDOMAIN replies, bin is the total number of bytes read from network (excluding IP/UDP overhead and dropped packets), bout is the total number of bytes written to network. Ther are as many such tuples as there are zones, and one extra, total typle at the end, with zone being "*", like:
  1234 bl1.ex:10:5:4:311:432 bl2.ex:24:13:7:248:375 *:98:35:12:820:987
Note the total values may be larger than the sum of per-zone values, due to queries made against unlisted zones, or bad/broken packets.

Rbldnsd will write bare timestamp to statsfile when it is starting up, shutting down or when statistic counters are being reset after receiving SIGUSR2 signal (see below), to indicate the points where the counters are starting back from zero.

By default, rbldnsd writes absolute counter values into statsfile (number of packets (bytes) since startup or last reset). statsfile may be prefixed with plus sign (+), in which case rbldnsd will write delta values, that is, number of packets or bytes since last write, or number of packets (bytes) per unit of time ("incremental" mode, hence the "+" sign).

-n
Do not become a daemon. Normally, rbldnsd will fork and go to the background after successful initialization. This option disables this behaviour.

-f
Request rbldnsd to continue processing requests during data reloads. Rbldnsd forks a child process to handle requests while parent reloads the data. This ensures smooth operations, but requires more memory, since two copies of data is keept in memory during reload process.

-d
Dump all zones to stdout in BIND format and exit. This may be suitable to convert easily editable rbldnsd-style data into BIND zone. rbldnsd dumps all zones as one stream, so one may want to specify only one zone with -d. Zone file will have appropriate $ORIGIN tags. Note that data generated may be really huge (as BIND format isn't appropriate for this sort of things), and some entries may not be really the same in BIND as in rbldnsd (e.g., IP netblocks of large size will be represented as wildcard entries - 10.0.0.0/8 will become *.10; excluded entries will be represented by a CNAME to `excluded' name, so such name should not be present in a data set). In this mode, rbldnsd ignores -r (root directory) option.

-v
Do not show exact version information in response to version.bind CH TXT queries (by default rbldnsd responds to such queries since version 0.98). With single -v, rbldnsd will only return "rbldnsd" to the caller, without the version number. Second -v disables providing any information in response to such requests, i.e. rbldnsd will return REFUSE code.

-C
Disable automatic on-the-fly uncompression of data files if this feature is compiled in (see below).

-A
-a
Controls "laziness" of rbldnsd when constructing replies. With -a specified, rbldnsd does not add AUTH section (containing NS records) to replies unless explicitly asked for NS records. It is equivalent to BIND9 "minimal-answers" configuration setting. While with -A specified, rbldnsd will always fill in AUTH section, increasing size of replies dramatically but allowing (caching) resolver clients to react faster to changes in nameserver lists. Currently (as of 0.996 version), non-lazy (as with -A) mode is the default, but it will change in future release.

-x extension
Load the given extension file (a dynamically-linked library, usually with ".so" suffix). This allows to gather custom statistics or perform other custom tasks. See separate document for details about building and using extensions. This feature is not available on all platforms, and can be disabled at compile time.

-X extarg
Pass the given argument, extarg, to the extension loaded with -x.

DATASET TYPES AND FORMATS

Dataset files are text files which are interpreted depending on type specified in command line. Empty lines and lines starting with hash character (#) or semicolon (;) are ignored, except for a special case outlined below in section titled "Special Entries".

A (comma-separated) list of files in dataset specification (in type:file,file,...) is interpreted as if all files where logically combined into one single file.

When compiled with zlib support, rbldnsd is able to read gzip-compressed data files. So, every file in dataset specification can be compressed with gzip(1), and rbldnsd will read such a file decompressing it on-the-fly. This feature may be turned off by specifying -C option.

rbldnsd is designed to service a DNSBL, where each entry have single A record and optional TXT record assotiated with it. rbldnsd allows to specify A value and TXT template either for each entry individually, or to use default A value and TXT template pair for a group of entries. See section "Resulting A values and TXT templates" below for a way to specify them.

Special Entries

If a line starts with a dollar sign ($), hash character and a dollar sign (#$), semicolon and dollar sign (;#) or colon and a dollar sign (:$), it is interpreted in a special way, regardless of dataset type (this is one exception where a line starting with hash character is not ignored - to be able to use zone files for both rbldnsd and for DJB's rbldns). The following keywords, following a dollar sign, are recognized:

$SOA ttl origindn persondn serial refresh retry expire minttl
Specifies SOA (Start Of Authority) record for all zones using this dataset. Only first SOA record is interpreted. This is the only way to specify SOA - by default,
rbldnsd will not add any SOA record into answers, and will REFUSE to answer to certain queries (notably, SOA query to zone's base domain name). It is recommended, but not mandatory to specify SOA record for every zone. If no SOA is given, negative replies will not be cacheable by caching nameservers. Only one, first $SOA line is recognized in every dataset (all subsequent $SOA lines encountered in the same dataset are silently ignored). When constructing a zone, SOA will be taken from first dataset where $SOA line is found, in an order as specified in command line, subsequent $SOA lines, if any, are ignored. This way, one may overwrite $SOA found in 3rd party data by prepending small local file to the dataset in question, listing it before any other files.
If serial value specified is zero, timestamp of most recent modified file will be substituted as serial.
If ttl field is zero, default ttl (-t option or last $TTL value, see below) will be used.
All time fields (ttl, refresh, retry, expire, minttl) may be specified in time units. See -t option for details.

$NS ttl nameserverdn nameserverdn...
Specifies NS (Name Server) records for all zones using this dataset. Only first $NS line in a dataset is recognized, all subsequent lines are silently ignored. When constructing a zone from several datasets, rbldnsd uses nameservers from $NS line in only first dataset where $NS line is given, in command-line order, just like for $SOA record. Only first 32 namservers are recognized. Individual nameserver(s) may be prefixed with a minus sign (-), which means this single nameserver will be ignored by
rbldnsd. This is useful to temporary comment out one nameserver entry without removing it from the list. If ttl is zero, default ttl will be used. The list of NS records, just like $SOA value, are taken from the first data file in a dataset where the $NS line is found, subsequent $NS lines, if any, are ignored.

$TTL time-to-live
Specifies TTL (time-to-live) value for all records in current dataset. See also -t option. $TTL special overrides -t value on a per-dataset basis.

$TIMESTAMP dstamp [expires]
(experimental) Specifies the data timestamp dstamp when the data has been generated, and optionally when it will expire. The timestamps are in form yyyy:mm:dd[:hh[:mi[:ss]]], where yyyy is the year like 2005, mm is the month number (01..12), dd is the month day number (01..31), hh is hour (00..23), mi and ss are minutes and secounds (00.59); hours, minutes and secounds are optional and defaults to 0; the delimiters (either colon or dash may be used) are optional too, but are allowed for readability. Also, single zero (0) or dash (-) may be used as dstamp and/or expires, indicating the value is not given. expires may also be specified as +rel, where rel is a time specification (probably with suffix like s, m, h, d) as an offset to dstamp. rbldnsd compares dstamp with current timestamp and refuses to load the file if dstamp specifies time in the future. And if expires is specified, rbldnsd will refuse to service requests for that data if current time is greather than the value specified in expires field.
Note that rbldnsd will check the data expiry time every time it checks for data file updates (when receiving SIGHUP signal or every -c interval). If automatic data reload timer (-c option) is disabled, zones will not be exipired automatically.

$MAXRANGE4 range-size
Specifies maximum size of IPv4 range allowed for IPv4-based datasets. If an entry covers more IP addresses than range-size, it will be ignored (and a warning will be logged). range-size may be specified as a number of hosts, like 256, or as network prefix lenght, like /24 (the two are the same):
  $MAXRANGE4 /24
  $MAXRANGE4 256
This constraint is active for a dataset it is specified in, and can be owerwritten (by subsequent $MAXRANGE statement) by a smaller value, but can not be increased.

$n text
(n is a single digit). Specifies a substitution variable for use as $n placeholders (the $n entries are ignored in generic daaset). See section "Resulting A values and TXT templates" below for description and usage examples.

$= text
Set the base template for all individual TXT records. See section "Resulting A values and TXT templates" below for more information.

ip4set Dataset

A set of IP addresses or CIDR address ranges, together with A and TXT resulting values. IP addresses are specified one per line, by an IP address prefix (initial octets), complete IP address, CIDR range, or IP prefix range (two IP prefixes or complete addresses delimited by a dash, inclusive). Examples, to specify 127.0.0.0/24:

  127.0.0.0/24
  127.0.0
  127/24
  127-127.0.0
  127.0.0.0-127.0.0.255
  127.0.0.1-255
to specify 127.16.0.0-127.31.255.255:
  127.16.0.0-127.31.255.255
  127.16.0-127.31.255
  127.16-127.31
  127.16-31
  127.16.0.0/12
  127.16.0/12
  127.16/12
Note that in prefix range, last boundary is completed with all-ones (255), not all-zeros line with first boundary and a prefix alone. In prefix ranges, if last boundary is only one octet (127.16-31), it is treated as "suffix", as value of last specified octet of the first boundary prefix (127.16.0-31 is treated as 127.16.0.0-127.16.31.255, i.e. 127.16.0.0/19).

After an IP address range, A and TXT values for a given entry may be specified. If none given, default values in current scope (see below) applies. If a value starts with a colon, it is interpreted as a pair of A record and TXT template, delimited by colon (:127.0.0.2:This entry is listed). If a value does not start with a colon, it is interpreted as TXT template only, with A record defaulting to the default A value in current scope.

IP address range may be followed by a comment char (either hash character (#) or semicolon (;)), e.g.:

  127/8 ; loopback network
In this case all characters up to the end of line are ignored, and default A and TXT values will be used for this IP range.

Every IP address that fits within any of specified ranges is "listed", and rbldnsd will respond to reverse queries against it within specified zone with positive results. In contrast, if an entry starts with an exclamation sign (!), this is an exclusion entry, i.e. corresponding address range is excluded from being listed (and any value for this record is ignored). This may be used to specify large range except some individual addresses, in a compact form.

If a line starts with a colon (:), this line specifies the default A value and TXT template to return (see below) for all subsequent entries up to end of current file. If no default entry specified, and no value specified for a given record, rbldnsd will return 127.0.0.2 for matching A queries and no record for matching TXT queries. If TXT record template is specified and contains occurences of of dollar sign ($), every such occurence is replaced with an IP address in question, so singe TXT template may be used to e.g. refer to a webpage for an additional information for a specific IP address.

ip4trie Dataset

Set of IP4 CIDR ranges with corresponding (A, TXT) values. This dataset is similar to ip4set, but uses a different internal representation. It accepts CIDR ranges only (not a.b.c.d-e.f.g.h), and allows for the specification of A/TXT values on a per CIDR range basis. (If multiple CIDR ranges match a query, the value for longest matching prefix is returned.) Exclusions are supported too.

This dataset is not particularly memory-efficient for storing many single IP addresses --- it uses about 50% more memory than the ip4set dataset in that case. The ip4trie dataset is better adapted, however, for listing CIDR ranges (whose lengths are not a multiple of 8 bits.)

ip4tset Dataset

"trivial" ip4set: a set of single IP addresses (one per line), with the same A+TXT template. This dataset type is more efficient than ip4set (in both memory usage and access times), but have obvious limitation. It is intended for DNSBLs like DSBL.org, ORDB.org and similar, where each entry uses the same default A+TXT template. This dataset uses only half a memory for the same list of IP addresses compared to ip4set.

ip6trie Dataset

Set of IP6 CIDR ranges. This is the IP6 equivalent of the ip4trie dataset. It allows the sepecification of individual A/TXT values for each CIDR range and supports exclusions. Compressed ("::") ip6 notation is supported.

Example zone data:

  # Default A and TXT template valuse
  :127.0.1.2: Listed, see http://example.com/lookup?$
  # A listing, note that trailing :0s can be omitted
  2001:21ab:c000/36
  # /64 range with non-default A and TXT values
  2001:21ab:def7:4242 :127.0.1.3: This one smells funny
  # compressed notation
  2605:6001:42::/52
  ::1                   # localhost
  !2605:6001:42::bead   # exclusion

ip6tset Dataset

"Trivial" ip6 dataset: a set of /64 IP6 CIDR ranges (one per line), all sharing a single A+TXT template. Exclusions of single IP6 (/128) addresses are also supported. This dataset type is quite memory-efficient --- it uses about 40% of the memory that the ip6trie dataset would use --- but has obvious limitations.

This dataset wants the /64s listed as four ip6 words, for example:

  2001:20fe:23:41ed
  abac:adab:ad00:42f
Exclusions are denoted with a leading exclamation mark. You may also use compressed "::" notation for excluded addresses. E.g.:
  !abac:adab:ad00:42f:face:0f:a:beef
  !abac:adab:ad00:42f::2

dnset Dataset

Set of (possible wildcarded) domain names with associated A and TXT values. Similar to ip4set, but instead of IP addresses, data consists of domain names (not in reverse form). One domain name per line, possible starting with wildcard (either with star-dot (*.) or just a dot). Entry starting with exclamation sign is exclusion. Default value for all subsequent lines may be specified by a line starting with a colon.

Wildcards are interpreted as follows:

example.com
only example.com domain is listed, not subdomains thereof. Not a wildcard entry.
*.example.com
all subdomains of example.com are listed, but not example.com itself.
.example.com
all subdomains of example.com and example.com itself are listed. This is a shortcut: to list a domain name itself and all it's subdomains, one may either specify two lines (example.com and *.example.com), or one line (.example.com).

This dataset type may be used instead of ip4set, provided all CIDR ranges are expanded and reversed (but in this case, TXT template will be expanded differently).

generic Dataset

Generic type, simplified bind-style format. Every record should be on one line (line continuations are not supported), and should be specified completely (i.e. all domain names in values should be fully-qualified, entry name may not be omitted). No wildcards are accepted. Only A, TXT, and MX records are recognized. TTL value may be specified before record type. Examples:

 # bl.ex.com
 # specify some values for current zone
 $NS 0 ns1.ex.com ns2.ex.com
 # record with TTL
 www 3000 A 127.0.0.1
 about TXT "ex.com combined blocklist"

combined Dataset

This is a special dataset that stores no data by itself but acts like a container for several other datasets of any type except of combined type itself. The data file contains an optional common section, where various specials are recognized like $NS, $SOA, $TTL (see above), and a series of sections, each of which defines one (nested) dataset and several subzones of the base zone, for which this dataset should be consulted. New (nested) dataset starts with a line

  $DATASET type[:name] subzone subzone...
and all subsequent lines up to the end of current file or to next $DATASET line are interpreted as a part of dataset of type type, with optional name (name is used for logging purposes only, and the whole ":name" (without quotes or square brackets) part is optional). Note that combined datasets cannot be nested. Every subzone will always be relative to the base zone name specified on command line. If subzone specified as single character "@", dataset will be connected to the base zone itself.

This dataset type aims to simplify subzone maintenance, in order to be able to include several subzones in one file for easy data transfer, atomic operations and to be able to modify list of subzones on remote secondary nameservers.

Example of a complete dataset that contains subzone `proxies' with a list of open proxies, subzone `relays' with a list of open relays, subzone `multihop' with output IPs of multihop open relays, and the base zone itself includes proxies and relays but not multihops:

  # common section
  $NS 1w ns1.ex.com ns2.ex.com
  $SOA 1w ns1.ex.com admin.ex.com 0 2h 2h 1w 1h
  # list of open proxies,
  # in `proxies' subzone and in base zone
  $DATASET ip4set:proxy proxies @
  :2:Open proxy, see http://bl.ex.com/proxy/$
  127.0.0.2
  127.0.0.10
  # list of open relays,
  # in `relays' subzone and in base zone
  $DATASET ip4set:relay relays @
  :3:Open relay, see http://bl.ex.com/relay/$
  127.0.0.2
  127.0.2.10
  # list of optputs of multistage relays,
  # in `multihop' subzone only
  $DATASET ip4set:multihop-relay multihop
  :4:Multihop open relay, see http://bl.ex.com/relay/$
  127.0.0.2
  127.0.9.12
  # for the base zone and all subzones,
  # include several additional records
  $DATASET generic:common proxies relays multihop @
  @ A 127.0.0.8
  www A 127.0.0.8
  @ MX 10 mx.ex.com
  # the above results in having the following records
  # (provided that the base zone specified is bl.ex.com):
  #  proxies.bl.ex.com A 127.0.0.8
  #  www.proxies.bl.ex.com 127.0.0.8
  #  relays.bl.ex.com A 127.0.0.8
  #  www.relays.bl.ex.com 127.0.0.8
  #  multihop.bl.ex.com A 127.0.0.8
  #  www.multihop.bl.ex.com 127.0.0.8
  #  bl.ex.com A 127.0.0.8
  #  www.bl.ex.com 127.0.0.8

Note that $NS and $SOA values applies to the base zone only, regardless of the placement in the file. Unlike the $TTL values and $n substitutions, which may be both global and local for a given (sub-)dataset.

Resulting A values and TXT templates

In all zone file types except generic, A values and TXT templates are specified as following:

  :127.0.0.2:Blacklisted: http://example.com/bl?$
If a line starts with a colon, it specifies default A and TXT for all subsequent entries in this dataset. Similar format is used to specify values for individual records, with the A value (enclosed by colons) being optional:
  127.0.0.2 :127.0.0.2:Blacklisted: http://example.com/bl?$
or, without specific A value:
  127.0.0.2 Blacklisted: http://example.com/bl?$

Two parts of a line, delimited by second colon, specifies A and TXT record values. Both are optional. By default (either if no default line specified, or no IP address within that line), rbldnsd will return 127.0.0.2 as A record. 127.0.0 prefix for A value may be omitted, so the above example may be simplified to:

  :2:Blacklisted: http://example.com/bl?$
There is no default TXT value, so rbldnsd will not return anything for TXT queries it TXT isn't specified.

When A value is specified for a given entry, but TXT template is omitted, there may be two cases interpreted differently, namely, whenever there's a second semicolon (:) after the A value. If there's no second semicolon, default TXT value for this scope will be used. In contrast, when second semicolon is present, no TXT template will be generated at all. All possible cases are outlined in the following example:

  # default A value and TXT template
  :127.0.0.2:IP address $ is listed
  # 127.0.0.4 will use default A and TXT
  127.0.0.4
  # 127.0.0.5 will use specific A and default TXT
  127.0.0.5 :5
  # 127.0.0.6 will use specific a and no TXT
  127.0.0.6 :6:
  # 127.0.0.7 will use default A and specific TXT
  127.0.0.7 IP address $ running an open relay

In a TXT template, references to substitution variables are replaced with values of that variables. In particular, single dollar sign ($) is replaced by a listed entry (an IP address in question for IP-based datasets and the domain name for domain-based datasets). $n-style constructs, where n is a single digit, are replaced by a substitution variable $n defined for this dataset in current scope (see section "Special Entries" above). To specify a dollar sign as-is, use $$.

For example, the following lines:

  $1 See http://www.example.com/bl
  $2 for details
  127.0.0.2  $1/spammer/$ $2
  127.0.0.3  $1/relay/$ $2
  127.0.0.4  This spammer wants some $$$$.  $1/$
will result in the following text to be generated:
  See http://www.example.com/bl/spammer/127.0.0.2 for details
  See http://www.example.com/bl/relay/127.0.0.3 for details
  This spammer wants some $$.  See http://www.example.com/bl/127.0.0.4

If the "base template" ($= variable) is defined, this template is used for expansion, instead of the one specified for an entry being queried. Inside the base template, $= construct is substituted with the text given for individual entries. In order to stop usage of base template $= for a single record, start it with = (which will be omitted from the resulting TXT value). For example,

  $0 See http://www.example.com/bl?$= ($) for details
  127.0.0.2     r123
  127.0.0.3
  127.0.0.4     =See other blocklists for details about $
produces the following TXT records:
  See http://www.example.com/bl?r123 (127.0.0.2) for details
  See http://www.example.com/bl?127.0.0.3 (127.0.0.3) for details
  See other blocklists for details about 127.0.0.4

acl Dataset

This is not a real dataset, while the syntax and usage is the same as with other datasets. Instead of defining which records exists in a given zone and which do not, the acl dataset specifies which client hosts (peers) are allowed to query the given zone. The dataset specifies a set of IPv4 and/or IPv6 CIDR ranges (with the syntax exactly the same as understood by the ip4trie and ip6trie datasets), together with action specifiers. When a query is made from an IP address listed (not for the IP address), the specified action changes rules used to construct the reply. Possible actions and their meanings are:

:ignore
ignore all queries from this IP address altogether. rbldnsd acts like there was no query received at all. This is the default action.
:refuse
refuse all queries from the IP in question. rbldnsd will always return REFUSED DNS response code.
:empty
pretend there's no data in all other datasets for the given zone. This means that all the clients in question will always receive reply from rbldnsd telling that the requested IP address or domain name is not listed in a given DNSBL. rbldnsd still replies to metadata queries (SOA and NS records, and to all queries satisfied by generic dataset if specified for the given zone) as usual.
:pass
process the request as usual. This may be used to add a "whitelisting" entry for a network/host bloked by another (larger) ACL entry.
a_txt_template
usual A+TXT template as used by other datasets. This means that rbldnsd will reply to any valid DNSBL query with "it is listed" answer, so that the client in question will see every IP address or domain name is listed in a given DNSBL. TXT record used in the reply is taken from the acl dataset instead of real datasets. Again, just like with empty case, rbldnsd will continue replying to metadata queries (including generic datasets if any) as usual.

Only one ACL dataset can be specified for a given zone, and each zone must have at least one non-acl dataset. It is also possible to specify one global ACL dataset, by specifying empty zone name (which is not allowed for other dataset types), like

  rbldnsd ... :acl:filename...

In this case the ACL defined in filename applies to all zones. If there are both global ACL and local zone-specific ACL specified, both will be consulted and actions taken in the order specified above, ie, if either ACL returns ignore for this IP, the request will be ignored, else if either ACL returns refuse, the query will be refused, and so on. If both ACLs specifies "always listed" A+TXT template, the reply will contain A+TXT from global ACL.

For this dataset type, only a few $-style specials are recognized. In particular, $SOA and $NS keywords are not allowed. When rbldnsd performs $ substitution in the TXT template returned from ACL dataset, it will use client IP address to substitute for a single $ character, instead of the IP address or domain name found in the original query.

SIGNALS

Rbldnsd handles the following signals:

SIGHUP
recheck zone files and reload any outdated ones. This is done automatically if enabled, see -c option. Additionally, rbldnsd will reopen logfile upon receiving SIGHUP, if specified (-l option).

SIGTERM, SIGINT
Terminate process.

SIGUSR1
Log current statistic counters into syslog. Rbldnsd collects how many packets it handled, how many bytes was received, sent, how many OK requests/replies (and how many answer records) was received/sent, how many NXDOMAIN answers was sent, and how many errors/refusals/etc was sent, in a period of time.

SIGUSR2
The same as SIGUSR1, but reset all counters and start new sample period.

NOTES

Some unsorted usage notes follows.

Generating and transferring data files

When creating a data file for rbldnsd (and for anything else, it is a general advise), it is a good idea to create the data in temporary file and rename the temp file when all is done. Never try to write to the main file directly, it is possible that at the same time, rbldnsd will try to read it and will get incomplete data as the result. The same applies to copying data using cp(1) utility and similar (including scp(1)), that performs copying over existing data. Even if you're sure noone is reading the data while you're copying or generating it, imagine what will happen if you will not be able to complete the process for whatever reason (interrupt, filesystem full, endless number of other reasons...). In most cases is better to keep older but correct data instead of leaving incomplete/corrupt data in place.

Right:

  scp remote:data target.tmp && mv target.tmp target
Wrong:
  scp remote:data target
Right:
  ./generate.pl > target.tmp && mv target.tmp target
Wrong:
  ./generate.pl > target

From this point of view, rsync(1) command seems to be safe, as it always creates temporary file and renames it to the destination only when all is ok (but note the --partial option, which is good for downloading something but may be wrong to transfer data files -- usually you don't want partial files to be loaded). In contrast, scp(1) command is not safe, as it performs direct copying. You may still use scp(1) in a safe manner, as shown in the example above.

Also try to eliminate a case when two (or more) processes performs data copying/generation at the same time to the same destination. When your data is generated by a cron job, use file locking (create separate lock file (which should never be removed) and flock/fcntl it in exclusive mode without waiting, exiting if lock fails) before attempting to do other file manipulation.

Absolute vs relative domain names

All keys specified in dataset files are always relative to the zone base DN. In contrast, all the values (NS and SOA records, MX records in generic dataset) are absolute. This is different from BIND behaviour, where trailing dot indicates whenever this is an absolute or relative DN. Trailing dots in domain names are ignored by rbldnsd.

Aggregating datasets

Several zones may be served by rbldnsd, every zone may consist of several datasets. There are numerous ways to combine several data files into several zones. For example, suppose you have a list of dialup ranges in file named `dialups', and a list of spammer's ip addresses in file named `spammers', and want to serve 3 zones with rbldnsd: dialups.bl.ex.com, spam.bl.ex.com and bl.ex.com which is a combination of the two. There are two ways to do this:

 rbldnsd options... \
   dialups.bl.ex.com:ip4set:dialups \
   spam.bl.ex.com:ip4set:spammers \
   bl.ex.com:ip4set:dialups,spammers

or:

 rbldnsd options... \
   dialups.bl.ex.com:ip4set:dialups \
   spam.bl.ex.com:ip4set:spammers \
   bl.ex.com:ip4set:dialups \
   bl.ex.com:ip4set:spammers

(note you should specify combined bl.ex.com zone after all its subzones in a command line, or else subzones will not be consulted at all).

In the first form, there will be 3 independent data sets, and every record will be stored 2 times in memory, but only one search in internal data structures will be needed to resolve queries for aggregate bl.ex.com. In second form, there will be only 2 data sets, every record will be stored only once (both datasets will be reused), but 2 searches will be performed by rbldnsd to answer queries against aggregate zone (but difference in speed is almost unnoticeable). Note that when aggregating several data files into one dataset, an exclusion entry in one file becomes exclusion entry in the whole dataset (which may be a problem when aggregating dialups, where exclusions are common, with open relays/proxies, where exclusions are rare if at all used).

Similar effect may be achieved by using combined dataset type, sometimes more easily. combined dataset results in every nested dataset to be used independantly, like in second form above.

combined dataset requires rbldnsd to be the authoritative nameserver for the whole base zone. Most important, one may specify SOA and NS records for the base zone only. So, some DNSBLs which does not use a common subzone for the data, cannot use this dataset. An example being DSBL.org DNSBL, where each of list.dsbl.org, multihop.dsbl.org and unconfirmed.dsbl.org zones are separate, independant zones with different set of nameservers. But for DSBL.org, where each dataset is really independant and used only once (there's no (sub)zone that is as a combinations of other zones), combined dataset isn't necessary. In contrast, SORBS.net zones, where several subzones used and main zone is a combination of several subzones, combined dataset is a way to go.

All authoritative nameservers should be set up similarily

When you have several nameservers for your zone, set them all in a similar way. Namely, if one is set up using combined dataset, all the rest should be too, or else DNS meta-data will be broken. This is because metadata (SOA and NS) records returned by nameservers using combined and other datasets will have different origin. With combined dataset, rbldnsd return NS and SOA records for the base zone, not for any subzone defined inside the dataset. Given the above example with dialups.bl.ex.com, spammers.bl.ex.com and aggregate bl.ex.com zones, and two nameservers, first is set up in any ways described above (using individual datasets for every of the 3 zones), and second is set up for the whole bl.ex.com zone using combined dataset. In this case, for queries against dialups.bl.ex.com, first nameserver will return NS records like

  dialups.bl.ex.com. IN NS a.ns.ex.com.
while second will always use base zone, and NS records will look like
  bl.ex.com. IN NS a.ns.ex.com.
All authoritative nameservers for a zone must have consistent metadata records. The only way to achieve this is to use similar configuration (combined or not) on all nameservers. Have this in mind when using other software for a nameserver.

Generic dataset usage

generic dataset type is very rudimentary. It's purpose is to complement all the other type to form complete nameserver that may answer to A, TXT and MX queries. This is useful mostly to define A records for HTTP access (relays.bl.example.com A, www.bl.example.com A just in case), and maybe descriptive texts as a TXT record.

Since rbldnsd only searches one, most closely matching (sub)zone for every request, one cannot specify a single e.g. generic dataset in form

  proxies      TXT list of open proxies
  www.proxies  A 127.0.0.8
  relays       TXT list of open relays
  www.relays   A 127.0.0.9
for several (sub)zones, each of which are represented as a zone too (either in command line or as combined dataset). Instead, several generic datasets should be specified, separate one for every (sub)zone. If the data for every subzone is the same, the same, single dataset may be used, but it should be specified for every zone it should apply to (see combined dataset usage example above).

BUGS

Most of the bugs outlined in this section aren't really bugs, but present due to non-standartized and thus unknown expected behaviour of a nameserver that serves a DNSBL zone. rbldnsd matches BIND runtime behaviour where appropriate, but not always.

rbldnsd lowercases some domain names (the ones that are lookup keys, e.g. in `generic' and `dnset' datasets) when loading, to speed up lookup operations. This isn't a problem in most cases.

There is no TCP mode. If a resource record does not fit in UDP packet (512 bytes), it will be silently ignored. For most usages, this isn't a problem, because there should be only a few RRs in an answer, and because one record is usually sufficient to decide whenever a given entry is "listed" or not. rbldnsd isn't a full-featured nameserver, after all.

rbldnsd will not always return a list of nameserver records in the AUTHORITY section of every positive answer: NS records will be provided (if given) only if there's a room for them in single UDP packet. If records does not fit, AUTHORITY section will be empty.

rbldnsd does not allow AXFR operations. For DNSBLs, AXFR is the stupidiest yet common thing to do - use rsync for zone transfers instead. This isn't a bug in rbldnsd itself, but in common practice of using AXFR and the like to transfer huge zones in a format which isn't suitable for such a task. Perhaps in the future, if there will be some real demand, I'll implement AXFR "server" support (so that rbldnsd will be able to act as master for BIND nameservers, but not as secondary), but the note remains: use rsync.

rbldnsd truncates all TXT records to be at most 255 bytes. DNS specs allows longer TXTs, but long TXTs is something that should be avoided as much as possible - TXT record is used as SMTP rejection string. Note that DNS UDP packet is limited to 512 bytes. rbldnsd will log a warning when such truncation occurs.

VERSION

This manpage corresponds to rbldnsd version 0.997.

AUTHOR

The rbldnsd daemon written by Michael Tokarev <[email protected]>, based on ideas by Dan Bernstein and his djbdns package, with excellent contributions by Geoffrey T. Dairiki <[email protected]>.

LICENCE

Mostly GPL, with some code licensed under 3-clause BSD license.