Date::Manip::Recur(3) methods for working with recurring events

SYNOPSIS


use Date::Manip::Recur;
$date = new Date::Manip::Recur;

DESCRIPTION

This module contains functions useful in parsing and manipulating recurrences. A recurrence is a notation for specifying when a recurring event occurs. For example, if an event occurs every other Friday or every 4 hours, this can be defined as a recurrence. A fully specified recurrence consists of the following pieces of information:
Frequency
The most basic piece of information is the frequency. For relatively simple recurring events, the frequency defines when those events occur. For more complicated recurring events, the frequency tells approximately when the events occur (but to get the actual events, certain modifiers must be applied as described below).

Examples of recurring events include:

   the first of every month
   every other day
   the 4th Thursday of each month at 2:00 PM
   every 2 hours and 30 minutes

All of these can be expressed as a frequency.

NOTE: unlike date parsing, support for frequencies written out in English (or whatever language you are working in) is extremely limited. For example, the string ``the first of every month'' will NOT be parsed as a valid frequency. A limited number of frequencies can be expressed in a written out form (see the <L/``OTHER FREQUENCY FORMATS'' section below), but most must be expressed in the format described below in <L/``FREQUENCY NOTATION''>. In this document however, the written out form will often be used for the sake of clarity.

Since a frequency typically refers to events that could happen an infinite number of times, you usually have to specify a date range to get the actual dates. Some frequencies also require a base date (i.e. information about when one such even actually occurred) since the frequency is otherwise ambiguous. For example, the frequency 'every other day' does not include enough information to specify the dates that the event happened on, so you have to explicitly define one of them. Then all others can be derived.

Modifier
Complex recurring events may require the use of modifiers in order to get them correct.

For example, in America, many places treat both Thanksgiving and the day after as holidays. Thanksgiving is easy to define since it is defined as:

   4th Thursday of every November

In the frequency notation (described below), this would be written as:

   1*11:4:5:0:0:0

The day after Thanksgiving is NOT possible to define in the same way. Depending on the year, the day after the 4th Thursday may be the 4th or 5th Friday.

The only way to accurately define the day after Thanksgiving is to specify a frequency and a modifier:

   4th Thursday of every November
   +1 day

In frequency notation, this can be expressed as:

   1*11:4:5:0:0:0*FD1

The syntax for the various modifiers is described below in the ``MODIFIERS'' section.

Base date
Many recurrences have a base date which is a date on which a recurring event is based.

The base date is not necessarily a date where the recurring event occurs. Instead, it may be modified (with modifiers, or with values specified in the recurrence) to actually produce a recurring event.

For example, if the frequency is

   every other Friday at noon

the base date will be a Friday and the recurring event will happen on that Friday, Friday two weeks later, Friday four weeks later, etc. In all cases, the dates will be modified to be at noon.

If the frequency has a modifier, such as:

   every other Friday
   + 1 day

(and yes, this trivial example could be expressed as the frequency 'every other Saturday' with no modifiers), then the base date is still on a Friday, but the actual recurring event is determined by applying modifiers and occurs on Saturday.

Recurring events are assigned a number with the event that is referred to by the base date being the 0th occurrence, the first one after that as the 1st occurrence, etc. Recurring events can also occur before the base date with the last time the recurring event occurred before the base date is the -1th occurrence.

So, if the frequency is

   the first of every month

and the base date is 'Mar 1, 2000', then the 5 recurring events around it are:

   N    Date
   -2   Jan 1 2000
   -1   Feb 1 2000
    0   Mar 1 2000
   +1   Apr 1 2000
   +2   May 1 2000

In some cases, the Nth date may not be defined. For example, if the frequency is:

   the 31st of every month

and the base date is Mar 31, 2000, the 5 recurring events around it are:

   N   Date
   -2  Jan 31 2000
   -1  undefined
    0  Mar 31 2000
    1  undefined
    2  May 31 2000

As mentioned above, the base date is used to determine one of the occurrences of the recurring event... but it may not actually be on of those events.

As an example, for the recurring event:

   every other Friday

a base date could be on a Friday, but it would also be possible to have a base date on some other day of the week, and it could unambiguously refer simply to a week, and the recurring event would occur on Friday of that week.

In most cases, it won't be necessary to treat base dates with that level of complexity, but with complicated recurring events, it may be necessary. More information on how Date::Manip determines a recurring event from a base date is given below in the section ``BASE DATES''.

Range
A date range is simply a starting and an ending date. When a range is used (primarily in the dates method as described below), only recurring events (with all modifiers applied) which happened on or after the start date and on or before the end date are used.

For example, if the frequency was

   the first of every month

and the start/end dates were Jan 1 2000 and May 31 2000, the list of dates referred to would be:

   Jan 1 2000
   Feb 1 2000
   Mar 1 2000
   Apr 1 2000
   May 1 2000

If no base date is specified, but a date range is specified, the start date is used as the specified base date.

It should be noted that if both the range and base date are specified, the range is not used to determine a base date. Also, the first time the recurring event occurs in this range may NOT be the 0th occurrence with respect to the base date, and that is allowed.

NOTE: both dates in the range and the base date must all be in the same time zone, and use the same Date::Manip::Base object.

An alternate definition of the range may also be used to specify that the recurring events based only on the interval and BEFORE any modifiers are applied fall in the range.

This definition is described in more detail below.

FREQUENCY NOTATION

The syntax for specifying a frequency requires some explanation. It is very concise, but contains the flexibility to express every single type of recurring event I could think of.

The syntax of the frequency description is a colon separated list of the format Y:M:W:D:H:MN:S (which stand for year, month, week, etc.). One (and only one) of the colons may optionally be replaced by an asterisk, or an asterisk may be prepended to the string. For example, the following are all valid frequency descriptions:

  1:2:3:4:5:6:7
  1:2*3:4:5:6:7
 *1:2:3:4:5:6:7

But the following are NOT valid because they contain more than one asterisk:

  1:2*3:4:5*6:7
 *1:2:3:4:5:6*7

When an asterisk is included, the portion to the left of it is called the interval, and refers to an approximate time interval between recurring events. For example, if the interval of the frequency is:

  1:2*

it means that the recurring event occurs approximately every 1 year and 2 months. The interval is approximate because elements to the right of the asterisk, as well as any modifiers included in the recurrence, will affect when the events actually occur.

If no asterisks are included, then the entire recurrence is an interval. For example,

  0:0:0:1:12:0:0

refers to an event that occurs every 1 day, 12 hours.

The portion of the frequency that occur after an asterisk is called the recurrence time (or rtime), and refers to a specific value (or values) for that type of time element (i.e. exactly as it would appear on a calendar or a clock). For example, if the frequency ends with the rtime:

  *12:0:0

then the recurring event occurs at 12:00:00 (noon).

For example:

  0:0:0:2*12:30:0      every 2 days at 12:30 (each day)

Elements in the rtime can be listed as single values, ranges (2 numbers separated by a dash ``-''), or a comma separated list of values or ranges. In some cases, negative values are appropriate for the week or day values. -1 stands for the last possible value, -2 for the second to the last, etc.

If multiple values are included in more than one field in the rtime, every possible combination will be used. For example, if the frequency ends with the rtime:

  *12-13:0,30:0

the event will occur at 12:00, 12:30, 13:00, and 13:30.

Some examples are:

  0:0:0:1*2,4,6:0:0    every day at at 02:00, 04:00, and 06:00
  0:0:0:2*12-13:0,30:0 every other day at 12:00, 12:30, 13:00,
                       and 13:30
  0:1:0*-1:0:0:0       the last day of every month
  *1990-1995:12:0:1:0:0:0
                       Dec 1 in 1990 through 1995

There is no way to express the following with a single recurrence:

  every day at 12:30 and 1:00

You have to use two recurrences to do this.

You can include negative numbers in ranges. For example, including the range -2--1 means to go from the 2nd to the last to the last occurrence. Negative values are only supported in the week and day fields, and only in some cases.

You can even use a range like 2--2 (which means to go from the 2nd to the 2nd to the last occurrence). However, this is STRONGLY discouraged since this leads to a date which produces a variable number of events. As a result, the only way to determine the Nth date is to calculate every date starting at the base date. If you know that every date produces exactly 4 recurring events, you can calculate the Nth date without needing to determine every intermediate date.

When specifying a range, the first value must be less than the second or else nothing will be returned.

When both the week and day elements are non-zero and the day is right of the asterisk, the day refers to the day of week. The following examples illustrate these type of frequencies:

  0:1*4:2:0:0:0        4th Tuesday (day 2) of every month
  0:1*-1:2:0:0:0       last Tuesday of every month
  0:0:3*2:0:0:0        every 3rd Tuesday (every 3 weeks
                       on 2nd day of week)
  1:0*12:2:0:0:0       the 12th Tuesday of each year

NOTE: The day of week refers to the numeric value of each day as specified by ISO 8601. In other words, day 1 is ALWAY Monday, day 7 is ALWAYS Sunday, etc., regardless of what day of the week the week is defined to begin on (using the FirstDay config variable). So when the day field refers to the day of week, it's value (or values if a range or comma separated list are used) must be 1-7.

When the week element is zero and the month element is non-zero and the day element is right of the asterisk, the day value is the day of the month (it can be from 1 to 31 or -1 to -31 counting from the end of the month).

  3*1:0:2:12:0:0       every 3 years on Jan 2 at noon
  0:1*0:2:12,14:0:0    2nd of every month at 12:00 and 14:00
  0:1:0*-2:0:0:0       2nd to last day of every month

NOTE: If the day given refers to the 29th, 30th, or 31st, in a month that does not have that number of days, it is ignored. For example, if you ask for the 31st of every month, it will return dates in Jan, Mar, May, Jul, etc. Months with fewer than 31 days will be ignored.

If both the month and week elements are zero, and the year element is non-zero, the day value is the day of the year (1 to 365 or 366 --- or the negative numbers to count backwards from the end of the year).

  1:0:0*45:0:0:0       45th day of every year

Specifying a day that doesn't occur in that year silently ignores that year. The only result of this is that specifying +366 or -366 will ignore all years except leap years.

If the week element is non-zero and to the right of the asterisk, and the day element is zero, the frequency refers to the first day of the given week of the month or week of the year:

  0:1*2:0:0:0:0        the first day of the 2nd week of
                       every month
  1:0*2:0:0:0:0        the first day of the 2nd week of
                       every year

Although the meaning of almost every recurrence can be deduced by the above rules, a set of tables describing every possible combination of Y/M/W/D meanings, and giving an example of each is included below in the section ``LIST OF Y/M/W/D FREQUENCY DEFINITIONS''. It also explains a small number of special cases.

NOTE: If all fields left of the asterisk are zero, the last one is implied to be 1. In other words, the following are equivalent:

   0:0:0*x:x:x:x
   0:0:1*x:x:x:x

and can be thought of as every possible occurrence of the rtime.

NOTE: When applying a frequency to get a list of dates on which a recurring event occurs, a delta is created from the frequency which is applied to get dates referred to by the interval. These are then operated on by the rtime and by modifiers to actually get the recurring events. The deltas will always be exact or approximate. There is no support for business mode recurrences. However, with the careful use of modifiers (discussed below), most recurring business events can be determined too.

BASE DATES

A recurrence of the form *Y:M:W:D:H:MN:S (which is technically speaking not a recurring event... it is just a date or dates specified using the frequency syntax) uses the first date which matches the frequency as the base date. Any base date specified will be completely ignored. A date range may be specified to work with a subset of the dates.

All other recurrences use a specified base date in order to determine when the 0th occurrence of a recurring event happens. As mentioned above, the specified base date may be determined from the start date, or specified explicitly.

The specified base date is used to provide the bare minimum information. For example, the recurrence:

   0:0:3*4:0:0:0       every 3 weeks on Thursday

requires a base date to determine the week, but nothing else. Using the standard definition (Monday-Sunday) for a week, and given that one week in August 2009 is Aug 10 to Aug 16, any date in the range Aug 10 to Aug 16 will give the same results. The definition of the week defaults to Monday-Sunday, but may be modified using the FirstDay config variable.

Likewise, the recurrence:

  1:3*0:4:0:0:0        every 1 year, 3 months on the 4th
                       day of the month

would only use the year and month of the base date, so all dates in a given month would give the same set of recurring dates.

It should also be noted that a date may actually produce multiple recurring events. For example, the recurrence:

   0:0:2*4:12,14:0:0   every 2 weeks on Thursday at 12:00
                       and 14:00

produces 2 events for every date. So in this case, the base date produces the 0th and 1st event, the base date + an offset produces the 2nd and 3rd events, etc.

It must be noted that the base date refers ONLY to the interval part of the recurrence. The rtime and modifiers are NOT used in determining the base date.

INTERVAL

The interval of a frequency (everything left of the asterisk) will be used to generate a list of dates (called interval dates). When rtime values and modifiers are applied to an interval date, it produces the actual recurring events.

As already noted, if the rtime values include multiple values for any field, more than one event are produced by a single interval date.

It is important to understand is how the interval dates are calculated. The interval is trivially turned into a delta. For example, with the frequency 0:0:2*4:12:0:0, the interval is 0:0:2 which produces the delta 0:0:2:0:0:0:0.

In order to get the Nth interval date, the delta is multiplied by N and added to the base date. In other words:

   D(0) = Jan 31
   D(1) = Jan 31 + 1 month = Feb 28
   D(2) = Jan 31 + 2 month = Mar 31

DATE RANGE

The start and end dates form the range in which recurring events can fall into.

Every recurring date will fall in the limit:

   start <= date <= end

When a recurrence is created, it may include a default range, and this is handled by the RecurRange config variable.

By default, the date range applies to the final dates once all modifiers have been applied.

This behavior can be changed by applying the range to the unmodified dates.

An example of how this applies might be in defining New Year's Day (observed). The most useful definition of this would be:

   1*1:0:1:0:0:0*DWD

which means Jan 1 modified to the nearest working day.

But if you wanted to find New Year's for 2005 using this definition by passing in a start date of 2005-01-01-00:00:00 and an end date of 2005-12-31-23:59:59, you won't find anything because New Year's day will actually be observed on 2004-12-31 (since Jan 1 is a Saturday).

To get around this, you can pass in a non-zero parameter with the recurrence which means that this range will be applied to the unmodified dates.

In effect, this discards the modifier (DWD), gets the dates that fall in the range, and for all that fall in the range, the modifiers are applied.

So:

   1*1:0:1:0:0:0*DWD**2005-01-01-00:00:00*2005-12-31-23:59:59

will return no dates, but:

   1*1:0:1:0:0:0*DWD**2005-01-01-00:00:00*2005-12-31-23:59:59*1

will return:

   2004-12-31-00:00:00

OTHER FREQUENCY FORMATS

There are a small handful of English strings (or the equivalent in other languages) which can be parsed in place of a numerical frequency. These include:

  every Tuesday in June [1997]
  2nd Tuesday in June [1997]
  last Tuesday in June [1997]
  every Tuesday of every month [in 1997]
  2nd Tuesday of every month [in 1997]
  last Tuesday of every month [in 1997]
  every day of every month [in 1997]
  2nd day of every month [in 1997]
  last day of every month [in 1997]
  every day [in 1997]
  every 2nd day [in 1977]
  every 2 days [in 1977]

Each of these set the frequency. If the year is include in the string, it also sets the dates in the range to be the first and last day of the year.

In each of these, the numerical part (i.e. 2nd in all of the examples above) can be any number from 1 to 31. To make a frequency with a larger number than that, you have to use the standard format discussed above.

Due to the complexity of writing out (and parsing) frequencies written out, I do not intend to add additional frequency formats, and the use of these is discouraged. The frequency format described above is preferred.

MODIFIERS

Any number of modifiers may be added to a frequency to get the actual date of a recurring event. Modifiers are case sensitive.
Modifiers to set the day-of-week
The following modifiers can be used to adjust a date to a specific day of the week.

  PDn   Means the previous day n not counting today
  PTn   Means the previous day n counting today
  NDn   Means the next day n not counting today
  NTn   Means the next day n counting today
  WDn   Day n (1-7) of the current week

In each of these, 'n' is 1-7 (1 being Sunday, 7 being Saturday).

For example, PD2/ND2 returns the previous/next Tuesday. If the date that this is applied to is Tuesday, it modifies it to one week in the past/future.

PT2/NT2 are similar, but will leave the date unmodified if it is a Tuesday.

Modifiers to move forward/backward a number of days
These modifiers can be used to add/subtract n days to a date.

  FDn   Means step forward n days.
  BDn   Means step backward n days.
Modifiers to force events to be on business days
Modifiers can also be used to force recurring events to occur on business days. These modifiers include:

  FWn   Means step forward n workdays.
  BWn   Means step backward n workdays.
  CWD   The closest work day (using the TomorrowFirst
        config variable).
  CWN   The closest work day (looking forward first).
  CWP   The closest work day (looking backward first).
  NWD   The next work day counting today
  PWD   The previous work day counting today
  DWD   The closest work day (using the TomorrowFirst config
        variable) counting today
  IBD   This discards the date if it is not a business day.
  NBD   This discards the date if it IS a business day.
  IWn   This discards the date if it is not the n'th day
        of the week (n=1-7, 1 is Monday)
  NWn   This discards the date if it IS the n'th day of the week

The CWD, CWN, and CWP modifiers will always change the date to the closest working day NOT counting the current date.

The NWD, PWD, and DWD modifiers always change the date to the closest working day unless the current date is a work day. In that case, it is left unmodified.

CWD, CWN, and CWP will usually return the same value, but if you are starting at the middle day of a 3-day weekend (for example), it will return either the first work day of the following week, or the last work day of the previous week depending on whether it looks forward or backward first.

All business day modifiers ignore the time, so if a date is initially calculated at Saturday at noon, and the FW1 is applied, the date is initially moved to the following Monday (assuming it is a work day) and the FW1 moves it to Tuesday. The final result will be Tuesday at noon.

The IBD, NBD, IWn, and NWn modifiers eliminate dates from the list immediately. In other words, if a recurrence has three modifiers:

  FD1,IBD,FD1

then as a date is being tested, first the FD1 modifier is applied. Then, it is tested to see if it is a business day. If it is, the second FD1 modifier will be applied. Otherwise, the date will not be included in the list of recurring events.

Special modifiers
The following modifiers do things that cannot be expressed using any other combination of frequency and modifiers:

  EASTER   Set the date to Easter for this year.

DETERMINING DATES

In order to get a list of dates referred to by the recurrence, the following steps are taken.
The recurrence is tested for errors
The recurrence must be completely specified with a base date (either supplied explicitly, or derived from a start date) and date range when necessary. All dates must be valid.
The actual base date is determined
Using information from the interval and the specified base date, the actual base date is determined.
The Nth date is calculated
By applying the delta that corresponds to the interval, and then applying rtime and modifier information, the Nth date is determined.

This is repeated until all desired dates have been obtained.

The nth method described below has more details.

The range is tested
Any date that fall outside the range is discarded.

NOTE: when the recurrence contains no interval, it is not necessary to specify the range, and if it is not specified, all of the dates are used. The range MAY be specified to return only a subset of the dates if desired.

LIST OF Y/M/W/D FREQUENCY DEFINITIONS

Because the week and day values may have multiple meanings depending on where the asterisk is, and which of the fields have non-zero values, a list of every possible combination is included here (though most can be determined using the rules above).

When the asterisk occurs before the day element, and the day element is non-zero, the day element can take on multiple meanings depending on where the asterisk occurs, and which leading elements (year, month, week) have non-zero values. It can refer to the day of the week, day of the month, or day of the year.

When the asterisk occurs before the week element, the week element of the frequency can also take on multiple meanings as well. When the month field and day fields are zero, it refers to the week of the year. Since the week of the year is well defined in the ISO 8601 spec, there is no ambiguity.

When the month field is zero, but the day field is not, the week field refers to the nth occurrence of the day of week referred to by the day field in the year.

When the month field is non-zero, the week field refers to the nth occurrence of the day of week in the month.

In the tables below only the first 4 elements of the frequency are shown. The actual frequency will include the hour, minute, and second elements in addition to the ones shown.

When all elements left of the asterisk are 0, the interval is such that it occurs the maximum times possible (without changing the type of elements to the right of the asterisk). Another way of looking at it is that the last 0 element of the interval is changed to 1. So, the interval:

  0:0*3:0

is equivalent to

  0:1*3:0

When the year field is zero, and is right of the asterisk, it means the current year.

All elements left of the asterisk
When all of the month, week, and day elements are left of the asterisk, the simple definitions of the frequency are used:

  frequency     meaning
  1:2:3:4       every 1 year, 2 months, 3 weeks,
                4 days

Any, or all of the fields can be zero.

Non-zero day, non-zero week
When both the day and week elements are non-zero, the day element always refers to the day of week. Values must be in the range (1 to 7) and no negative values are allowed.

The following tables shows all possible variations of the frequency where this can happen (where day 4 = Thursday).

When the week is left of the asterisk, the interval is used to get the weeks on the calendar containing a recurring date, and the day is used to set the day of the week. The following are possible:

  frequency     meaning
  1:2:3*4       every 1 year, 2 months, 3 weeks
                on Thur
  1:0:3*4       every 1 year, 3 weeks on Thur
  0:2:3*4       every 2 months, 3 weeks on Thur
  0:0:3*4       every 3 weeks on Thur

When the week is right of the asterisk, and a non-zero month is left of the asterisk, the recurrence refers to a specific occurrence of a day-of-week during a month. The following are possible:

  frequency     meaning
  1:2*3:4       every 1 year, 2 months on the
                3rd Thursday of the month
  0:2*3:4       every 2 months on the 3rd Thur
                of the month

When the week and month are both non-zero and right of the asterisk, the recurrence refers to an occurrence of day-of-week during the given month. Possibilities are:

  frequency     meaning
  1*2:3:4       every 1 year in February on
                the 3rd Thur
  0*2:3:4       same as 1*2:3:4
 *1:2:3:4       in Feb 0001 on the 3rd Thur
                of the month
 *0:2:3:4       on the 3rd Thur of Feb in the
                current year

When the week is right of the asterisk, and the month is zero, the recurrence refers to an occurrence of the day-of-week during the year. The following are possible:

  frequency     meaning
  1:0*3:4       every 1 year on the 3rd Thursday
  1*0:3:4       of the year
 *1:0:3:4       in 0001 on the 3rd Thur of
                the year
  0*0:3:4       same as 1*0:3:4
 *0:0:3:4       on the 3rd Thur of the current
                year

There is one special case:

  frequency     meaning
  0:0*3:4       same as 0:1*3:4 (every month on
                the 3rd Thur of the month)
Non-zero day, non-zero month
When a non-zero day element occurs to the right of the asterisk and the week element is zero, but the month element is non-zero, the day elements always refers to a the day of month in the range (1 to 31) or (-1 to -31).

The following table shows all possible variations of the frequency where this can happen:

  frequency     meaning
  1:2:0*4       every 1 year, 2 months on the
  1:2*0:4       4th day of the month
  1*2:0:4       every year on Feb 4th
 *1:2:0:4       Feb 4th, 0001
  0:2:0*4       every 2 months on the 4th day
  0:2*0:4       of the month
  0*2:0:4       same as 1*2:0:4
 *0:2:0:4       Feb 4th of the current year
Zero day, non-zero week
When a day is zero, and the week is non-zero, the recurrence refers to a specific occurrence of the first day of the week (as given by the FirstDay variable).

The frequency can refer to an occurrence of FirstDay in a specific week (if the week is left of the asterisk):

  frequency     meaning
  1:2:3*0       every 1 year, 2 months, 3 weeks on
                FirstDay
  1:0:3*0       every 1 year, 3 weeks on FirstDay
  0:2:3*0       every 2 months, 3 weeks on FirstDay
  0:0:3*0       every 3 weeks on FirstDay

or to a week in the year (if the week is right of the asterisk, and the month is zero):

  frequency     meaning
  1:0*3:0       every 1 year on the first day of the
  1*0:3:0       3rd week of the year
 *1:0:3:0       the first day of the 3rd week of 0001

or to an occurrence of FirstDay in a month (if the week is right of the asterisk and month is non-zero):

  frequency     meaning
  1:2*3:0       every 1 year, 2 months on the 3rd
                occurrence of FirstDay
  0:2*3:0       every 2 months on the 3rd occurrence
                of FirstDay
  1*2:3:0       every year on the 3rd occurrence
                of FirstDay in Feb
  0*2:3:0       same as 1*2:3:0
 *1:2:3:0       the 3rd occurrence of FirstDay
                Feb 0001
 *0:2:3:0       the 3rd occurrence of FirstDay
                in Feb of the current year

NOTE: in the last group, a slightly more intuitive definition of these would have been to say that the week field refers to the week of the month, but given the ISO 8601 manner of defining when weeks start, this definition would have virtually no practical application. So the definition of the week field referring to the Nth occurrence of FirstDay in a month was used instead.

There are a few special cases here:

  frequency     meaning
  0:0*3:0       same as 0:1*3:0   (every month on the 3rd
                occurrence of the first day of week)
  0*0:3:0       same as 1*0:3:0
 *0:0:3:0       the first day of the 3rd week of the
                current year
Non-zero day
When a non-zero day element occurs and both the month and week elements are zero, the day elements always refers to a the day of year (1 to 366 or -1 to -366 to count from the end).

The following table shows all possible variations of the frequency where this can happen:

  frequency     meaning
  1:0:0*4       every year on the 4th day of
  1:0*0:4       the year
  1*0:0:4
 *1:0:0:4       the 4th day of 0001

Other non-zero day variations have multiple meanings for the day element:

  frequency     meaning
  0:0:0*4       same as 0:0:1*4  (every week on Thur)
  0:0*0:4       same as 0:1*0:4  (every month on the 4th)
  0*0:0:4       same as 1*0:0:4
 *0:0:0:4       the 4th day of the current year
All other variations
The remaining variations have zero values for both week and day. They are:

  frequency     meaning
  1:2:0*0       every 1 year, 2 months on the first
  1:2*0:0       day of the month
  1*2:0:0       every year on Feb 1
 *1:2:0:0       Feb 1, 0001
  1:0:0*0       every 1 year on Jan 1
  1:0*0:0
  1*0:0:0
 *1:0:0:0       Jan 1, 0001
  0:2:0*0       every 2 months on the first day of
  0:2*0:0       the month
  0*2:0:0       same as 1*2:0:0
 *0:2:0:0       Feb 1 of the current year
  0:0:0*0       same as 0:0:1*0 (every week on
                the first day of the week)
  0:0*0:0       same as 0:1*0:0 (every month
                on the 1st)
  0*0:0:0       same as 1*0:0:0
 *0:0:0:0       Jan 1 of the current year

METHODS

new
new_config
new_date
new_delta
new_recur
base
tz
is_date
is_delta
is_recur
config
err
Please refer to the Date::Manip::Obj documentation for these methods.
parse
   $err = $recur->parse($string [,$modifiers] [,$base,$start,$end,$unmod]);

This creates a new recurrence. A string containing a valid frequency is required. In addition, $start, $end, and $base dates can be passed in (either as Date::Manip::Date objects, or as strings containing dates that can be parsed), and any number of the modifiers listed above.

If the $start or $end dates are not included, they may be supplied automatically, based on the value of the RecurRange variable. If any of the dates are passed in, they must be included in the order given (though it is safe to pass an empty string or undef in for any of them if you only want to set some, but not all of them). If $unmod is true, the range will apply to unmodified dates rather than the modified dates.

The $modifiers argument must contain valid modifiers, or be left out of the argument list entirely. You cannot pass an empty string or undef in for it.

   $err = $recur->parse($string);

This creates a recurrence from a string which contains all of the necessary elements of the recurrence. The string is of the format:

   FREQ*MODIFIERS*BASE*START*END*UNMOD

where FREQ is a string containing a frequency, MODIFIERS is a string containing a comma separated list of modifiers, BASE, START, and END are strings containing parseable dates.

All pieces are optional, but order must be maintained, so all of the following are valid:

   FREQ*MODIFIERS
   FREQ**BASE
   FREQ**BASE*START*END
   FREQ***START*END*UNMOD

If a part of the recurrence is passed in both as part of $string and as an argument, the argument overrides the string portion, with the possible exception of modifiers. The modifiers in the argument override the string version unless the first one is a '+' in which case they are appended. See the modifiers method below for more information.

frequency
start
end
basedate
modifiers
You can also create a recurrency in steps (or replace parts of an existing recurrence) using the following:

   $err = $recur->frequency($frequency);
   $err = $recur->start($start);
   $err = $recur->start($start,$unmod);
   $err = $recur->end($end);
   $err = $recur->basedate($base);
   $err = $recur->modifiers($modifiers);
   $err = $recur->modifiers(@modifiers);

These set the appropriate part of the recurrence.

Calling the frequency method discards all information currently stored in the Recur object (including an existing start, end, and base date), so this method should be called first.

In the modifiers method, the modifiers can be passed in as a string containing a comma separated list of modifiers, or as a list of modifiers. The modifiers passed in override all previously set modifiers UNLESS the first one is the string ``+'', in which case the new modifiers are appended to the list.

In the start, end, and base methods, the date passed in can be a Date::Manip::Date object, or a string that can be parsed to get a date. If $unmod is true, it will mean that the range will apply to unmodified dates.

NOTE: the parse method will overwrite all parts of the recurrence, so it is not appropriate to do:

   $recur->modifiers($modifiers);
   $recur->parse($string);

The modifiers passed in in the first call will be overwritten.

These functions can also be used to look up the values.

   $freq  = $recur->frequency();
   $start = $recur->start();
   $end   = $recur->end();
   @mods  = $recur->modifiers();
   ($base,$actual) = $recur->basedate();

The basedate function will return both the specified base and the actual base dates.

If any of the values are not yet determined, nothing will be returned.

dates
   @dates = $recur->dates([$start,$end,$unmod]);

Returns the list of dates defined by the full recurrence. If there is an error, or if there are no dates, an empty list will be returned.

$start and $end are either "undef," or dates which can be used to limit the set of dates passed back (they can be Date::Manip::Date objects or strings that can be parsed).

If the recurrence does not have a start and end date already, passing in $start and $end will set the range (but they will NOT be stored in the recurrence).

If the recurrence does have a start and end date stored in it, the $start and $end arguments can be used to temporarily override the limits. For example, if a recurrence has a start date of Jan 1, 2006 00:00:00 and and end date of Dec 31, 2006 23:59:59 stored in the recurrence, passing in $start of Jul 1, 2006 00:00:00 will limit the dates returned to the range of Jul 1 to Dec 31.

Passing in a start date of Jul 1, 2007 will mean that no dates are returned since the recurrence limits the date to be in 2006.

If one or both of $start and $end are "undef", then the stored values will be used.

nth
   ($date,$err) = $recur->nth($n);

This returns the $nth recurring event ($n may be any integer). If an error occurs, it is returned (but it is not set in $recur since it may be properly, though perhaps incompletely, defined). The following errors may be returned:

   Invalid recurrence
      The recurrence has an error flag set.
   Incomplete recurrence
      The recurrence is incomplete. It needs either a
      base date or a date range.
   Range invalid
      The recurrence has an invalid date range (i.e.
      the end date occurs before the start date).
   Start invalid
   End invalid
   Base invalid
      An invalid date was entered for one of the dates.

There are a few special circumstances to be aware of.

1) If the recurrence contains no interval (i.e. is of the form *Y:M:W:D:H:MN:S), the dates come directly from the rtime values. In this case, the 0th event is the first date in the list of dates specified by the rtime. As such, $n must be a positive integer. If $n is negative, or outside the range of dates specified, the returned date will be "undef" (but this is not an error).

2) A very small number of recurrences have an unknown number of recurring events associated with each date. This only happens if one of the values in the rtime is specified as a range including both a positive and negative index. For example, if the day field in an rtime refers to the day of month, and is 15--15 (i.e. the 15th day to the 15th to the last day), this may include 3 events (on a month with 31 days), 2 event (months with 30 days), 1 event (months with 29 days), or 0 events (months with 28 days). As such, in order to calculate the Nth date, you have to start with the 0th (i.e. base) date and calculate every event until you get the Nth one. For this reason, it is highly recommended that this type of frequency be avoided as it will be quite slow.

3) Most recurrences have a known number of events (equal to the number of combinations of values in the rtime) for each date. For these, calculating the Nth date is much faster. However, in this case, some of them may refer to an invalid date. For example, if the frequency is 'the 31st of every month' and the base (0th) date is Jan 31, the 1st event would refer to Feb 31. Since that isn't valid, "undef" would be returned for "$n=1." Obviously, it would be possible to actually determine the Nth valid event by calculating all N-1 dates, but in the interest of performance, this is not done.

4) The way the Nth recurring event is calculated differs slightly for NE>0 and N<0 if the delta referred to by the frequency is approximate. To calculate the Nth recurring event (where N>0), you take the base date and add N*DELTA (where DELTA is the delta determined by the frequency). To get the Nth recurring event (where N<0), a date is determine which, if N*DELTA were added to it, would produce the base date. For more details, refer to the Date::Manip::Calc document. In the ``SUBTRACTION'' in Date::Manip::Calc section in the discussion of approximate date-delta calculations, calculations are done with $subtract = 2.

next
prev
   ($date,$err) = $recur->next();
   ($date,$err) = $recur->prev();

These return the next/previous recurring event.

The first time next/prev is called, one of the recurring events will be selected and returned (using the rules discussed below). Subsequent calls to next/prev will return the next or previous event.

Unlike the nth method which will return a specific event (or undef if the Nth even is not defined), the next and prev methods will only work with defined events.

So, for the recurrence:

   the 31st of every month

next might return the following sequence of events:

   Jan 31 2000
   Mar 31 2000
   May 31 2000

The rules for determining what event to return the first time one of these is called are as follows:

1) If there is a range, next will return the first event that occurs after the start of the range. prev will return the last event that occurs before the end of the range.

2) If there is no range, next will return the first event on or after the base date. prev will return the last event before the base date.

The error codes are the same as for the nth method.

HISTORY OF THE FREQUENCY NOTATION

I realize that the frequency notation described above looks quite complicated at first glance, but it is (IMO) the best notation for expressing recurring events in existence. I actually consider it the single most important contribution to date/time handling in Date::Manip.

When I first decided to add recurring events to Date::Manip, I first came up with a list of common ways of specifying recurring events, and then went looking for a notation that could be used to define them. I was hoping for a notation that would be similar to cron notation, but more powerful.

After looking in several specifications (including ISO 8601) and after a discussion on a mailing list of calendar related topics, it appeared that there was no concise, flexible notation for handling recurring events that would handle all of the common forms I'd come up with.

So, as a matter of necessity, I set about inventing my own notation. As I was looking at my list, it struck me that all of the parts which specified a frequency were higher level (i.e. referred to a larger unit of time) than those parts which specified a specific value (what I've called the rtime). In other words, when the terms were laid out from year down to seconds, the frequency part was always left of specific values.

That led immediately to the notation described above, so I started analyzing it to figure out if it could express all of the recurring events I'd come up with. It succeeded on 100% of them. Not only that, but by playing with different values (especially different combinations of m/w/d values), I found that it would define recurring events that I hadn't even thought of, but which seemed perfectly reasonable in hindsight.

After a very short period, I realized just how powerful this notation was, and set about implementing it, and as I said above, of all the contributions that Date::Manip has made, I consider this to be the most important.

KNOWN BUGS

If you specify a recurrence which cannot be satisfied for the base date, or for any time after the base date, the recurrence will crash. This can only happen if you specify a recurrence that always occurs in the spring DST transition using the current timezone rules.

For example, in a US timezone, the current timezone rules state that a DST transition occurs at 02:00:00 on the 2nd Sunday in March and the clock jumps to 03:00. This started in 2006. As a result, the recurrence

   1*3:2:7:2:0:0

with a base date of 2006 or later cannot be satisfied.

BUGS AND QUESTIONS

Please refer to the Date::Manip::Problems documentation for information on submitting bug reports or questions to the author.

LICENSE

This script is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

AUTHOR

Sullivan Beck ([email protected])