<!DOCTYPE html
PUBLIC "-//W3C//DTD HTML 4.01//EN"
"http://www.w3.org/TR/html4/strict.dtd">
<html>
<head>
<title>How to Read the tz Database</title>
<meta http-equiv="Content-type" content='text/html; charset="UTF-8"'>
<style type="text/css">
pre {margin-left: 2em; white-space: pre-wrap;}
</style>
</head>
<body>
<h2>How to Read the <a href="https://en.wikipedia.org/wiki/Tz_database">tz
Database</a> Source Files</h2>
<h3>by Bill Seymour</h3>
<p>This page uses the <code>America/Chicago</code> and
<code>Pacific/Honolulu</code> zones as examples of how to infer
times of day from the <a href="tz-link.html">tz database</a>
source files. It might be helpful, but not absolutely necessary,
for the reader to have already downloaded the
latest release of the database and become familiar with the basic layout
of the data files. The format is explained in the “man
page” for the zic compiler, <code>zic.8.txt</code>, in
the <code>code</code> subdirectory.</p>
<p>We’ll begin by talking about the rules for changing between standard
and daylight saving time since we’ll need that information when we talk
about the zones.</p>
<p>First, let’s consider the special daylight saving time rules
for Chicago (from the <code>northamerica</code> file in
the <code>data</code> subdirectory):</p>
<table border="1">
<tr>
<th colspan="6">From the Source File</th>
</tr>
<tr>
<td colspan="6" align="center"><table><tr><td>
<pre>
#Rule NAME FROM TO TYPE IN ON AT SAVE LETTER
Rule Chicago 1920 only - Jun 13 2:00 1:00 D
Rule Chicago 1920 1921 - Oct lastSun 2:00 0 S
Rule Chicago 1921 only - Mar lastSun 2:00 1:00 D
Rule Chicago 1922 1966 - Apr lastSun 2:00 1:00 D
Rule Chicago 1922 1954 - Sep lastSun 2:00 0 S
Rule Chicago 1955 1966 - Oct lastSun 2:00 0 S
</pre>
</td></tr></table></td>
</tr>
<tr>
<th colspan="6">Reformatted a Bit</th>
</tr>
<tr>
<th>From</th>
<th>To</th>
<th colspan="2">On</th>
<th>At</th>
<th>Action</th>
</tr>
<tr align="center">
<td colspan="2">1920 only</td>
<td colspan="2">June 13<small><sup>th</sup></small></td>
<td rowspan="6">02:00 local</td>
<td>go to daylight saving time</td>
</tr>
<tr align="center">
<td>1920</td>
<td>1921</td>
<td rowspan="5">last Sunday</td>
<td>in October</td>
<td>return to standard time</td>
</tr>
<tr align="center">
<td colspan="2">1921 only</td>
<td>in March</td>
<td rowspan="2">go to daylight saving time</td>
</tr>
<tr align="center">
<td rowspan="2">1922</td>
<td>1966</td>
<td>in April</td>
</tr>
<tr align="center">
<td>1954</td>
<td>in September</td>
<td rowspan="2">return to standard time</td>
</tr>
<tr align="center">
<td>1955</td>
<td>1966</td>
<td>in October</td>
</tr>
</table>
<p>We’ll basically just ignore the <code>TYPE</code> column.
In the 2007j release, the most recent as of this writing, the
<code>TYPE</code> column never contains anything but a hyphen,
a kind of null value. (From the description in <code>zic.8.txt</code>,
this appears to be a mechanism for removing years from a set
in some localizable way. It’s used in the file, <code>pacificnew</code>,
to determine whether a given year will have a US presidential election;
but everything related to that use is commented out.)
<p>The <code>SAVE</code> column contains the local (wall clock) offset from
local standard time.
This is usually either zero for standard time or one hour for daylight
saving time; but there’s no reason, in principle, why it can’t
take on other values.
<p>The <code>LETTER</code> (sometimes called <code>LETTER/S</code>)
column can contain a variable
part of the usual abbreviation of the time zone’s name, or it can just
be a hyphen if there’s no variable part. For example, the abbreviation
used in the central time zone will be either “CST” or
“CDT”. The variable part is ‘S’ or ‘D’;
and, sure enough, that’s just what we find in
the <code>LETTER</code> column
in the <code>Chicago</code> rules. More about this when we talk about
“Zone” lines.
<p>One important thing to notice is that “Rule” lines
want at once to be both <i>transitions</i> and <i>steady states</i>:
<ul>
<li>On the one hand, they represent transitions between standard and
daylight saving time; and any number of Rule lines can be in effect
during a given period (which will always be a non-empty set of
contiguous calendar years).</li>
<li>On the other hand, the <code>SAVE</code> and <code>LETTER</code>
columns contain state that exists between transitions. More about this
when we talk about the US rules.</li>
</ul>
<p>In the example above, the transition to daylight saving time
happened on the 13<small><sup>th</sup></small> of June in 1920, and on
the last Sunday in March in 1921; but the return to standard time
happened on the last Sunday in October in both of those
years. Similarly, the rule for changing to daylight saving time was
the same from 1922 to 1966; but the rule for returning to standard
time changed in 1955. Got it?</p>
<p>OK, now for the somewhat more interesting “US” rules:</p>
<table border="1">
<tr>
<th colspan="6">From the Source File</th>
</tr>
<tr>
<td colspan="6" align="center"><table><tr><td>
<pre>
#Rule NAME FROM TO TYPE IN ON AT SAVE LETTER/S
Rule US 1918 1919 - Mar lastSun 2:00 1:00 D
Rule US 1918 1919 - Oct lastSun 2:00 0 S
Rule US 1942 only - Feb 9 2:00 1:00 W # War
Rule US 1945 only - Aug 14 23:00u 1:00 P # Peace
Rule US 1945 only - Sep 30 2:00 0 S
Rule US 1967 2006 - Oct lastSun 2:00 0 S
Rule US 1967 1973 - Apr lastSun 2:00 1:00 D
Rule US 1974 only - Jan 6 2:00 1:00 D
Rule US 1975 only - Feb 23 2:00 1:00 D
Rule US 1976 1986 - Apr lastSun 2:00 1:00 D
Rule US 1987 2006 - Apr Sun>=1 2:00 1:00 D
Rule US 2007 max - Mar Sun>=8 2:00 1:00 D
Rule US 2007 max - Nov Sun>=1 2:00 0 S
</pre>
</td></tr></table></td>
</tr>
<tr>
<th colspan="6">Reformatted a Bit</th>
</tr>
<tr>
<th>From</th>
<th>To</th>
<th colspan="2">On</th>
<th>At</th>
<th>Action</th>
</tr>
<tr align="center">
<td rowspan="2">1918</td>
<td rowspan="2">1919</td>
<td rowspan="2">last Sunday</td>
<td>in March</td>
<td rowspan="3">02:00 local</td>
<td>go to daylight saving time</td>
</tr>
<tr align="center">
<td>in October</td>
<td>return to standard time</td>
</tr>
<tr align="center">
<td colspan="2">1942 only</td>
<td colspan="2">February 9<small><sup>th</sup></small></td>
<td>go to “war time”</td>
</tr>
<tr align="center">
<td colspan="2" rowspan="2">1945 only</td>
<td colspan="2">August 14<small><sup>th</sup></small></td>
<td>23:00 <a href="https://en.wikipedia.org/wiki/Universal_Time">UT</a></td>
<td>
rename “war time” to “peace<br>time;”
clocks don’t change
</td>
</tr>
<tr align="center">
<td colspan="2">September 30<small><sup>th</sup></small></td>
<td rowspan="9">02:00 local</td>
<td rowspan="2">return to standard time</td>
</tr>
<tr align="center">
<td rowspan="2">1967</td>
<td>2006</td>
<td rowspan="2">last Sunday</td>
<td>in October</td>
</tr>
<tr align="center">
<td>1973</td>
<td>in April</td>
<td rowspan="6">go to daylight saving time</td>
</tr>
<tr align="center">
<td colspan="2">1974 only</td>
<td colspan="2">January 6<small><sup>th</sup></small></td>
</tr>
<tr align="center">
<td colspan="2">1975 only</td>
<td colspan="2">February 23<small><sup>rd</sup></small></td>
</tr>
<tr align="center">
<td>1976</td>
<td>1986</td>
<td>last Sunday</td>
<td rowspan="2">in April</td>
</tr>
<tr align="center">
<td>1987</td>
<td>2006</td>
<td>first Sunday</td>
</tr>
<tr align="center">
<td rowspan="2">2007</td>
<td rowspan="2">present</td>
<td colspan="2">second Sunday in March</td>
</tr>
<tr align="center">
<td colspan="2">first Sunday in November</td>
<td>return to standard time</td>
</tr>
</table>
<p>There are two interesting things to note here.</p>
<p>First, the time that something happens (in the <code>AT</code>
column) is not necessarily the local (wall clock) time. The time can be
suffixed with ‘s’ (for “standard”) to mean
local standard time, different from local (wall clock) time when observing
daylight saving time; or it can be suffixed with ‘g’,
‘u’, or ‘z’, all three of which mean the
standard time at the
<a href="https://en.wikipedia.org/wiki/Prime_Meridian">prime meridian</a>.
‘g’ stands for “<a
href="https://en.wikipedia.org/wiki/Greenwich_Mean_Time">GMT</a>”;
‘u’ stands for “<a
href="https://en.wikipedia.org/wiki/Universal_Time">UT</a>” or “<a
href="https://en.wikipedia.org/wiki/Coordinated_Universal_Time">UTC</a>”
(whichever was official at the time); ‘z’ stands for the
<a href="https://en.wikipedia.org/wiki/Nautical_time">nautical time zone</a>
Z (a.k.a. “Zulu” which, in turn, stands for ‘Z’).
The time can also be suffixed with ‘w’ meaning local (wall
clock) time; but it usually isn’t because that’s the
default.</p>
<p>Second, the day in the <code>ON</code> column, in addition to
“<code>lastSun</code>” or a particular day of the month,
can have the form, “<code>Sun>=</code><i>x</i>” or
“<code>Sun<=</code><i>x</i>,” where <i>x</i> is a day
of the month. For example, “<code>Sun>=8</code>” means
“the first Sunday on or after the eighth of the month,” in
other words, the second Sunday of the month. Furthermore, although
there are no examples above, the weekday needn’t be
“<code>Sun</code>” in either form, but can be the usual
three-character English abbreviation for any day of the week.</p>
<p>And the US rules give us more examples of a couple of things
already mentioned:</p>
<ul>
<li>The rules for changing to and from daylight saving time are
actually <i>different sets</i> of rules; and the two sets can change
independently. Consider, for example, that the rule for the return to
standard time stayed the same from 1967 to 2006; but the rule for the
transition to daylight saving time changed several times in the same
period. There can also be periods, 1946 to 1966 for example, when no
rule from this group is in effect, and so either no transition
happened in those years, or some other rule is in effect (perhaps a
state or other more local rule).</li>
<li>The <code>SAVE</code> and <code>LETTER</code> columns
contain <i>steady state</i>, not transitions. Consider, for example,
the transition from “war time” to “peace time”
that happened on August 14, 1945. The “1:00” in
the <code>SAVE</code> column is <i>not</i> an instruction to advance
the clock an hour. It means that clocks should <i>be</i> one hour
ahead of standard time, which they already are because of the previous
rule, so there should be no change.</li>
</ul>
<p>OK, now let’s look at a Zone record:</p>
<table border="1">
<tr>
<th colspan="5">From the Source File</th>
</tr>
<tr>
<td colspan="6" align="center"><table><tr><td>
<pre>
#Zone NAME STDOFF RULES FORMAT [UNTIL]
Zone America/Chicago -5:50:36 - LMT 1883 Nov 18 12:09:24
-6:00 US C%sT 1920
-6:00 Chicago C%sT 1936 Mar 1 2:00
-5:00 - EST 1936 Nov 15 2:00
-6:00 Chicago C%sT 1942
-6:00 US C%sT 1946
-6:00 Chicago C%sT 1967
-6:00 US C%sT
</pre>
</td></tr></table></td>
</tr>
<tr>
<th colspan="5">Columns Renamed</th>
</tr>
<tr>
<th rowspan="2">Standard Offset<br>
from <a href="https://en.wikipedia.org/wiki/Prime_Meridian">Prime
Meridian</a></th>
<th rowspan="2">Daylight<br>Saving Time</th>
<th rowspan="2">Abbreviation(s)</th>
<th colspan="2">Ending at Local Time</th>
</tr>
<tr>
<th>Date</th>
<th>Time</th>
</tr>
<tr align="center">
<td>−5:50:36</td>
<td>not observed</td>
<td>LMT</td>
<td>1883-11-18</td>
<td>12:09:24</td>
</tr>
<tr align="center">
<td rowspan="2">−6:00:00</td>
<td>US rules</td>
<td rowspan="2">CST or CDT</td>
<td>1920-01-01</td>
<td>00:00:00</td>
</tr>
<tr align="center">
<td>Chicago rules</td>
<td>1936-03-01</td>
<td rowspan="2">02:00:00</td>
</tr>
<tr align="center">
<td>−5:00:00</td>
<td>not observed</td>
<td>EST</td>
<td>1936-11-15</td>
</tr>
<tr align="center">
<td rowspan="4">−6:00:00</td>
<td>Chicago rules</td>
<td>CST or CDT</td>
<td>1942-01-01</td>
<td rowspan="3">00:00:00</td>
</tr>
<tr align="center">
<td>US rules</td>
<td>CST, CWT or CPT</td>
<td>1946-01-01</td>
</tr>
<tr align="center">
<td>Chicago rules</td>
<td rowspan="2">CST or CDT</td>
<td>1967-01-01</td>
</tr>
<tr align="center">
<td>US rules</td>
<td colspan="2">—</td>
</tr>
</table>
<p>There are a couple of interesting differences between Zones and Rules.</p>
<p>First, and somewhat trivially, whereas Rules are considered to
contain one or more records, a Zone is considered to be a single
record with zero or more <i>continuation lines</i>. Thus, the keyword,
“<code>Zone</code>,” and the zone name are not
repeated. The last line is the one without anything in
the <code>[UNTIL]</code> column.</p>
<p>Second, and more fundamentally, each line of a Zone represents a
steady state, not a transition between states. The state exists from
the date and time in the previous line’s <code>[UNTIL]</code>
column up to the date and time in the current
line’s <code>[UNTIL]</code> column. In other words, the date and
time in the <code>[UNTIL]</code> column is the instant that separates
this state from the next. Where that would be ambiguous because
we’re setting our clocks back, the <code>[UNTIL]</code> column
specifies the first occurrence of the instant. The state specified by
the last line, the one without anything in the <code>[UNTIL]</code>
column, continues to the present.</p>
<p>The first line typically specifies the mean solar time observed
before the introduction of standard time. Since there’s no line before
that, it has no beginning. <code>8-) </code> For some places near the <a
href="https://en.wikipedia.org/wiki/International_Date_Line">International
Date Line</a>, the first <i>two</i> lines will show solar times
differing by 24 hours; this corresponds to a movement of the Date
Line. For example:</p>
<pre>
#Zone NAME STDOFF RULES FORMAT [UNTIL]
Zone America/Juneau 15:02:19 - LMT 1867 Oct 18
-8:57:41 - LMT ...
</pre>
<p>When Alaska was purchased from Russia in 1867, the Date Line moved
from the Alaska/Canada border to the Bering Strait; and the time in
Alaska was then 24 hours earlier than it had
been. <code><aside></code>(6 October in the Julian calendar,
which Russia was still using then for religious reasons, was followed
by <i>a second instance of the same day with a different name</i>, 18
October in the Gregorian calendar. Isn’t civil time
wonderful? <code>8-)</code>)<code></aside></code></p>
<p>The abbreviation, “LMT” stands for “local mean
time”, which is an invention of
the <a href="https://en.wikipedia.org/wiki/Tz_database">tz
database</a> and was probably never actually used during the
period. Furthermore, the value is almost certainly wrong except in the
archetypal place after which the zone is named. (The tz database
usually doesn’t provide a separate Zone record for places where
nothing significant happened after 1970.)</p>
<p>The <code>RULES</code> column tells us whether daylight saving time is being observed:
<ul>
<li>A hyphen, a kind of null value, means that we have not set our
clocks ahead of standard time.</li>
<li>An amount of time (usually but not necessarily “1:00”
meaning one hour) means that we have set our clocks ahead by that
amount.</li>
<li>Some alphabetic string means that we <i>might have</i> set our
clocks ahead; and we need to check the rule the name of which is the
given alphabetic string.</li>
</ul>
<p>An example of a specific amount of time is:</p>
<pre>
#Zone NAME STDOFF RULES FORMAT [UNTIL]
Zone Pacific/Honolulu ... 1933 Apr 30 2:00
-10:30 1:00 HDT 1933 May 21 12:00
...
</pre>
<p>Hawaii tried daylight saving time for three weeks in 1933 and
decided they didn’t like it. <code>8-) </code>Note that
the <code>STDOFF</code> column always contains the standard time
offset, so the local (wall clock) time during this period was GMT −
10:30 + 1:00 = GMT − 9:30.</p>
<p>The <code>FORMAT</code> column specifies the usual abbreviation of
the time zone name. It can have one of three forms:</p>
<ul>
<li>a string of three or more characters that are either ASCII alphanumerics,
“<code>+</code>”, or “<code>-</code>”,
in which case that’s the abbreviation</li>
<li>a pair of strings separated by a slash
(‘<code>/</code>’), in which case the first string is the
abbreviation for the standard time name and the second string is the
abbreviation for the daylight saving time name</li>
<li>a string containing “<code>%s</code>,” in which case
the “<code>%s</code>” will be replaced by the text in the
appropriate Rule’s <code>LETTER</code> column</li>
</ul>
<p>The last two make sense only if there’s a named rule in effect.</p>
<p>An example of a slash is:</p>
<pre>
#Zone NAME STDOFF RULES FORMAT [UNTIL]
Zone Europe/London ... 1996
0:00 EU GMT/BST
</pre>
<p>The current time in the UK is called either Greenwich mean time or
British summer time.</p>
<p>One wrinkle, not fully explained in <code>zic.8.txt</code>, is what
happens when switching to a named rule. To what values should
the <code>SAVE</code> and <code>LETTER</code> data be initialized?</p>
<ul>
<li>If at least one transition has happened, use
the <code>SAVE</code> and <code>LETTER</code> data from the most
recent.</li>
<li>If switching to a named rule before any transition has happened,
assume standard time (<code>SAVE</code> zero), and use
the <code>LETTER</code> data from the earliest transition with
a <code>SAVE</code> of zero.
</ul>
<p>And three last things about the <code>FORMAT</code> column:</p>
<ul>
<li>The <a href="https://en.wikipedia.org/wiki/Tz_database">tz
database</a> gives abbreviations for time zones in <i>popular
usage</i>, which is not necessarily “correct” by law. For
example, the last line in
<code>Zone</code> <code>Pacific/Honolulu</code> (shown below) gives
“HST” for “Hawaii standard time” even though the
<a href="https://www.law.cornell.edu/uscode/text/15/263">legal</a>
name for that time zone is “Hawaii-Aleutian standard time.”
This author has read that there are also some places in Australia where
popular time zone names differ from the legal ones.
<li>No attempt is made to <a
href="https://en.wikipedia.org/wiki/Internationalization_and_localization">localize</a>
the abbreviations. They are intended to be the values returned through the
<code>"%Z"</code> format specifier to
<a href="https://en.wikipedia.org/wiki/C_(programming_language)">C</a>’s
<a href="https://pubs.opengroup.org/onlinepubs/9699919799/functions/strftime.html"><code>strftime</code></a>
function in the
<a href="https://kirste.userpage.fu-berlin.de/chemnet/use/info/libc/libc_19.html#SEC324">“C” locale</a>.
<li>If there is no generally-accepted abbreviation for a time zone,
a numeric offset is used instead, e.g., <code>+07</code> for 7 hours
ahead of Greenwich. By convention, <code>-00</code> is used in a
zone while uninhabited, where the offset is zero but in some sense
the true offset is undefined.
</ul>
<p>As a final example, here’s the complete history for Hawaii:</p>
<table border="1">
<tr>
<th colspan="6">Relevant Excerpts from the US Rules</th>
</tr>
<tr>
<td colspan="6" align="center"><table><tr><td>
<pre>
#Rule NAME FROM TO TYPE IN ON AT SAVE LETTER/S
Rule US 1918 1919 - Oct lastSun 2:00 0 S
Rule US 1942 only - Feb 9 2:00 1:00 W # War
Rule US 1945 only - Aug 14 23:00u 1:00 P # Peace
Rule US 1945 only - Sep lastSun 2:00 0 S
</pre>
</td></tr></table></td>
</tr>
<tr>
<th colspan="6">The Zone Record</th>
</tr>
<tr>
<td colspan="6" align="center"><table><tr><td>
<pre>
#Zone NAME STDOFF RULES FORMAT [UNTIL]
Zone Pacific/Honolulu -10:31:26 - LMT 1896 Jan 13 12:00
-10:30 - HST 1933 Apr 30 2:00
-10:30 1:00 HDT 1933 May 21 2:00
-10:30 US H%sT 1947 Jun 8 2:00
-10:00 - HST
</pre>
</td></tr></table></td>
</tr>
<tr>
<th colspan="6">What We Infer</th>
</tr>
<tr>
<th rowspan="2">Wall-Clock<br>Offset from<br>Prime Meridian</th>
<th rowspan="2">Adjust<br>Clocks</th>
<th colspan="2">Time Zone</th>
<th colspan="2">Ending at Local Time</th>
</tr>
<tr>
<th>Abbrv.</th>
<th>Name</th>
<th>Date</th>
<th>Time</th>
</tr>
<tr align="center">
<td>−10:31:26</td>
<td>—</td>
<td>LMT</td>
<td>local mean time</td>
<td>1896-01-13</td>
<td>12:00</td>
</tr>
<tr align="center">
<td>−10:30</td>
<td>+0:01:26</td>
<td>HST</td>
<td>Hawaii standard time</td>
<td>1933-04-30</td>
<td>02:00</td>
</tr>
<tr align="center">
<td>−9:30</td>
<td>+1:00</td>
<td>HDT</td>
<td>Hawaii daylight time</td>
<td>1933-05-21</td>
<td>12:00</td>
</tr>
<tr align="center">
<td>−10:30¹</td>
<td>−1:00¹</td>
<td>HST¹</td>
<td>Hawaii standard time</td>
<td>1942-02-09</td>
<td>02:00</td>
</tr>
<tr align="center">
<td rowspan="2">−9:30</td>
<td>+1:00</td>
<td>HWT</td>
<td>Hawaii war time</td>
<td>1945-08-14</td>
<td>13:30²</td>
</tr>
<tr align="center">
<td>0</td>
<td>HPT</td>
<td>Hawaii peace time</td>
<td>1945-09-30</td>
<td rowspan="2">02:00</td>
</tr>
<tr align="center">
<td>−10:30</td>
<td>−1:00</td>
<td rowspan="2">HST</td>
<td rowspan="2">Hawaii standard time</td>
<td>1947-06-08</td>
</tr>
<tr align="center">
<td>−10:00³</td>
<td>+0:30³</td>
<td colspan="2">—</td>
</tr>
<tr>
<td colspan="6">
¹Switching to US rules…most recent transition (in 1919) was to standard time
</td>
</tr>
<tr>
<td colspan="6">
²23:00 <a href="https://en.wikipedia.org/wiki/Universal_Time">UT</a>
+ (−9:30) = 13:30 local
</td>
</tr>
<tr>
<td colspan="6">
³Since <a href="https://en.wikipedia.org/wiki/ISO_8601">1947–06–08T12:30Z</a>,
the civil time in Hawaii has been
<a href="https://en.wikipedia.org/wiki/Universal_Time">UT</a>/<a href="https://en.wikipedia.org/wiki/Coordinated_Universal_Time">UTC</a>
− 10:00 year-round.
</td>
</tr>
</table>
<p>There will be a short quiz later. <code>8-)</code></p>
<hr>
<address>
This web page is in the public domain, so clarified as of
2015-10-20 by Bill Seymour.
<br>
All suggestions and corrections will be welcome; all flames will be amusing.
Mail to was at pobox dot com.
</address>
</body>
</html>