Chapter 3 Data Transformation
The (HURDAT2) data has a comma-delimited, text format with six-hourly information on the location, maximum winds, central pressure, and (beginning in 2004) size of all known tropical cyclones and subtropical cyclones. The dataset is a combination of serveral subsets. Each subset is used for a storm track record which includes header information and values. please refer to this file for detail information.
Firstly, we extract storm id
, name
, and subtext length
from each subtext header, then read in data according to each subtext length
, and merge data subset by indexing it with storm id
, name
. In the original text, you will find that the name
is non-unique. Currently, there are six lists that are used in rotation and re-cycled every six years, i.e., the 2013 list is used again in 2019. For more information, please see tropical cyclone names. To avoid the future confusion, we create storm-id
variable by combining name
and year
together (e.g., Sandy-2012). In the original file, there are storms labeled with NAMEs but others labelled with UNNAME
. Here, we use name_id
variable to indicate whether a storm has a name or not.
Secondly, we estimate category
variable from wind speed based on Saffir-Simpson storm category, calculate the diameter of the area experiencing hurricane strength winds (64 knots or above), _ts_diameter_
from extent of 34 kt wind radii maximum extent in northeastern quadrant (in nautical miles, extent_34_NE
), 34 kt wind radii maximum extent in southeastern quadrant (in nautical miles, extent_34_SW
), 34 kt wind radii maximum extent in northeastern quadrant (in nautical miles, extent_34_NW
), and 34 kt wind radii maximum extent in southeastern quadrant (in nautical miles, extent_34_SE
), _hu_diameter_
from extent of 64 kt wind radii maximum extent in northeastern quadrant - extent_64_NE
, southeastern quadrant - extent_34_SW
, northeastern quadrant, extent_64_NW
), and southeastern quadrant - extent_64_SE
.
Thirdly, we estimate the storm duration tc_dur_track
to those with maximum sustained surface winds of at least 35 knot and defined storms and define tc_dur_type
for type of the duration. Here, S
indicates storms with duration of 2.0 days or less and will be mentioned in the following text as short-lived storms, and L
represnts storms with duration of more than 2.0 days and will be referred as “medium-to-long lived” storms.
Finally, the ocean surface temperature is .nc
formate, we use ncdf4
to read in data. Note that there is there is an “ET” typo in Status of system in the HURDAT2, which has been corrected to EX
in the output data\clean\hurricanes.csv
file.
Meaning for each variables
_id_
Storm id, which is unique. An id is a combination of 8 characters,
for example, ‘AL092011’,
AL (Spaces 1 and 2) – Basin – Atlantic
09 (Spaces 3 and 4) – ATCF cyclone number for that year
2011 (Spaces 5-8, before first comma) – Year
for detail information, please see dataformat
_name_
Storm Name, which is non-unique. There are six lists that are used in rotation and re-cycled every six years, i.e., the 2013 list is used again in 2019. For more information, please see tropical cyclone names.
_storm_id_
Storm name and id combined, i.e., Sandy-2012
_unname_label_
Storms have name or not (“yes”, “no”)
_datetime, year, month, day, hour_
Date of report (in Universal Time Coordinate)
_record_identifier_
C – Closest approach to a coast, not followed by a landfall
G – Genesis
I – An intensity peak in terms of both pressure and wind
L – Landfall (center of system crossing a coastline)
P – Minimum in central pressure
R – Provides additional detail on the intensity of the cyclone when rapid changes are underway
S – Change of status of the system
T – Provides additional detail on the track (position) of the cyclone
W – Maximum sustained wind speed
_latitude,longitude_
Location of storm center
_status_
Storm classification (Tropical Depression, Tropical Storm, or Hurricane)
TD – Tropical cyclone of tropical depression intensity (< 34 knots)
TS – Tropical cyclone of tropical storm intensity (34-63 knots)
HU – Tropical cyclone of hurricane intensity (> 64 knots)
EX – Extratropical cyclone (of any intensity)
SD – Subtropical cyclone of subtropical depression intensity (< 34 knots)
SS – Subtropical cyclone of subtropical storm intensity (> 34 knots)
LO – A low that is neither a tropical cyclone, a subtropical cyclone, nor an extratropical cyclone (of any intensity)
WV – Tropical Wave (of any intensity)
DB – Disturbance (of any intensity)
_category_
Saffir-Simpson storm category (estimated from wind speed. -1 = Tropical Depression, 0 = Tropical Storm)
_max_wind_
storm’s maximum sustained wind speed (in knots)
_min_pressure_
Air pressure at the storm’s center (in millibars)
_ts_diameter_
Diameter of the area experiencing tropical storm strength winds (34 knots or above)
_hu_diameter_
Diameter of the area experiencing hurricane strength winds (64 knots or above)
_max_category_
Maximum category of each storm track
_max_status_label_
Label (“TRUE”, “FALSE”) to indicate whether the measurement is for the maximum status of each track
_max_max_wind_
The maximum value of the max_wind
for each track
_min_min_pressure_
The minimum value of the min_pressure
for each track
_max_ts_diameter_
The maximum value of the ts_diameter
for each track
_max_hu_diamter_
The maximum value of the hu_diameter
for each track
!–test–>