The titanic3 data frame describes the survival status of individual passengers on the Titanic. The titanic3 data frame does not contain information for the crew, but it does contain actual and estimated ages for almost 80% of the passengers.
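The age coverage figure can be checked by counting non-missing ages. The following is a minimal sketch, assuming the age column is named `age` and that unknown ages are coded as `NA`. `age_coverage` is a hypothetical helper, and `toy` is made-up data for illustration, not rows from titanic3.

```r
# Hypothetical helper: proportion of rows with a non-missing age.
# Assumes the data frame has a column named `age` with NA for unknown ages.
age_coverage <- function(df) {
  mean(!is.na(df$age))
}

# Toy data for illustration only (not actual titanic3 rows)
toy <- data.frame(age = c(29, NA, 2, 30, NA))
age_coverage(toy)
#> [1] 0.6
```

With the dataset loaded, the same helper would be called as `age_coverage(titanic3)`.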
A data frame with 1309 observations on the following 14 variables:
pclass: a factor with levels 1st, 2nd, and 3rd
survived: Survival (0 = No; 1 = Yes)
name: Name
sex: a factor with levels female and male
age: Age in years
sibsp: Number of Siblings/Spouses Aboard
parch: Number of Parents/Children Aboard
ticket: Ticket Number
fare: Passenger Fare
cabin: Cabin
embarked: a factor with levels Cherbourg, Queenstown, and Southampton
boat: Lifeboat
body: Body Identification Number
home.dest: Home/Destination
http://biostat.mc.vanderbilt.edu/twiki/pub/Main/DataSets/titanic.html
Thomas Cason of UVa has greatly updated and improved the titanic data frame using the Encyclopedia Titanica and created a new dataset called titanic3. This dataset reflects the state of data available as of August 2, 1999. Some duplicate passengers have been dropped, many errors have been corrected, many missing ages have been filled in, and new variables have been created.
Harrell, F. E. (2001) Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer.
with(titanic3, table(pclass, sex))
#>       sex
#> pclass female male
#>    1st    144  179
#>    2nd    106  171
#>    3rd    216  493
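The counts in this table can be turned into margins and within-class proportions. The sketch below rebuilds the table from the printed counts so that it runs without the dataset. The object names (`counts`, `tab`, `row_share`) are illustrative, and with the data loaded `tab` could instead come from the `with()` call above.

```r
# Counts copied from the pclass-by-sex table printed above
counts <- matrix(
  c(144, 179,
    106, 171,
    216, 493),
  nrow = 3, byrow = TRUE,
  dimnames = list(pclass = c("1st", "2nd", "3rd"),
                  sex = c("female", "male"))
)
tab <- as.table(counts)

# Passengers per class (margin 1) and per sex (margin 2)
class_totals <- margin.table(tab, 1)
sex_totals <- margin.table(tab, 2)

# Share of each sex within each class; every row sums to 1
row_share <- prop.table(tab, 1)

# Table with row and column totals, then the rounded row shares
print(addmargins(tab))
print(round(row_share, 3))
```

The grand total of the rebuilt table is 1309, which matches the number of observations stated for the data frame.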