The titanic3 data frame describes the survival status of individual passengers on the Titanic. The titanic3 data frame does not contain information for the crew, but it does contain actual and estimated ages for almost 80% of the passengers.

Format

A data frame with 1309 observations on the following 14 variables:

pclass

a factor with levels 1st, 2nd, and 3rd

survived

Survival (0 = No; 1 = Yes)

name

Name

sex

a factor with levels female and male

age

age in years

sibsp

Number of Siblings/Spouses Aboard

parch

Number of Parents/Children Aboard

ticket

Ticket Number

fare

Passenger Fare

cabin

Cabin

embarked

a factor with levels Cherbourg, Queenstown, and Southampton

boat

Lifeboat

body

Body IdentificationNumber

home.dest

Home/Destination

Source

http://biostat.mc.vanderbilt.edu/twiki/pub/Main/DataSets/titanic.html

Details

Thomas Cason of UVa has greatly updated and improved the titanic data frame using the Encyclopedia Titanica and created a new dataset called titanic3. This dataset reflects the state of data available as of August 2, 1999. Some duplicate passengers have been dropped, many errors have been corrected, many missing ages have been filled in, and new variables have been created.

References

Harrell, F. E. (2001) Regression Modeling Strategies with Applications to Linear Models, Logistic Regression, and Survival Analysis. Springer.

Examples

with(titanic3, table(pclass, sex))
#> sex #> pclass female male #> 1st 144 179 #> 2nd 106 171 #> 3rd 216 493