survfit(formula, data=sys.parent(), weights, subset, na.action, newdata, individual=F, conf.int=.95, se.fit=T, type=<<see below>>, error=<<see below>>, conf.type="log", conf.lower="usual")
The Peto lower limit is based on the same "effective n" argument as the modified limit, but also replaces the usual Greenwood variance term with a simple approximation. It is known to be conservative.
The Greenwood formula for the variance is a sum of terms d/(n*(n-m)), where d is the number of deaths at a given time point, n is the sum of wt for all individuals still at risk at that time, and m is the sum of weights for the deaths at that time. The justification is based on a binomial argument when weights are all equal to one; extension to the weighted case is ad hoc. Tsiatis (1981) proposes a sum of terms d/(n*n), based on a counting process argument which includes the weighted case.
For the Fleming-Harrington estimate, two different methods for handling ties have been implemented. If there were 3 deaths out of 10 at risk, then the method due to Fleming and Harrington increments the hazard by 3/10 while the Nelson estimate increments the hazard by 1/10 + 1/9 + 1/8. For curves created after a Cox model these correspond to the Breslow and Efron estimates, respectively, and the proper choice is made automatically. The fh2 method will give results closer to the Kaplan-Meier.
Based on the work of Link (1984), the log transform is expected to produce the most accurate confidence intervals. If there is heavy censoring, then based on the work of Dorey and Korn (1987) the modified estimate will give a more reliable confidence band for the tails of the curve.
Fleming, T. H. and Harrington, D. P. (1984). Nonparametric estimation of the survival distribution in censored data. Comm. in Statistics 13, 2469-86.
Kablfleisch, J. D. and Prentice, R. L. (1980). The Statistical Analysis of Failure Time Data. New York:Wiley.
Link, C. L. (1984). Confidence intervals for the survival function using Cox's proportional hazards model with covariates. Biometrics 40, 601-610.
Tsiatis, A. (1981). A large sample study of the estimate for the integrated hazard function in Cox's regression model for survival data. Annals of Statistics 9, 93-108.
# fit a Kaplan-Meier and plot it fit <- survfit(Surv(time, status) ~ group, data = leukemia) plot(fit, lty = 2:3) legend(100, .8, c("Maintained", "Nonmaintained"), lty = 2:3) # fit a Cox proportional hazards model and plot the # predicted survival curve at the average predictor fit <- coxph(Surv(futime, fustat) ~ age, data = ovarian) plot(survfit(fit), xlab = "Survival in Days")