u/AcademicShelter6246

Predictor Model Comparison- What is the best approach?

Hi all,

I'm working on a prognostic landmark analysis in a cardiac disease cohort (N=194) ( high for this condition) and looking for advice on the best analytical strategy given our sample size constraints.

Study setup:

  • Landmark design: predictors measured at baseline and follow-up visit, outcomes counted after follow-up visit
  • 4 binary predictors (worsened vs not): Predictors 1, 2 3 and a novel clinical marker.
  • Primary outcome: composite of CV hospitalization or death (77 events)
  • Secondary outcomes: first CV hospitalization (73), composite HF hospitalization/death (23), all-cause death (18)

The hypothesis: Three of the four predictors are FDA-validated endpoints used in clinical trials. Our novel predictor has shown prognostic value in prior univariate analyses, and also multivariable Cox regression, but has never been directly compared head-to-head with the validated ones. We hypothesize it performs similarly in terms of prognostic magnitude.

What we've done so far:

  1. Univariate Cox for each predictor × each outcome
  2. Ordinal domain score (0–4 worsened domains) as a single parsimonious predictor
  3. C-statistic comparison across nested models (with vs without novel predictor)
  4. LRT for incremental value of novel predictor above the two functional measures
  5. Pairwise models (novel predictor + each comparator)
  6. Andersen-Gill for recurrent hospitalizations (131 total events)

The problem: With only 73–77 primary events and 4 binary predictors, we're at ~19 EPV — adequate for univariate and domain score analyses, but underpowered for full multivariable Cox. The novel predictor appears in only 32/194 patients (16.5%), limiting statistical power further.

Specific questions:

  1. Are there methods beyond C-statistic and LRT better suited to compare prognostic markers in this underpowered setting?
  2. Is NRI (Net Reclassification Index) appropriate here given binary predictors and a time-to-event outcome?
  3. Would a permutation-based approach or bootstrapped C-statistic comparison be more appropriate than asymptotic LRT?
  4. Any recommendations or a different approach to analyze this 4 predictors?

Thanks in advance folks

reddit.com
u/AcademicShelter6246 — 7 days ago