Can I use survival analysis on a regression problem (not survival) with censored outcome #1626
-
I have a machine learning problem of regression (not survival) where I have certain numerical and categorical features and am trying to predict a numerical outcome. I know for a fact that the phenomena is highly non-linear. The problem has nothing to do with survival in that sense. But for some of the outcomes (target values), I just know that they are below certain value. I don't know the exact value. Therefore I can not use standard scikit-learn regression models here as I can not calculate the loss accurately. Is survival analysis suitable for this use case? And if so, what kind of methods would be suitable for such a regression problem? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
Hi @ogencoglu, Yes, survival analysis generally models any censoring of outcomes. Survival is associated with right-censoring, but there is also left and interval censoring. In your case, you have left-censoring. Another common problem that has left-censoring is instrument readings, where some readings are below a detection limit. You can read more about left-censoring here. Anyways, most of the models in lifelines expose a |
Beta Was this translation helpful? Give feedback.
Hi @ogencoglu,
Yes, survival analysis generally models any censoring of outcomes. Survival is associated with right-censoring, but there is also left and interval censoring. In your case, you have left-censoring. Another common problem that has left-censoring is instrument readings, where some readings are below a detection limit. You can read more about left-censoring here.
Anyways, most of the models in lifelines expose a
fit_left_censoring
method that can be used to infer parameters and perform predictions for your dataset.