The model for censored linear regression (non-Bayesian version) is:
The observed variables are
,
and
and we are to estimate
and
. The latent variable in this model is
. The complete likelihood is:
if
, and is:
if
, which implies
. We observe that the integral is going to appear inside the logarithm operator, so it is better to approximate this value from its moments. One possible approximation is to use (11.137) and (11.138), so when
, the first and the second moment of
are:
respectively. Note that this is a variant version of the E-step.
The log likelihood for the entire dataset now becomes: