How to use first observation in growth model simulation
Posted: Tue May 24, 2016 9:04 pm
Suppose that the health condition of 200 patients from each of 50 GPs was observed 3 times last year. The data can be analysed as a 3-level growth model with observations nested within patients, nested within GPs (let's assume that a linear model is appropriate for this kind of data):
HEALTH_ijk = B0_ijk + B1_jk * TIME
B0_ijk = B0 + v0_k + u0_jk + e0_ijk
B1_jk = B1 + v1_k + u1_jk
We have k = 50 GPs, j = 1000 total patients, and i = 3 observations (and 3000 values of HEALTH). TIME is a variable taking values {0, 1, 2} for each observation. This is a random intercept and random slope model with 9 parameters to estimate:
Now suppose that I collect a first observation this year. This means that HEALTH_i=1 will have a value, but HEALTH_i=2 and HEALTH_i=3 will be N/A. I would like to use MLwiN to forecast the values of HEALTH_i=2 and HEALTH_i=3 using the information I have, both from past experience and from the current data. Specifically:
HEALTH_jk = B0_jk
B0_ijk = B0 + v0_k + e0_jk
The residuals are at level 2 now so they are not particularly useful, but the intercept and its variance should remain more or less the same as I collect more data.
More than a data imputation problem (the data are not "missing" in the sense that were missed during the data collection, they do not exist yet!), to me this sounds like something that should be solvable through simulation and MCMC in a Bayesian framework with the appropriate setting of priors. Is this possible in MLwiN? I have been reading the manual on MCMC estimation (http://www.bristol.ac.uk/cmm/software/m ... nuals.html) and I would be grateful if you could point me to relevant chapters or other online resources to figure this out.
Thank you for taking the time to read through all this,
k.
HEALTH_ijk = B0_ijk + B1_jk * TIME
B0_ijk = B0 + v0_k + u0_jk + e0_ijk
B1_jk = B1 + v1_k + u1_jk
We have k = 50 GPs, j = 1000 total patients, and i = 3 observations (and 3000 values of HEALTH). TIME is a variable taking values {0, 1, 2} for each observation. This is a random intercept and random slope model with 9 parameters to estimate:
- - 2 fixed coefficients: B0 and B1 (global intercept and slope)
- 7 random components:
-- 3 intercept variances, one for each level (var(v0), var(u0) and var(e0));
-- 2 slope variances, one slope for the GP and one for each patient (var(v1) and var(u1));
-- and 2 covariances (cov(v01) and cov(u01)
Now suppose that I collect a first observation this year. This means that HEALTH_i=1 will have a value, but HEALTH_i=2 and HEALTH_i=3 will be N/A. I would like to use MLwiN to forecast the values of HEALTH_i=2 and HEALTH_i=3 using the information I have, both from past experience and from the current data. Specifically:
- - From past experience, I know that the parameters will fall within a certain range
- From the current data, I already have two parameters that will remain almost unchanged as more data become available: B0, var(v0)
HEALTH_jk = B0_jk
B0_ijk = B0 + v0_k + e0_jk
The residuals are at level 2 now so they are not particularly useful, but the intercept and its variance should remain more or less the same as I collect more data.
More than a data imputation problem (the data are not "missing" in the sense that were missed during the data collection, they do not exist yet!), to me this sounds like something that should be solvable through simulation and MCMC in a Bayesian framework with the appropriate setting of priors. Is this possible in MLwiN? I have been reading the manual on MCMC estimation (http://www.bristol.ac.uk/cmm/software/m ... nuals.html) and I would be grateful if you could point me to relevant chapters or other online resources to figure this out.
Thank you for taking the time to read through all this,
k.