Flexible Bayesian modelling and causal inference for panel data with R package dynamite

model_formula <- obs(fulltime ~ gender + varying(~ -1 + gender:lag(never) + gender:lag(fulltime)), family = "bernoulli") + aux(never ~ fulltime == 0 & lag(never) == 1 | init(1)) + splines(df = 10) ) fit <- dynamite(model_formula, data = d, group = "id", time = "age", chains = 4, cores = 4, iter = 2000)

# No full time employment at age 30 newdata0 <- d |> filter(age >= 28) newdata0$fulltime[newdata0$age == 30] <- 0 newdata0$fulltime[newdata0$age > 30] <- NA # Full time employment at age 30 newdata1 <- d |> filter(age >= 28) newdata1$fulltime[newdata1$age == 30] <- 1 newdata1$fulltime[newdata1$age > 30] <- NA

pred <- bind_rows( no = predict(fit, newdata = newdata0, funs = list(fulltime = list(mean = mean)))$simulated, yes = predict(fit, newdata = newdata1, funs = list(fulltime = list(mean = mean)))$simulated, .id = "fulltime_30" ) |> filter(age > 28) |> group_by(age) |> summarise(difference = mean_fulltime[fulltime_30 == "yes"] - mean_fulltime[fulltime_30 == "no"] ) |> summarise( mean = mean(difference), lwr = quantile(difference, 0.025), upr = quantile(difference, 0.975) )

Allison, P D, R Williams, and E Moral-Benito. 2017. “Maximum Likelihood for Cross-Lagged Panel Models with Fixed Effects.” Socius.

Hamaker, E L, R M Kuiper, and R PPP Grasman. 2015. “A Critique of the Cross-Lagged Panel Model.” Psychological Methods.

Harvey, A C. 1978. “The Estimation of Time-Varying Parameters from Panel Data.” In Annales de l’inséé.

Harvey, A C, and G D A Phillips. 1982. “The Estimation of Regression Models with Time- Varying Parameters.” In Games, Economic Dynamics, and Time Series Analysis, edited by M. Deistler, E. Fürst, and G. Schwödiauer. Heidelberg.

Helske, J. 2022. “Efficient Bayesian Generalized Linear Models with Time-Varying Coefficients: The Walker Package in R.” SoftwareX.

Hernán, M A, and J M Robins. 2020. Causal Inference: What If.

Lang, S, and A Brezger. 2004. “Bayesian P- Splines.” Journal of Computational and Graphical Statistics.

Pearl, J. 2009. Causality: Models, Reasoning, and Inference.

Sun, Y, R J Carroll, and D Li. 2009. “Semiparametric Estimation of Fixed-Effects Panel Data Varying Coefficient Models.” In Nonparametric Econometric Methods.

Flexible Bayesian modelling and causal inference for panel data with R package dynamite

Panel data

Statistical methods

Cross-lagged panel model with two variables

Limitations of CLPM

Time-varying effects

Time-varying regression

Long-term causal effects (LTCE)

Example of estimating LTCEs

Identifying functional

Alternative identifying functional

Estimating LTCE

Dynamite model

Linear predictor

Time-varying components

Latent factors

Dynamite graph

Causal effect of employment on employment

Causal graph

Defining the model in R

Time-varying effects

Simulation of individual trajectories based on the intervention

Predictions cont.

Results

Future directions

The end