Publications

From sequences to variables - Rethinking the relationship between sequences and outcomes

Abstract Sequence analysis (SA) has gained increasing interest in social sciences for the holistic analysis of life course and other longitudinal data. The usual approach is to construct sequences, calculate dissimilarities, group similar sequences with cluster analysis, and use cluster membership as a dependent or independent variable in a linear or nonlinear regression model. This approach may be problematic as the cluster memberships are assumed to be fixed known characteristics of the subjects in subsequent analysis.

By Satu Helske, Jouni Helske, Guilherme K. Chihaya in Clustering

March 8, 2023

dynamite: An R Package for Dynamic Multivariate Panel Models

Abstract dynamite is an R package for Bayesian inference of intensive panel (time series) data comprising of multiple measurements per multiple individuals measured in time. The package supports joint modeling of multiple response variables, time-varying and time-invariant effects, a wide range of discrete and continuous distributions, group-specific random effects, latent factors, and customization of prior distributions of the model parameters. Models in the package are defined via a user-friendly formula interface, and estimation of the posterior distribution of the model parameters takes advantage of state-of-the-art Markov chain Monte Carlo methods.

Estimating the causal effect of timing on the reach of social media posts

Abstract Modern companies regularly use social media to communicate with their customers. In addition to the content, the reach of a social media post may depend on the season, the day of the week, and the time of the day. We consider optimizing the timing of Facebook posts by a large Finnish consumers’ cooperative using historical data on previous posts and their reach. The content and the timing of the posts reflect the marketing strategy of the cooperative.

A Bayesian spatio-temporal analysis of markets during the Finnish 1860s famine

Abstract We develop a Bayesian spatio-temporal model to study pre-industrial grain market integration during the Finnish famine of the 1860s. Our model takes into account several problematic features often present when analysing multiple spatially interdependent time series. For example, compared with the error correction methodology commonly applied in econometrics, our approach allows simultaneous modelling of multiple interdependent time series avoiding cumbersome statistical testing needed to predetermine the market leader as a point of reference.

By Tiia-Maria Pasanen, Miikka Voutilainen, Jouni Helske and Harri Högmander in Bayesian Inference

June 20, 2022

Efficient Bayesian generalized linear models with time-varying coefficients: The walker package in R

Abstract The R package walker extends standard Bayesian general linear models to the case where the effects of the explanatory variables can vary in time. This allows, for example, to model the effects of interventions such as changes in tax policy which gradually increases their effect over time. The Markov chain Monte Carlo algorithms powering the Bayesian inference are based on Hamiltonian Monte Carlo provided by Stan software, using a state space representation of the model to marginalise over the regression coefficients for efficient low-dimensional sampling.