Introduction

The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.

Introduction

library(rstan)
#> Loading required package: StanHeaders
#> 
#> rstan version 2.32.7 (Stan version 2.32.2)
#> For execution on a local, multicore CPU with excess RAM we recommend calling
#> options(mc.cores = parallel::detectCores()).
#> To avoid recompilation of unchanged Stan programs, we recommend calling
#> rstan_options(auto_write = TRUE)
#> For within-chain threading using `reduce_sum()` or `map_rect()` Stan functions,
#> change `threads_per_chain` option:
#> rstan_options(threads_per_chain = 1)
library(bennu)
library(bayesplot)
#> This is bayesplot version 1.14.0
#> - Online documentation and vignettes at mc-stan.org/bayesplot
#> - bayesplot theme set to bayesplot::theme_default()
#>    * Does _not_ affect other ggplot2 plots
#>    * See ?bayesplot_theme_set for details on theme setting
library(ggplot2)

rstan_options(auto_write = TRUE)
options(mc.cores = 2)

## basic example code
d <- generate_model_data()
#> Warning: `generate_model_data()` was deprecated in bennu 0.2.1.
#> ℹ Please use `model_random_walk_data()` instead.
#> This warning is displayed once every 8 hours.
#> Call `lifecycle::last_lifecycle_warnings()` to see where this warning was
#> generated.
# note iter should be at least 2000 to generate a reasonable posterior sample
fit <- est_naloxone(d, iter = 100)
#> Sampling)
#> Chain 2:                0.198 seconds (Total)
#> Chain 2: 
#> Chain 1: Iteration: 100 / 100 [100%]  (Sampling)
#> Chain 1: 
#> Chain 1:  Elapsed Time: 0.098 seconds (Warm-up)
#> Chain 1:                0.128 seconds (Sampling)
#> Chain 1:                0.226 seconds (Total)
#> Chain 1:
#> Warning: There were 13 divergent transitions after warmup. See
#> https://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
#> to find out why this is a problem and how to eliminate them.
#> Warning: There were 1 chains where the estimated Bayesian Fraction of Missing Information was low. See
#> https://mc-stan.org/misc/warnings.html#bfmi-low
#> Warning: Examine the pairs() plot to diagnose sampling problems
#> Warning: The largest R-hat is NA, indicating chains have not mixed.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#r-hat
#> Warning: Bulk Effective Samples Size (ESS) is too low, indicating posterior means and medians may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#bulk-ess
#> Warning: Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#tail-ess
mcmc_pairs(fit,
  pars = c("sigma", "mu0", "zeta"),
  off_diag_args = list(size = 1, alpha = 0.5)
)

library(posterior)
#> This is posterior version 1.6.1
#> 
#> Attaching package: 'posterior'
#> The following object is masked from 'package:bayesplot':
#> 
#>     rhat
#> The following objects are masked from 'package:rstan':
#> 
#>     ess_bulk, ess_tail
#> The following objects are masked from 'package:stats':
#> 
#>     mad, sd, var
#> The following objects are masked from 'package:base':
#> 
#>     %in%, match

summarise_draws(fit, default_summary_measures())
#> # A tibble: 222 × 7
#>    variable   mean median     sd    mad     q5   q95
#>    <chr>     <dbl>  <dbl>  <dbl>  <dbl>  <dbl> <dbl>
#>  1 sim_p[1]  0.159  0.144 0.0647 0.0647 0.0781 0.270
#>  2 sim_p[2]  0.158  0.150 0.0589 0.0567 0.0798 0.266
#>  3 sim_p[3]  0.154  0.148 0.0454 0.0472 0.0945 0.233
#>  4 sim_p[4]  0.165  0.163 0.0440 0.0450 0.101  0.246
#>  5 sim_p[5]  0.179  0.176 0.0315 0.0306 0.133  0.236
#>  6 sim_p[6]  0.187  0.183 0.0336 0.0298 0.140  0.240
#>  7 sim_p[7]  0.197  0.195 0.0323 0.0303 0.147  0.249
#>  8 sim_p[8]  0.217  0.211 0.0390 0.0385 0.162  0.290
#>  9 sim_p[9]  0.257  0.258 0.0334 0.0330 0.207  0.312
#> 10 sim_p[10] 0.300  0.295 0.0406 0.0394 0.241  0.368
#> # ℹ 212 more rows

We can plot the posterior predictive distribution of the probability of prior kit use and compare to simulated data using the plot_kit_use function

We can also plot the reported number of kits used along with the posterior predictive distribution using the plot_kit_use function with the reported argument set to TRUE

Alternatively, if we only want to plot certain regions we can include that as an optional argument in the plot_kit_use function

We can compare the posterior predictive check to the prior predictive check by re-running the model without the likelihood by setting the run_estimation parameter to FALSE

# note iter should be at least 2000 to generate a reasonable posterior sample
prior_fit <- est_naloxone(d,
  run_estimation = FALSE,
  iter = 100
)
#> [ 60%]  (Sampling)
#> Chain 2: Iteration: 70 / 100 [ 70%]  (Sampling)
#> Chain 2: Iteration: 80 / 100 [ 80%]  (Sampling)
#> Chain 2: Iteration: 90 / 100 [ 90%]  (Sampling)
#> Warning: There were 13 transitions after warmup that exceeded the maximum treedepth. Increase max_treedepth above 10. See
#> https://mc-stan.org/misc/warnings.html#maximum-treedepth-exceeded
#> Warning: There were 3 chains where the estimated Bayesian Fraction of Missing Information was low. See
#> https://mc-stan.org/misc/warnings.html#bfmi-low
#> Warning: Examine the pairs() plot to diagnose sampling problems
#> Warning: The largest R-hat is NA, indicating chains have not mixed.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#r-hat
#> Warning: Bulk Effective Samples Size (ESS) is too low, indicating posterior means and medians may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#bulk-ess
#> Warning: Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#tail-ess
mcmc_pairs(prior_fit,
  pars = c("sigma", "mu0", "zeta"),
  off_diag_args = list(size = 1, alpha = 0.5)
)

We can compare the prior and posterior predictive distribution of the probability of prior kit use using the plot_kit_use function

Incorporating different priors

The default priors for the model are set to be weakly uninformative. To change the default priors we can specify a list as follows. Note that c is the prior of the region coefficients, mu0 is the intercept, ct0 is the initial conditions of the random walk, zeta is the standard deviation of the random walk, and sigma is the standard deviation of the error. In this example let’s set the variation month to month to be smaller by setting the standard deviation of the prior for zeta to 0.01.

nondefault_priors <- list(
  c = list(mu = 0, sigma = 1),
  ct0 = list(mu = 0, sigma = 1),
  zeta = list(mu = 0, sigma = 0.01),
  mu0 = list(mu = 0, sigma = 1),
  sigma = list(mu = 0, sigma = 1)
)
nondefault_prior_fit <- est_naloxone(d, iter = 100, priors = nondefault_priors)
#> Warning: There were 1 divergent transitions after warmup. See
#> https://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
#> to find out why this is a problem and how to eliminate them.
#> Warning: There were 4 transitions after warmup that exceeded the maximum treedepth. Increase max_treedepth above 10. See
#> https://mc-stan.org/misc/warnings.html#maximum-treedepth-exceeded
#> Warning: Examine the pairs() plot to diagnose sampling problems
#> Warning: The largest R-hat is NA, indicating chains have not mixed.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#r-hat
#> Warning: Bulk Effective Samples Size (ESS) is too low, indicating posterior means and medians may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#bulk-ess
#> Warning: Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#tail-ess

Compare the posterior using non-default priors to the posterior for default priors,

plot_kit_use(`Default prior` = fit,
             `Non-default prior` = nondefault_prior_fit, data = d)

Random walk of order 2 model

The second model uses a random walk of order 1 to characterize the time-dependence structure of the inverse logit probability of naloxone use,

\[\Delta^2c_i = c_i - c_{i+1} \sim N(0,\tau^{-1}) \]

The model can also be fit using a random walk of order 2 increments for the inverse logit probability of naloxone use using the following independent second order increments,

\[\Delta^2c_i = c_i - 2c_{i+1} + c_{i+2} \sim N(0,\tau^{-1}) \]

This version of the model will produce smoother estimates of probability of naloxone use in time.

# note iter should be at least 2000 to generate a reasonable posterior sample
# note use `rw_type` to specify order of random walk
rw2_fit <- est_naloxone(d,
  rw_type = 2,
  iter = 100
)
#> ed Time: 0.148 seconds (Warm-up)
#> Chain 1:                0.108 seconds (Sampling)
#> Chain 1:                0.256 seconds (Total)
#> Chain 1:
#> Warning: There were 37 divergent transitions after warmup. See
#> https://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
#> to find out why this is a problem and how to eliminate them.
#> Warning: Examine the pairs() plot to diagnose sampling problems
#> Warning: The largest R-hat is NA, indicating chains have not mixed.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#r-hat
#> Warning: Bulk Effective Samples Size (ESS) is too low, indicating posterior means and medians may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#bulk-ess
#> Warning: Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#tail-ess
mcmc_pairs(rw2_fit,
  pars = c("sigma", "mu0", "zeta"),
  off_diag_args = list(size = 1, alpha = 0.5)
)

Summarize random walk order 2 model draws

summarise_draws(rw2_fit, default_convergence_measures())
#> # A tibble: 222 × 4
#>    variable   rhat ess_bulk ess_tail
#>    <chr>     <dbl>    <dbl>    <dbl>
#>  1 sim_p[1]   1.39     11.1     75.1
#>  2 sim_p[2]   1.38     11.1     73.3
#>  3 sim_p[3]   1.38     11.3     79.2
#>  4 sim_p[4]   1.32     12.6    107. 
#>  5 sim_p[5]   1.29     14.0     65.9
#>  6 sim_p[6]   1.23     16.1     85.8
#>  7 sim_p[7]   1.14     22.3     95.9
#>  8 sim_p[8]   1.11     30.4     80.1
#>  9 sim_p[9]   1.07     51.7    188. 
#> 10 sim_p[10]  1.09     67.7     88.6
#> # ℹ 212 more rows

We can compare two different model fits of the probability of prior kit use to simulated data using the plot_kit_use function

plot_kit_use(rw_1 = fit, rw_2 = rw2_fit, data = d)

fit missing distribution data

As distribution data are reported from sites some months may not be reported. This would lead to a difference in the frequency of the orders data and the distribution data. The model can account for these differences and infer the distribution during missing months. See the following example for generating distribution data once every three months

## basic example code
d_missing <- generate_model_data(reporting_freq = 3)
ggplot(aes(x = times, y = Reported_Used, color = as.factor(regions)),
  data = d_missing
) +
  geom_point()
#> Warning: Removed 32 rows containing missing values or values outside the scale range
#> (`geom_point()`).

missing_fit <- est_naloxone(d_missing,
  rw_type = 2,
  iter = 100
)
#> ed Time: 0.137 seconds (Warm-up)
#> Chain 2:                0.062 seconds (Sampling)
#> Chain 2:                0.199 seconds (Total)
#> Chain 2:
#> Warning: There were 18 divergent transitions after warmup. See
#> https://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
#> to find out why this is a problem and how to eliminate them.
#> Warning: There were 1 transitions after warmup that exceeded the maximum treedepth. Increase max_treedepth above 10. See
#> https://mc-stan.org/misc/warnings.html#maximum-treedepth-exceeded
#> Warning: There were 1 chains where the estimated Bayesian Fraction of Missing Information was low. See
#> https://mc-stan.org/misc/warnings.html#bfmi-low
#> Warning: Examine the pairs() plot to diagnose sampling problems
#> Warning: The largest R-hat is NA, indicating chains have not mixed.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#r-hat
#> Warning: Bulk Effective Samples Size (ESS) is too low, indicating posterior means and medians may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#bulk-ess
#> Warning: Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable.
#> Running the chains for more iterations may help. See
#> https://mc-stan.org/misc/warnings.html#tail-ess
mcmc_pairs(missing_fit,
  pars = c("sigma", "mu0", "zeta"),
  off_diag_args = list(size = 1, alpha = 0.5)
)

plot_kit_use(model = missing_fit, data = d_missing)

plot_kit_use(model = missing_fit, data = d_missing, reported = TRUE)

Change scale of data

the original model was designed for data aggregated into months. If the order and distribution data is aggregated differently then this can be incorporated into the model by updating the delay in reporting distribution and the delay in ordering to distributing distribution.

The delay in ordering to distributing distribution is composed of a gamma distribution with shape parameter \(\alpha\) and inverse scale parameter \(\beta = 1/\theta\). In order to update for example from monthly to weekly data, we can use the following property of the gamma distribution,

\[\Gamma(cx;\alpha,\beta) = \Gamma(x;\alpha,\beta/c)\]

To convert the delay distribution from month into week set \(c = 1/4\) as one over the approximate number of weeks in a month. The reporting delay distribution is a simple vector of length 3 denoting the empirical delay distribution. This can be expanded into months through interpolation and normalization to retain its property as a probability distribution. Example code (not run) is shown below. Note that these distributions will need to be updated when considering another jurisdiction,

# monthly reporting delay distribution
psi_vec <- c(0.7, 0.2, 0.1)
# convert to weeks using interpolation
weekly_psi_vec <- rep(psi_vec, 4) / sum(psi_vec)

# properties of order to distributed delay distribution in months
max_delays <- 3
delay_alpha <- 2
delay_beta <- 1

# convert to weeks
weekly_max_delays <- max_delays * 4
weekly_delay_alpha <- delay_alpha
weekly_delay_beta <- 0.25 * delay_beta

result <- est_naloxone(d,
  psi_vec = weekly_psi_vec,
  max_delays = weekly_max_delays,
  delay_alpha = weekly_delay_alpha,
  delay_beta = weekly_delay_beta
)

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.