# Simulation enhanced distributed lag models for mortality displacement

Koen Simons^{1,2} (corresponding author), Ronald Buyl^{2}, An Van Nieuwenhuyse^{1} and Danny Coomans^{2}

**Received:** 26 May 2016 **Accepted:** 18 October 2016 **Published:** 10 November 2016

## Abstract

Distributed lag models (DLM) are attractive methods for dealing with mortality displacement; however, their estimates can have substantial bias when the data are generated by a multi-state model. In particular, DLMs are not valid for mortality displacement. Alternative methods are scarce and lack feasibility and validation. We investigate the breakdown of DLM in three state models by means of simulation and propose simulation enhanced distributed lag models (SEDLM) to overcome the defects. The new method provides simultaneous estimates of the net effect (entry) and the displacement effect (exit). These have improved performance over the single estimate from a regular DLM. SEDLM entry estimates have negligible bias and their variance is reduced. The exit estimates are unbiased and their variance is one order of magnitude lower than that of the entry estimates. When SEDLM is applied to the original Chicago data, the 95% highest posterior density intervals for both entry and exit contain 0, providing evidence for neither a ‘displacement effect’ nor a ‘net effect’.

## Keywords

- Air pollution
- Distributed lag model
- Harvesting
- Mortality displacement
- Simulation study
- Time series

## Background

Distributed lag models (DLM) have become the dominant approach for modelling acute mortality effects of environmental exposures such as atmospheric ozone (Zanobetti and Schwartz 2008), fine particulate matter (Zanobetti et al. 2002), ambient temperature (Basu 2009) and heat waves (Hajat et al. 2002).

Several arguments favour DLMs. Primarily, they provide an intuitive way of estimating risks when the delay between exposure and event is unknown or variable. Secondly, DLMs are flexible and have been extended to investigate thresholds (Muggeo 2008) and non-linear exposure-response relationships (Gasparrini et al. 2010). Interactions between exposures have also been included (e.g. Filleul et al. 2006). Thirdly, DLMs are fairly easy to implement in standard statistical software. Lastly, DLMs were considered to give both quantitative and qualitative information on mortality displacement.

The distinction between weakened individuals and the general population is important for impact calculations. The general population enjoys a residual life expectancy of several years or decades. Consequently, when air pollution episodes increase mortality rates in this group, the impact on life expectancy is large. However, in the frail group residual life expectancies are very short. Hence, an exposure event can bring deaths in the frail group forward only some days or weeks and the effect on life expectancy is negligible in comparison with the general population.

Naturally, it is possible that there is an effect on both groups. In such a scenario it is preferable to disentangle the effects and consider only the effect on the general population when calculating the impact on life expectancy. Often the sum of the lag estimates generated by a DLM is used, as the individual lag estimates can be both positive and negative. In essence, both the left and right patterns of Fig. 1 are allowed for by a DLM.

Unfortunately, Roberts and Switzer (2004) showed that such estimates are biased. This bias seems to be neither an attenuation nor a consistent over-estimation. Instead, their results show that the bias depends in some non-trivial manner on the number of lags included in the model, on the size of the true effect and on the mean lifetime in the frail population. Although the bias was negligible in some scenarios, it was sufficiently large to lead to spurious conclusions in other settings.

Nonetheless, DLMs remain the most popular class of models for acute effects of air pollution exposure. We believe that two factors contribute to this. Firstly, the motivation for the DLM is intuitive and, although it has to our knowledge never been proven analytically that the estimates should be unbiased, it is difficult to understand why they are sometimes biased towards the null, yet at other times unbiased or biased away from the null. Secondly, there is no well-established alternative model. Even though alternative models were proposed as early as 1999 (Smith et al. 1999), they have not been extensively studied. In particular, the aforementioned Bayesian approach suffered from a large computational burden and only eighteen simulations were performed, all in the setting of zero mortality displacement. Other methods similarly lack feasibility or validation.

In this paper we revisit the multi-state framework necessary for models that generate mortality displacement. Thereafter we recapitulate the DLM approach and provide additional observations pertaining to the factors that contribute to the bias of DLM estimates. Using these insights as heuristics, we propose an intuitive modification of the DLM that corrects for bias: Simulation Enhanced Distributed Lag Models (SEDLM). Subsequently, we compare SEDLM to DLM with a simulation study and demonstrate that the SEDLM produces better estimates. We apply the method to the Chicago data as an illustrative example. Finally, we turn to the discussion and conclusion.

## Mortality displacement

In order to generate mortality displacement, a multi-state model with at least three states is needed. However, we will first consider the simple model depicted in Fig. 2a. Herein there are only two states: healthy and dead. All people start in the healthy state and there is only one possible transition: towards death. Let us assume that the base rate of dying is sufficiently low and the population large enough that the number of healthy people (\(n_t\)) is approximately constant. Thus daily mortality can be considered a Poisson process with rate \(\mu _t\) that is influenced by concurrent and recent exposure including lags of up to *L* days: \(x_t\), \(x_{t-1} \ldots x_{t-L}\). Let us assume for simplicity that the effect is linear: \({\mathrm {log}}\mu _t = \alpha + \beta _0 x_t + \cdots + \beta _L x_{t-L}\). Unless the exposure has a protective effect, none of the \(\beta _i\) can be negative. If we believe that the model in Fig. 2a is the underlying truth, then we should restrict our estimates to be non-negative (\(\hat{\beta _i} \ge 0\)).
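The log-linear rate of the two state model can be evaluated directly. The following is a minimal Python sketch rather than the authors' code; the function name `lagged_rate` and the example numbers are illustrative assumptions.

```python
import math


def lagged_rate(alpha, betas, exposure, t):
    """Return mu_t = exp(alpha + sum_i betas[i] * exposure[t - i]) for lags i = 0..L.

    `betas` holds beta_0..beta_L and `exposure` is the daily exposure series.
    """
    max_lag = len(betas) - 1
    if t < max_lag:
        raise ValueError("day t needs at least L earlier exposure values")
    linear_predictor = alpha + sum(b * exposure[t - i] for i, b in enumerate(betas))
    return math.exp(linear_predictor)


# Illustrative values only: two lags (L = 1) and a three-day exposure series.
example_rate = lagged_rate(alpha=0.0, betas=[0.1, 0.2], exposure=[1.0, 2.0, 3.0], t=2)
```

With all \(\beta _i\) set to zero the rate reduces to \(e^{\alpha }\), the baseline daily mortality.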

The three state model adds a frail state between healthy and dead. On day *t*, \(u_t\) healthy people become frail. We will refer to this transition from healthy to frail as entry. Similarly, on day *t*, \(y_t\) frail people die and we will refer to this transition as exit. Assuming that the healthy population is large and that the individual probability of entry is small, we assign a Poisson distribution to \(U_t\). In addition, we assume that initially there is only a small finite number of frail individuals and that the probability of exit is high for all individuals in the frail state. This can be modelled with a Binomial distribution. As a direct consequence of these choices, the number of people in the frail state (\(m_t\)) varies from day to day. While the outflux is proportional to the size of the frail population, the influx is independent thereof. It is easy to see that the size of the frail state will evolve towards a stable equilibrium (\(E \left[ m_t \right] = E \left[ m_{t-1} \right] \)) wherein the expected influx is equal to the expected number of deaths. If the outflux is too large, then the frail population will shrink and hence the outflux will decrease towards the equilibrium state. Vice versa, a frail population that is too small will experience a net increase until its size, multiplied by the base rate, becomes equal to the expected influx.
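The drift towards equilibrium can be illustrated with a small simulation. This is a hedged sketch under simplifying assumptions that are not taken from the paper: the entry rate and exit probability are held constant (no exposure effect), deaths are drawn before the day's entries are added, and all names and numeric values are hypothetical.

```python
import math
import random


def draw_poisson(rate, rng):
    """Draw one Poisson variate with Knuth's multiplication method (fine for modest rates)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def draw_binomial(size, probability, rng):
    """Draw one Binomial variate as a sum of independent Bernoulli trials."""
    return sum(1 for _ in range(size) if rng.random() < probability)


def simulate_frail_pool(entry_rate, exit_probability, days, initial_frail=0, seed=1):
    """Return the daily frail pool sizes and daily deaths under constant rates."""
    rng = random.Random(seed)
    frail = initial_frail
    sizes, deaths = [], []
    for _ in range(days):
        exits = draw_binomial(frail, exit_probability, rng)  # exit: frail -> dead
        entries = draw_poisson(entry_rate, rng)              # entry: healthy -> frail
        frail = frail - exits + entries                      # assumed update order
        sizes.append(frail)
        deaths.append(exits)
    return sizes, deaths


# Start from an empty frail pool and discard the first 200 days as burn-in.
sizes, deaths = simulate_frail_pool(entry_rate=50.0, exit_probability=0.1, days=2200)
mean_size = sum(sizes[200:]) / len(sizes[200:])
mean_deaths = sum(deaths[200:]) / len(deaths[200:])
```

Under these assumptions the pool size settles around the entry rate divided by the exit probability, and the mean number of daily deaths approaches the entry rate, which is the equilibrium described above.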

## DLMs

A DLM includes *L* lagged exposure terms to capture the full effect of exposure:

As *L* increases, so does the number of degrees of freedom used. Furthermore, the lagged exposure terms are anything but orthogonal. The combined effect is a substantial increase in the variance of the estimates. Adding conditions for the coefficients can reduce the number of degrees of freedom used and thus improve the model fit. One alternative is restricting the \(\beta _i\) to a low order polynomial (Almon 1965; Schwartz 2000).
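A polynomial restriction of this kind can be sketched as follows. The idea, in the spirit of Almon (1965), is to write \(\beta _i = \sum _k \gamma _k i^k\) so that only the few \(\gamma _k\) need to be estimated, using transformed covariates \(z_k(t) = \sum _i i^k x_{t-i}\). The function names and numbers below are illustrative assumptions, not the specification used in the paper.

```python
def polynomial_basis(max_lag, degree):
    """Rows are lags i = 0..max_lag, columns are powers k = 0..degree, entries i**k."""
    return [[float(i) ** k for k in range(degree + 1)] for i in range(max_lag + 1)]


def lag_coefficients(gammas, max_lag):
    """Expand polynomial coefficients gamma_k into lag coefficients beta_i."""
    basis = polynomial_basis(max_lag, len(gammas) - 1)
    return [sum(g * row[k] for k, g in enumerate(gammas)) for row in basis]


def transformed_covariates(exposure, t, max_lag, degree):
    """Return z_k(t) = sum_i i**k * exposure[t - i] for k = 0..degree."""
    basis = polynomial_basis(max_lag, degree)
    return [
        sum(basis[i][k] * exposure[t - i] for i in range(max_lag + 1))
        for k in range(degree + 1)
    ]


# Illustrative check that both parameterisations give the same linear predictor.
gammas = [0.5, -0.1, 0.01]                       # degree-2 polynomial, hypothetical
exposure = [3.0, 1.0, 4.0, 1.0, 5.0, 9.0, 2.0]
betas = lag_coefficients(gammas, max_lag=4)
direct = sum(b * exposure[6 - i] for i, b in enumerate(betas))
via_basis = sum(g * z for g, z in zip(gammas, transformed_covariates(exposure, 6, 4, 2)))
```

Here five lag coefficients are described by three free parameters, which is how the restriction saves degrees of freedom.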

While these approaches improve efficiency, a maximum lag *L* must still be chosen and this influences the estimates. More importantly, to our knowledge, the assumption in Eq. 5 has not been proven. On the contrary, Roberts and Switzer (2004) reported substantial bias when data had been generated from a three state model similar to the one described above, and also when more complex models, such as those including multiple frail states, were used (Roberts 2011). Results from the 2004 paper indicate that the choice of the number of lags influences the bias in the estimates. When the simulation assumed a pure entry model (\(\phi _t \equiv c\)), estimates from distributed lag models with a lag number smaller than the mean lifetime in the frail population tended towards underestimation, which is consistent with the current understanding of DLMs. Yet the pattern remained unclear for more general simulation settings: i.e. when \(\phi _t\) is time-dependent, assumption 5 may produce both over- and underestimation.

## Bias correction through simulation

- 0. Fit a DLM to the data and save the estimates \(\hat{\varvec{\beta }}^{{\mathrm {realdata}}}\).
- 1. Simulation
  - (a) Specify a three state model.
  - (b) Simulate time-series with known \(\beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\).
  - (c) Fit a DLM to each generated time-series, using the same specification as in step 0. Save the sets of \(\lbrace \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}, \hat{\beta }_0^{\mathrm {DLM}}, \ldots , \hat{\beta }_J^{\mathrm {DLM}} \rbrace \).
- 2. Estimate the response surfaces \(\widehat{f_j^{\mathrm {DLM}}} \left( \beta _{\mathrm {entry}},\beta _{\mathrm {exit}} \right) \).
  - (a) Choose a smoother *s*.
  - (b) \(\forall j \in 0\ldots J, \forall \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\) : \(E\left[ \hat{\beta }_j^{\mathrm {DLM}} \vert \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\right] \approx s\left( \hat{\beta }_j^{\mathrm {DLM}} \vert \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\right) \).
  - (c) Calculate the residuals.
  - (d) Estimate the local covariance matrix.
- 3. Compare the estimates from step 0 with the information obtained in step 2.
  - (a) \(\forall \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\): calculate the Mahalanobis distance between \(\hat{\varvec{\beta }}^{\mathrm {realdata}}\) and \({\varvec{s}} \left( \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\right) \).
  - (b) \(\forall \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\): calculate an approximate likelihood \(\widetilde{L} \approx L \left( \beta _{\mathrm {entry}}, \beta _{\mathrm {exit}} \vert \hat{\varvec{\beta }}^{\mathrm {realdata}} \right) \).
  - (c) Use numerical integration to normalize the approximated likelihood \(\widetilde{L}\).
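Steps 3(a) and 3(b) can be sketched for a single grid point as follows. The text does not state how the Mahalanobis distance is converted into a likelihood, so the multivariate Gaussian form used here is an assumption, and the function names and example values are hypothetical.

```python
import math


def solve_with_logdet(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting.

    Returns the solution x and log|det(matrix)|.
    """
    n = len(rhs)
    aug = [list(row) + [value] for row, value in zip(matrix, rhs)]
    log_det = 0.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if aug[pivot][col] == 0.0:
            raise ValueError("covariance matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        log_det += math.log(abs(aug[col][col]))
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    x = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * x[k] for k in range(row + 1, n))
        x[row] = (aug[row][n] - tail) / aug[row][row]
    return x, log_det


def gaussian_log_likelihood(observed, expected, covariance):
    """Return (squared Mahalanobis distance, Gaussian log-likelihood).

    `observed` plays the role of the DLM estimates from the real data,
    `expected` the smoothed surface values at one (entry, exit) pair and
    `covariance` the local covariance matrix of the residuals.
    """
    diff = [o - e for o, e in zip(observed, expected)]
    solved, log_det = solve_with_logdet(covariance, diff)
    distance_sq = sum(d * s for d, s in zip(diff, solved))
    n = len(diff)
    log_lik = -0.5 * (distance_sq + log_det + n * math.log(2.0 * math.pi))
    return distance_sq, log_lik


# Hypothetical two-coefficient example.
distance_sq, log_lik = gaussian_log_likelihood(
    observed=[1.0, 1.0], expected=[0.0, 0.0], covariance=[[2.0, 1.0], [1.0, 2.0]]
)
```

Repeating this for every pair on the grid gives the unnormalized surface \(\widetilde{L}\) that step 3(c) normalizes.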

### Simulation

In order to simulate a time series from model 2, appropriate values for the parameters need to be chosen. The goal is to mimic the conditions of the real dataset. Thus it is a natural choice to use the real exposure and covariate time series and to fix \(\alpha _{\mathrm {entry}}\) to the long-run average of the observed deaths, noting that in the long term the number of deaths equals the number of entry transitions. Similarly an estimate of the smooth seasonal variation and the smooth function of the covariates can be obtained from the real dataset.

Since no exact information on the mean lifetime in the frail state (MLT) is available, a value or a distribution must be chosen for it. Considering that a mean lifetime of several months implies a relatively large possible loss of life expectancy and that mortality displacement aims to distinguish between very small losses and larger losses, it seems logical to limit the maximum value of this parameter. Likewise, noting that \(\phi _t \equiv 1\) is a degenerate case of the three state model, a mean lifetime of zero days seems of little added value. Nonetheless, limiting the MLT to a single value is not a realistic use-case. Therefore we decided to model the uncertainty of this parameter by sampling from a uniform distribution with range one to twenty eight days.

Two parameters can be derived from the MLT: the base probability of death \(\alpha _{\mathrm {exit}}\) and the initial size of the frail population. In equilibrium, the expected number of new frail people is equal to the expected number of deaths. The latter is equal to the size of the frail population multiplied by the probability of dying. Vice versa, the product of the MLT and the mean daily deaths can be substituted for the initial size of the frail population.
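Both derived quantities follow from the equilibrium condition. The sketch below assumes a constant base exit probability, so that it equals 1/MLT, which is what the product rule in the paragraph above implies; the function name and the example values are hypothetical.

```python
def frail_parameters(mlt_days, mean_daily_deaths):
    """Return (base exit probability, initial frail population size).

    Equilibrium requires: frail size * exit probability = mean daily deaths.
    With frail size = MLT * mean daily deaths, the exit probability is 1 / MLT.
    """
    if mlt_days <= 0:
        raise ValueError("the mean lifetime must be positive")
    exit_probability = 1.0 / mlt_days
    initial_frail = mlt_days * mean_daily_deaths
    return exit_probability, initial_frail


# Illustrative values: a mean lifetime of 14 days and 100 deaths per day.
exit_probability, initial_frail = frail_parameters(14.0, 100.0)
```

Starting the simulation at this size avoids a long burn-in period during which the frail population would still be drifting towards equilibrium.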

This leaves only two more parameters to set: \(\beta _{\mathrm {entry}}\) and \(\beta _{\mathrm {exit}}\). In order to obtain good estimates of the surfaces \(f_j\), it is necessary to evaluate them over a wide range of \(\left( \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}\right) \) pairs. A straightforward solution is using an envelope: drawing the pairs from a uniform distribution on a square or rectangle *S*. In order to facilitate computations, this is modified by dividing the square *S* into several smaller cells \(G_i\) and drawing several pairs from uniform distributions on each \(G_i\).

### Smooth response surface

Each point of *S* must lie within exactly one cell \(G_i\) and each cell \(G_i\) has at most eight neighbours. Only points lying in the cell \(G_i\) or in its neighbours can have a non-zero weight. For each of these points, the Euclidean distance from \({\varvec{a}}\) to the center of the kernel (\({\varvec{p}}\)) is calculated. Subsequently a weight is assigned. We use the Epanechnikov kernel (Epanechnikov 1969) with span equal to the length (*h*) of the small squares \(G_i\).
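The kernel weighting can be sketched with the standard Epanechnikov profile, \(\tfrac{3}{4}(1-u^2)\) for \(u<1\) and zero otherwise, where \(u\) is the distance divided by the span. The weighted local average below is an assumed stand-in for the smoother, which may differ from the one used in the paper; all names and values are hypothetical.

```python
import math


def epanechnikov_weight(point, center, span):
    """Weight of a simulated point relative to the kernel center; zero beyond the span."""
    u = math.dist(point, center) / span
    return 0.75 * (1.0 - u * u) if u < 1.0 else 0.0


def kernel_smooth(center, points, values, span):
    """Kernel-weighted average of `values` observed at `points` (assumed smoother)."""
    weights = [epanechnikov_weight(p, center, span) for p in points]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("no simulated points lie within the span")
    return sum(w * v for w, v in zip(weights, values)) / total


# Hypothetical example: three simulated (entry, exit) pairs around the origin.
span = 0.00125
points = [(0.0, 0.0), (0.0005, 0.0), (0.002, 0.0)]
values = [1.0, 2.0, 100.0]   # the third point is farther than the span and is ignored
smoothed = kernel_smooth((0.0, 0.0), points, values, span)
```

Because the weight vanishes at distances of *h* or more, only points in the cell containing the kernel center or in its neighbours can contribute, as described above.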

### Calculating the posterior probability by numerical integration

The normalizing constant *c* is estimated by approximating the integration over *S* by a sum.
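The sum approximation can be sketched on a regular grid. This is a minimal illustration assuming equally sized grid cells and a uniform prior on *S*; the function names and the toy likelihood values are hypothetical.

```python
def normalise_on_grid(likelihood, cell_area):
    """Divide unnormalized likelihood values by c = sum(values) * cell_area.

    `likelihood` maps (entry, exit) grid points to unnormalized values.
    """
    c = sum(likelihood.values()) * cell_area
    if c <= 0.0:
        raise ValueError("the likelihood must be positive somewhere on the grid")
    return {point: value / c for point, value in likelihood.items()}


def marginal_entry(posterior, exit_step):
    """Sum the joint posterior density over the exit axis for each entry value."""
    marginal = {}
    for (entry, _exit), density in posterior.items():
        marginal[entry] = marginal.get(entry, 0.0) + density * exit_step
    return marginal


# Toy 3 x 3 grid with arbitrary positive likelihood values.
step = 0.01
grid_points = [(-0.01 + step * i, -0.01 + step * j) for i in range(3) for j in range(3)]
likelihood = {point: 1.0 + k for k, point in enumerate(grid_points)}
posterior = normalise_on_grid(likelihood, cell_area=step * step)
entry_marginal = marginal_entry(posterior, exit_step=step)
```

After normalization the joint density sums to one when multiplied by the cell area, and the same holds for each marginal when multiplied by the grid step.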

## Data and simulation setup

For maximal comparability, we based our simulations on the same dataset as Roberts and Switzer (2004). The data from the National Morbidity, Mortality, and Air Pollution Study (NMMAPS; Peng et al. 2004) are freely available. They contain time series of daily deaths for multiple cities in the USA. These are stratified by cause of death (accidental, respiratory, pneumonia, ...) and age group (<65, 65–74, \(\ge \)75). They also contain time series of air pollutants and climatological variables for each city.

Following Roberts and Switzer, we selected the city of Chicago and the time period 1987–1994 from the NMMAPS database. We selected all daily deaths in residents of age 65 and above. We used PM10 as exposure and removed both outliers (daily PM10 concentration \(>150\,\upmu\hbox{g/m}^3\)) and missing values. These days were also removed from the mortality series. The temperature (*T*), dew-point temperature (\(T_d\)) and PM10 series were centred.

We used a generalized linear model [GLM, McCullagh and Nelder (1989)] to obtain an estimate of the seasonal component of the mortality time series. More precisely, we fitted a Poisson model with linear predictor \(s_1(t) +{\mathrm {DOW}}(t) + s_2(T_{d}(t)) + s_3(T(t)) + s_4(T(t-1))\) wherein the \(s_i\) are natural cubic splines with respectively 7 degrees of freedom per year, 3, 6 and 6 *df*. DOW is the day of the week effect. We used the package mgcv (Wood 2011) in R 3.0.2 (R Development Core Team 2011).

We chose a square *S* with corners (−0.02, −0.02) and (0.02, 0.02), corresponding to a maximum effect of 22% increase in mortality for each \(10\,\upmu{\text{g}}/{\text{m}}^3\) of PM10. This is reasonably large with respect to current estimates, allowing for a large area away from the boundaries of the envelope while keeping the computational load low. We divided the square into 32 × 32 smaller squares and drew 80 triplets \(\left( \beta _{\mathrm {entry}},\beta _{\mathrm {exit}},{\mathrm {MLT}}\right) \) for each small square, yielding a total number of 81,920 triplets. As noted in the “Simulation” section, the MLT is sampled from a uniform distribution with range 1–28 days.
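The sampling scheme and the two numbers quoted above can be checked with a short sketch. It assumes that the MLT is drawn as a continuous value and that the effects act on the log scale per \(\upmu\)g/m\(^3\); the function name, defaults and seed are hypothetical.

```python
import math
import random


def draw_triplets(lower=-0.02, upper=0.02, cells_per_side=32, draws_per_cell=80,
                  mlt_range=(1.0, 28.0), seed=2016):
    """Draw (beta_entry, beta_exit, MLT) triplets uniformly within each grid cell."""
    rng = random.Random(seed)
    width = (upper - lower) / cells_per_side
    triplets = []
    for row in range(cells_per_side):            # cell index along the entry axis
        for col in range(cells_per_side):        # cell index along the exit axis
            for _ in range(draws_per_cell):
                beta_entry = lower + (row + rng.random()) * width
                beta_exit = lower + (col + rng.random()) * width
                mlt = rng.uniform(*mlt_range)    # continuous draw: an assumption
                triplets.append((beta_entry, beta_exit, mlt))
    return triplets


triplets = draw_triplets()
total_triplets = len(triplets)                    # 32 * 32 * 80
max_relative_increase = math.exp(0.02 * 10) - 1.0  # effect of 10 units at beta = 0.02
```

The count equals 32 × 32 × 80 = 81,920, and \(e^{0.2}-1 \approx 0.22\) reproduces the 22% maximum increase per \(10\,\upmu\)g/m\(^3\).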

Subsequently, we randomly assigned the 80 samples in each square to five groups of size 16, thus creating training sets with 64 samples in each square and test sets with 16 samples in each square. We applied the SEDLM algorithm to all five train- and test set combinations. To avoid artefacts at the edge of *S*, we calculated the posterior probabilities only for points lying within the 30\(\,\times \,\)30 inner squares.
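The split within one small square can be sketched as follows. This is a minimal illustration of the grouping described above; the function names and the seed are hypothetical.

```python
import random


def split_into_groups(n_samples=80, n_groups=5, seed=0):
    """Randomly assign sample indices to equally sized groups."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    size = n_samples // n_groups
    return [indices[g * size:(g + 1) * size] for g in range(n_groups)]


def train_test_pairs(groups):
    """Use each group once as the test set and the remaining groups as training set."""
    pairs = []
    for held_out, test in enumerate(groups):
        train = [i for g, group in enumerate(groups) if g != held_out for i in group]
        pairs.append((train, test))
    return pairs


groups = split_into_groups()
pairs = train_test_pairs(groups)
```

Each of the five pairs holds 64 training samples and 16 test samples, so every simulated series is used for testing exactly once.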

Finally, to examine the effect of the envelope, we created an additional set of simulations on a rectangle with entry effect range from −0.04 to 0.04 and exit effect from −0.02 to 0.02. We used the same size for the inner grid cells and generated 64 time series per cell. These were used to re-estimate the smooth functions with the larger envelope and provide new SEDLM estimates for the original 81,920 test samples.

## Results

Table 1. Simulation results: RMSE \(\times \) 500 for estimated versus simulated entry and exit effects for various (SE)DLM specifications: the maximum lag *L* included and restrictions upon the lag structure

| Maximum lag | 10 | 20 | 10 p4 | 20 p4 | 40 p4 | 60 p4 | 10 bs5 | 20 bs5 | 40 bs5 | 60 bs5 | 60 bs10 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Entry: DLM | 4.11 | 2.79 | 4.15 | 2.59 | 2.85 | 5.01 | 4.14 | 2.55 | 3.07 | 5.27 | 1.91 |
| Entry: SEDLM | 1.62 | 1.20 | 1.61 | 1.13 | 0.98 | 1.00 | 1.61 | 1.13 | 0.99 | 1.03 | 0.93 |
| Exit: SEDLM | 0.34 | 0.33 | 0.35 | 0.36 | 0.63 | 0.92 | 0.35 | 0.38 | 0.70 | 0.97 | 0.36 |
| Entry and exit: SEDLM | 1.65 | 1.24 | 1.65 | 1.19 | 1.16 | 1.36 | 1.65 | 1.19 | 1.21 | 1.41 | 1.00 |

Table 1 shows the results of the simulation. We tested DLMs with multiple choices of *L*, both with and without restrictions. As the number of lag terms increases, the root mean squared error (RMSE) of the unconstrained DLMs decreases. Thus the DLMs’ ability to capture the full effect of exposure increases with the number of lag terms included. The performance of restricted DLMs is similar to that of unconstrained DLMs with an equal number of lag terms; the differences are rather small. For larger numbers of lagged terms, the entry performance decreases again unless the restrictions are lessened.

Figure 6 provides further insight into the advantages of SEDLMs. Clearly the DLM has a larger variance than the SEDLM. Both methods have negligible bias near the center; however, this particular DLM suffers from a non-linear attenuation effect for negative entry, whereas positive entry is overestimated. For the SEDLM the bias remains negligible over a larger range, although attenuation occurs near the edges of the envelope. This effect can be mitigated by using a larger envelope, at the cost of an increased computational burden and a small variance penalty, as is visible in the right panel.

We analysed the data of Chicago with the two best performing SEDLMs. Figure 8 shows the posterior probability density functions as well as the marginals. Zero lies within each marginal highest posterior density interval; these results do not provide evidence of an entry or an exit effect.
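A highest posterior density interval can be read from a gridded marginal as sketched below. This is an illustration under the assumption of a unimodal marginal on an equally spaced grid; it uses a standard normal density as a stand-in, not the Chicago posterior, and the function name is hypothetical.

```python
import math


def hpd_interval(grid, density, step, mass=0.95):
    """Collect grid points in order of decreasing density until `mass` is covered.

    Returns the smallest and largest retained grid value, which is an interval
    only when the density is unimodal.
    """
    order = sorted(range(len(grid)), key=lambda i: density[i], reverse=True)
    kept, covered = [], 0.0
    for i in order:
        kept.append(grid[i])
        covered += density[i] * step
        if covered >= mass:
            break
    return min(kept), max(kept)


# Stand-in marginal: a standard normal density evaluated on [-3, 3].
step = 0.01
grid = [-3.0 + step * i for i in range(601)]
density = [math.exp(-0.5 * g * g) / math.sqrt(2.0 * math.pi) for g in grid]
low, high = hpd_interval(grid, density, step)
contains_zero = low <= 0.0 <= high
```

For this stand-in density the interval is close to the familiar ±1.96, and the check `contains_zero` mirrors the question asked of the entry and exit marginals above.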

Both specifications provide similar posteriors. Furthermore, the results are consistent with Table 1: the accuracy of the exit effect estimate is higher than that of the entry effect estimate, especially for the model with lags 0–60. Similar results were obtained using SEDLM with other lag specifications. To further test sensitivity, we repeated the procedure using subsets of the training set and using a secondary training set wherein the mean life time in the frail state was changed to 1 + Poisson(14). No meaningful changes were observed.

## Discussion

The SEDLM algorithm developed in this paper has multiple benefits over DLMs. Primarily, it provides unbiased estimates. Secondly, it reduces the variance of the estimated entry effect. It also gives an additional estimate of the exit effect. Finally, it illustrates the issue of the strong assumptions necessary to use the sum of lag estimates as an estimate of the entry effect. However, besides these assumptions, other user inputs are required both for DLM and SEDLM.

First, DLMs can be sensitive to the number of lag terms *L* included in the model. The choice of *L* is based on user experience, previous results and intuition about the mean life time in the frail population. Similarly, prior information about the MLT is required for SEDLM. Although our results were not sensitive to changing the mean of the distribution, there may exist sensitivity to more general changes or for other datasets. Even though the SEDLM approach might be generalized to derive a posterior for the MLT too, a prior distribution for the MLT will still need to be chosen. Using at least three non-parallel surfaces \(f_j^{\mathrm {DLM}} \left( \beta _{\mathrm {entry}},\beta _{\mathrm {exit}}, {\mathrm {MLT}}\right) \), such a generalization is conceptually straightforward, but the required three dimensional grid will impose a much larger computational challenge than the two dimensional grid we used. As grid based approaches do not scale well with dimensionality, such extensions are probably impractical.

A second necessary choice for DLMs is the number of degrees of freedom for the seasonal effects. This can be made a priori in GLMs or by optimizing a selection criterion when GAMs are used. For SEDLM, this choice is moved to the Monte Carlo part of the three step process, wherein all time series were generated using the same seasonal smooth. Again, it is straightforward to extend the algorithm to include multiple seasonal functions.

For SEDLM only, an envelope needs to be chosen. It is clear that SEDLM can produce unbiased estimates only when the true effects lie within the envelope. The simulation results show how far from the edge these true effects ought to be. Thus SEDLM can always be made unbiased by increasing the envelope’s size, at the cost of extra computations. Currently, fitting a DLM to each time series is the most time-consuming part. Fortunately, this step is ‘embarrassingly parallel’. Using eight threads allowed us to fit a set of 81,920 time series in less than three hours. The total computational cost increases slightly when unconstrained DLMs are used. The other steps, simulating the data, smoothing the response surfaces and performing the numerical integration, all take mere minutes or seconds with our code. Thus, in our implementation, the total computational burden depends linearly on the number of simulations; in essence on the size of the envelope.

The envelope can also be regarded as prior information on the entry and exit effects. They are uniform within the envelope and zero outside. Any bounded prior can be used with our algorithm and using a more informative prior may be advantageous when combining results from multiple populations and cities. Because the changes occur only in the final step, they can be applied without intensive computations, provided that the boundaries themselves are not changed.

Another difference with DLMs is the necessity to specify a three state model. The model used in this work is the simplest non-degenerate three state model. More complex models can allow for bypassing the frail state, the existence of multiple frail states (Roberts 2011) and non-linear entry and exit effects (Roberts 2011). In addition, the entry and exit effects themselves are sometimes treated as distributed lags. Although the latter does not strike us as parsimonious, the other variations cannot be dismissed as easily and must therefore be investigated. Although this was not explicitly tested, it is unlikely that SEDLM estimates obtained from a poorly specified three state model will be accurate and unbiased. DLMs, however, remain agnostic with respect to the multi-state model, even though they are sensitive to the underlying data generating process. Some processes, including two state models, will not result in biased DLM estimates. Others can result in attenuation or overestimation.

Although SEDLM does not yield perfect answers for all choices a DLM necessitates, it provides better estimates without large downsides. These estimates have consistent frequentist properties both with respect to bias and standard deviation. Noting that DLMs are the most frequently used models, the case for SEDLM is favorable. Other alternatives for DLM have been discussed by Murray and Lipfert (2010). They partition the methods into zero-sum studies and compartment models.

DLMs are zero-sum studies, as are Frequency Domain LogLinear regression (Kelsall et al. 1999) and Timescale LogLinear regression (Dominici et al. 2003). Fung et al. (2005) investigated the robustness of the latter method to data generated by a three state model and concluded that ‘time scale regression has limited value for detecting mortality displacement in time-series data.’ No results are available for Frequency Domain LogLinear regression, but the method is quite similar to the other zero-sum approaches and we suspect it performs similarly.

Besides SEDLM, two other compartment models exist. The Kalman filter is relatively unknown to air pollution epidemiologists, having its origins in electronics. It can be applied to derive estimates of the exit effect and the mean lifetime in the frail population (Murray and Lipfert 2010; Murray and Nelson 2000). However, the few documented implementations that we are aware of, used the assumption that the entry effect is null. This makes the Kalman filter a good candidate for combination with SEDLM. The former provides estimates of the MLT, which can be used as input for the latter to derive joint estimates of the entry and exit effects, assuming one is interested in the entry effect.

Smith et al. (1999) used a full Bayesian approach to derive simultaneous estimates of the MLT, entry and exit effects. Because of the computational intensity of the Markov Chain Monte Carlo, they were unable to check convergence in their simulation study. One iteration failed to converge altogether, rendering interpretation of the results quite difficult: the posterior standard deviations are too small when compared with the frequentist properties.

When the SEDLM was applied to the Chicago data, the highest posterior density intervals for entry and exit effects both included zero. In other words, there is no evidence for a ‘net’ effect (an effect of air pollution on the general population of residents of age 65 and above), and neither is there evidence for a displacement effect. From the non-significant entry estimate it follows that the impact on life expectancy at birth is not significant. Even though the exit effect estimate is not significant, it cannot be concluded that the three state model must be degenerate. Because the exit effect represents the change in exit probability due to air pollution and not the base probability of exit, it remains possible that the latter is not equal to one. Further investigation of this would require a combination with other approaches such as the Kalman filter, extension of the SEDLM method or development of a full Bayesian method. In addition to this and the issues outlined above, the interpretation of the SEDLM estimates keeps the traditional caveats: non-significant effects may be due to lack of power or bias through misspecification of seasonal components or confounders.

## Conclusion

SEDLM can be considered a compartment model whose posterior estimates have consistent frequentist properties. The results of our simulation study allow us to be optimistic about the algorithm as well as the other compartment based approaches.

SEDLM was developed when investigating the origin of the bias that DLMs suffer. By modifying one assumption, the SEDLM significantly improves upon the DLM in terms of mean squared error. This boon is the sum of a reduction in bias and a reduction in standard deviation. In addition to more accurate estimates of the entry effect, SEDLM also delivers simultaneous estimation of the exit effect. The exit estimate is even more accurate than the entry effect estimate. This provides valuable quantitative information on the mortality displacement hypothesis.

Besides a compartment based, full Bayesian approach, no other method for simultaneous estimation is available and SEDLM is currently the only approach that is feasible. These results warrant further investigation into SEDLM and/or the full Bayesian approach.

## Additional file

The reader is referred to the on-line Additional file for R code (Additional file 1).

## Declarations

### Authors' contributions

KS drafted the manuscript and elaborated the proposed SEDLM methodology. RB and DC supervised the development of the SEDLM methodology and contributed to writing the discussion and conclusion. AVN helped contextualizing the mortality displacement problem and assisted in obtaining the necessary funding. All authors read and approved the final manuscript.

### Acknowledgements

This work was supported by The Brussels Institute for Research and Innovation (INNOVIRIS).

### Competing interests

The authors declare that they have no competing interests.

**Open Access** This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

## References

- Almon S (1965) The distributed lag between capital appropriations and expenditures. Econometrica 33(1):178–196
- Basu R (2009) High ambient temperature and mortality: a review of epidemiologic studies from 2001 to 2008. Environ Health 8(1):40
- Dominici F, McDermott A, Zeger SL, Samet JM (2003) Airborne particulate matter and mortality: timescale effects in four US cities. Am J Epidemiol 157:1055–1065
- Epanechnikov VA (1969) Non-parametric estimation of a multivariate probability density. Theory Probab Appl 14(1):153–158
- Filleul L, Cassadou S, Médina S, Fabres P, Lefranc A, Eilstein D, Le Tertre A, Pascal L, Chardon B, Blanchard M, Declercq C, Jusot JF, Prouvost H, Ledrans M (2006) The relation between temperature, ozone, and mortality in nine French cities during the heat wave of 2003. Environ Health Perspect 114(9):1344
- Fung K, Krewski D, Burnett R, Ramsay T, Chen Y (2005) Testing the harvesting hypothesis by time-domain regression analysis. II: Covariate effects. J Toxicol Environ Health A Curr Issues 68:1155–1165
- Gasparrini A, Armstrong B, Kenward MG (2010) Distributed lag non-linear models. Stat Med 29(21):2224–2234
- Hajat S, Kovats RS, Atkinson RW, Haines A (2002) Impact of hot temperatures on death in London: a time series approach. J Epidemiol Commun Health 56(5):367–372
- Hastie TJ, Tibshirani RJ (1990) Generalized additive models. In: Johnston E (ed) Monographs on statistics and applied probability, vol 43. Chapman and Hall/CRC, Boca Raton
- Kelsall JE, Zeger SL, Samet JM (1999) Frequency domain log-linear models: air pollution and mortality. Appl Stat 48:331–344
- McCullagh P, Nelder JA (1989) Generalized linear models. In: Monographs on statistics and applied probability, vol 37. Chapman and Hall/CRC, Boca Raton
- Muggeo VM (2008) Modeling temperature effects on mortality: multiple segmented relationships with common break points. Biostatistics 9(4):613–620
- Murray CJ, Lipfert FW (2010) Revisiting a population-dynamic model of air pollution and daily mortality of the elderly in Philadelphia. J Air Waste Manag Assoc 60:611–628
- Murray CJ, Nelson CR (2000) State-space modelling of the relationship between air quality and mortality. J Air Waste Manag Assoc 50:1075–1080
- Peng RD, Welty LJ, McDermott A (2004) The national morbidity, mortality, and air pollution study. Database in R. Johns Hopkins University, Department of Biostatistics Working Papers (44)
- R Development Core Team (2011) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. http://www.R-project.org. ISBN 3-900051-07-0
- Roberts S (2011a) Can mortality displacement mask thresholds in the concentration-response relation between particulate matter air pollution and mortality? Atmos Environ 45(27):4728–4734
- Roberts S (2011b) What are distributed lag models of particulate matter air pollution estimating when there are populations of frail individuals? Environ Int 37:586–591
- Roberts S, Switzer P (2004) Mortality displacement and distributed lag models. Inhalation Toxicol 16:879–888
- Schwartz J (2000) The distributed lag between air pollution and daily deaths. Epidemiology 11(3):320–326
- Smith RL, Davis JM, Speckman P (1999) Human health effects of environmental pollution in the atmosphere. In: Barnett V, Stein A, Turkman F (eds) Statistics for the environment 4: statistical aspects of health and the environment, chap 6. Wiley, Chichester, pp 91–115
- Wood SN (2011) Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J R Stat Soc Ser B (Stat Methodol) 73(1):3–36
- Zanobetti A, Schwartz J (2008) Mortality displacement in the association of ozone with mortality: an analysis of 48 cities in the United States. Am J Respir Crit Care Med 177(2):184–189
- Zanobetti A, Schwartz J, Wand MP, Ryan LM (2000) Generalized additive distributed lag models: quantifying mortality displacement. Biostatistics 1:279–292
- Zanobetti A, Schwartz J, Samoli E, Gryparis A, Touloumi G, Atkinson R, Le Tertre A, Bobros J, Celko M, Goren A (2002) The temporal pattern of mortality responses to air pollution: a multicity assessment of mortality displacement. Epidemiology 13(1):87–93