--- title: "Model formulation" author: "Georg Rüppel" date: "`r Sys.Date()`" output: rmarkdown::html_vignette: toc: yes vignette: | %\VignetteIndexEntry{Model formulation} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ## Hidden Markov model ```{r, echo=FALSE, fig.align='center'} DiagrammeR::grViz(" digraph { fontname='Helvetica,Arial,sans-serif' node [fontname='Helvetica,Arial,sans-serif'] edge [fontname='Helvetica,Arial,sans-serif'] # Weight [fillcolor='#5983b0', style=filled] z0 [label='zₜ₋₂'] y0 [label='yⱼ₋₂'] z1 [label='zₜ₋₁'] y1 [label='yⱼ₋₁'] z2 [label='zₜ'] y2 [label='yⱼ'] z3 [label='zₜ₊₁'] y3 [label='yⱼ₊₁'] s0 -> z0 -> y0 s1 -> z1 -> y1 s2 -> z2 -> y2 s3 -> z3 -> y3 subgraph cluster_0 { style=filled; color=lightgrey; node [style=filled,color=white]; {rank=same; s0 s1 s2 s3} s0 [label='sₜ₋₂'] s1 [label='sₜ₋₁'] s2 [label='sₜ'] s3 [label='sₜ₊₁'] s0 -> s1 -> s2 -> s3 label='State process'; } }", width = 500, height = 300) ``` Hidden Markov models (HMM) describe the evolution of a sequence of random variables, $\{S_t\}$ (i.e. behavioural states), which are not directly observable, but can be inferred from another sequence of random variables, $\{Y_t\}$, that are observable (i.e. locations). The two main characteristics of HMMs are (1) each observation is assumed to be generated by one of $N$ distributions, and (2) the hidden state sequence that determines which of the $N$ distributions is chosen at time $t$ is modelled as a Markov chain, where the probability of being in each state at time $t$ depends only on the state value at the previous time step. ### State process The state process $\{S_t\}$ of a $N$-state HMM for $T$ time steps is characterised by its state transition probability matrix $\Gamma^{(t)} = (\gamma^{(t)}_{i,j})$, where $i,j = 1, \dots, N$ and $\gamma^{(t)}_{i,j} = \text{Pr}(s_{t+1} = j|s_t = i)$. The probability of transitioning to state $s_t$ from state $s_{t-1}$ is $$ s_t \sim \text{Categorical}(\Gamma^{(t-1)}), \quad 1 \leq t \leq T. $$ ### Process model The process equation for the true locations of the animal at regular time intervals $t$, $z_t = \begin{bmatrix} z_{t, \text{lon}} \\ z_{t, \text{lat}} \end{bmatrix}$, assumes that the animal's location at time $t$ is not only dependent on the previous location, $z_{t-1}$, but also on the animal's previous displacement in each coordinate, $z_{t-1} - z_{t-2}$: $$ z_t = z_{t-1} + \lambda_n (z_{t-1} - z_{t-2}) + \epsilon_t, \quad \epsilon_t \sim \text{N}(0, \Omega), \quad 1 \leq n \leq N, $$ where $$ \Omega = \begin{bmatrix} \tau^2_{\epsilon, \text{lon}} & 0 \\ 0 & \tau^2_{\epsilon, \text{lat}} \end{bmatrix}. $$ The state-depended parameter, $\lambda_n$, can take values between 0 and 1 (i.e., $0 \leq \lambda \leq 1$), and controls the degree of correlation between steps. By default, `movetrack` estimates track-specific $\lambda_n$ values, but it is also possible to use the same $\lambda_n$ for all tracks by setting `i_lambda = FALSE`. ### Observation model The observed locations of an animal, $y_j = \begin{bmatrix} y_{j, \text{lon}} \\ y_{j, \text{lat}} \end{bmatrix}$, often have irregular time intervals $j$, with $J$ representing the total number of observed locations. Therefore, the true location of the animal is linearly interpolated to the time of the observation, with $w_j$ representing the proportion of the regular time interval between $t-1$ and $t$ when the observation $y_j$ was made: $$ y_j = w_j z_t + (1 - w_j) z_{t-1} + \theta_j, \quad \theta_j \sim \text{T}(0, \sigma_j), \quad 1 \leq j \leq J, $$ where $\text{T}(0, \sigma_j)$ denotes a bivariate Student's $t$-distribution with measurement error $\sigma_j = \begin{bmatrix} \sigma_{j, \text{lon}} \\ \sigma_{j, \text{lat}} \end{bmatrix}$. ## References Auger-Méthé, M., Newman, K., Cole, D., Empacher, F., Gryba, R., King, A. A., ... & Thomas, L. (2021). A guide to state–space modeling of ecological time series. *Ecological Monographs*, 91(4), e01470. doi: [10.1002/ecm.1470](https://doi.org/10.1002/ecm.1470)