As described above, we modeled the trajectories by applying a blind source separation algorithm that learns independent components that are linearly combined with joint-specific time delays (Omlor & Giese,
2007a,
2007b). The joint-angle trajectories
xi(
t) were thus approximated by linear superpositions of the statistically independent source signals (basis functions)
sj(
t), weighted by the mixing weights
αij (
Equation A1). As described above, the model incorporates phase differences between different limbs by allowing for time delays
τij between source signals and angle trajectories:
Exploiting the framework of time-frequency analysis (Wigner-Ville transformation) and critically the fact that the sources are mutually uncorrelated, this relationship can be transformed into the following identities for the Fourier transforms of the trajectories and the source signals (Omlor & Giese,
2007a):
These two equations can be solved by consecutive iteration of the following two steps, until convergence is achieved:
-
Solving
Equation A2, by applying source separation methods with additional positivity constraint, such as non-negative PCA (Oja & Plumbley,
2003), positive ICA (Hojen-Sorensen, Winther, & Hansen,
2002) or non-negative matrix factorization (NMF) (Lee & Seung,
1999). This is justified by the fact that the only difference between
Equation A2 and the standard instantaneous mixing model of standard PCA or ICA is the fact that all variables are non-negative.
-
Solving
Equation A3 numerically, given the results of the preceding step. The solution provides the unknown delays
τ ij and the phases of the Fourier transforms of the source signals arg(
Fs j). To separate these two variables, we estimate
τ ij in a separate step which is then iterated with the solution of
Equation A3.
This separate step for delay estimation exploits the phase information in the Fourier domain. The Fourier transform of a delayed signal simply corresponds to the original Fourier transform multiplied by a complex exponential that depends on the time shift. Assuming the signal
x 2(
t) is a scaled and time-shifted copy of the signal
x 1(
t), such that
x 2(
t) =
αx 1(
t −
τ), the following relationship in the Fourier domain holds (
specifying the complex conjugate of
z):
Equation A4 implies that arg(
Fx 1(
ω) ·
Fx 2(
ω)) = 2
πωτ, which has to hold for all frequencies. The delay can thus be estimated by linear regression, concatenating the equations for a set of different frequencies,
τ specifying the slope of the regression line.
Equation A4 shows how the complex phase of the cross-spectrum is connected with the unknown delay
τ ij.
If the two signals
x 1 and
x 2 are influenced by Gaussian additive noise, it can be shown that the delay can be estimated by linear regression using the equation
where
ɛ(
ω) is a composite noise term. Under appropriate assumptions, the estimated slope 2
πτ of this regression line is the best unbiased linear estimator (Chan, Hattin, & Plant,
1978).
Since the time delays for the individual joints varied only weakly between the different emotions (Omlor & Giese,
2007b), we constrained all delays belonging to the same joint across all emotions to the same value (i.e.,
τij =
τkj if
i, k specify the same joint and source, but different emotions). This constraint resulted in a higher interpretability of the mixing weights. Assuming we want to estimate a common delay from the time shifts between a reference signal
x0(
t) and the signals
xl(
t), 1 ≤
l ≤
L, we can concatenate all regression equations belonging to the same joint into the vector relationship
where the vector
c contains the values of the cross spectrum for the different signals, and where
u is a one-element vector. Concatenating these equations over different values of the frequency
ω results in a regression problem from which the joint delay can be estimated in the same way as from
Equation A5.