 Methodology
 Open Access
 Published:
Empirical mode decomposition with missing values
SpringerPlus volume 5, Article number: 2016 (2016)
Abstract
This paper considers an improvement of empirical mode decomposition (EMD) in the presence of missing data. EMD has been widely used to decompose nonlinear and nonstationary signals into some components according to intrinsic frequency called intrinsic mode functions. However, the conventional EMD may not be efficient when missing values are present. This paper proposes a modified EMD procedure based on a novel combination of empirical mode decomposition and selfconsistency concept. The selfconsistency provides an effective imputation method of missing data, and hence, the proposed EMD procedure produces stable decomposition results. Simulation studies and the image analysis demonstrate that the proposed method produces substantially effective results.
Background
Multiscale methods for decomposing a signal into several significant modes with simple forms have been widely studied for last two decades. Since the decomposition procedure reduces the complexity of a signal and at the same time, enhances interpretability of its components, the information embedded in a signal can be easily recovered.
Spectral analysis (Priestley 1981) and wavelet analysis (Mallat 2009; Daubechies 1992; Vidakovic 1999) are popular multiscale methods for signal decomposition. Huang et al. (1998) proposed a dataadaptive procedure called empirical mode decomposition (EMD), and Daubechies et al. (2011) proposed an alternative method of EMD, termed synchrosqueezed wavelet transforms, which are based on reassignment methods of wavelet coefficients. These multiscale decomposition methods implicitly assume that a signal is observed at equally spaced time points. Since the local behavior of a signal can evolve over time, by utilizing the local information observed at equally spaced time points, it is useful for identifying the amount of variation at different scale and time location and for extracting each superimposed component. However, for many signals, missing values occur quite common. In practice, large amount of missing values may occur at random, for example, for intermittent wireless signal caused by malfunction of network device for onedimensional signal and partial fingerprint due to incomplete touch in digital scanner for twodimensional image. The problem we concern in this paper is that when some observations are missing, most multiscale decomposition methods produce ineffective outcome. Especially, since EMD depends on the behavior of local extrema, missing of local extrema causes severe distorted results. The brief review of EMD procedure will reveal this aspect more clearly.
Huang et al. (1998) proposed a datadriven multiscale procedure for sequentially separating each superimposed component from a given signal s(t) as follows. First, identify local extrema and construct two functions called upper and lower envelopes by interpolating local maxima and local minima, respectively. Second, take a mean of upper and lower envelopes, which produces a signal with a lower frequency than that of the original signal s(t). Third, subtract the mean envelope from the signal s(t) and obtain a highly oscillatory wave h(t). Such an oscillatory wave h(t) is defined as an intrinsic mode function (IMF) if it satisfies two conditions: (1) the number of extrema and the number of zerocrossings of h(t) is equal or differ by one and (2) the local average of h(t) is zero. If the wave h(t) does not satisfy the above conditions, then the same procedure is repeated for h(t) until the conditions are satisfied. This iterative process is called sifting. Through sifting the original signal s(t) is decomposed into the highest frequency \({{\mathrm{imf}}}_1(t)\) and a residual signal \(r_1(t) = s(t)  {{\mathrm{imf}}}_1(t)\). As the sifting is applied to the residue, the signal is sequentially decomposed into several signals having different frequencies from the highestfrequency component \({\mathrm{imf}}_1(t)\) to the lowestfrequency component \({\mathrm{imf}}_n(t)\) and a residual signal r(t). After decomposition, we finally have n IMFs and a residual signal
In the literature, there have been many studies to enhance the performance of the conventional EMD. Boudraa and Cexus (2007) separated the highfrequency components using a filtering method. Wu and Huang (2009) developed the ensemble EMD (EEMD) by averaging the simulated signals. The variants of EEMD have been proposed by several authors. The complementary ensemble EMD (CEEMD) (Yeh et al. 2010) was introduced by adding pairs of positive and negative noises into a signal and applying EEMD. Torres et al. (2011) proposed the complete ensemble EMD with adaptive noise (CEEMDAN). EEMD is applied to each stage of decomposition by adding a noise to a signal and a residue after each IMF extraction. The improved complete ensemble EMD (ICEEMD) (Colominas et al. 2014) controlled noise level between the added noise and a residue for CEEMDAN process. Xu et al. (2009) proposed a hybrid extrema estimation algorithm based on Fourier interpolation to decompose signals with lower sampling rate. Diop et al. (2010) suggested a PDEbased approach to compute envelopes. Barnhart et al. (2011, 2012) provided a methodology for discontinuous data by applying EMD on each individual continuous data segment, and by adapting mirroring approach for the discontinuous data gaps. Kim et al. (2012a) introduced the statistical EMD adapting smoothing of local extrema instead of interpolation. Komaty et al. (2014) suggested a signalfiltering approach based on a combination of EMD and a similarity measure for noise removal. Park et al. (2015) applied a quantile smoothing method to a signal itself instead of interpolating local extrema of a signal for sifting. The extension of EMD to twodimensional image has been developed by several authors. Damerval et al. (2005) employed moving window and Nunes et al. (2005) used morphological operation for twodimensional extrema detection. Bhuiyan et al. (2008) proposed orderstatistics filter method for envelope estimation of twodimensional image. Kim et al. (2012b) proposed a twodimensional EMD through the smoothing sifting of twodimensional local extrema.
As observed in the aforementioned EMD procedure, when some missing values are present, EMD produces distorted decomposition results due to two reasons: (1) when the observations are not equally spaced, it is difficult that the local behavior of a signal can be captured in a balanced way and (2) especially if missing occurs in local extrema, the sifting fails to capture upper and lower envelopes properly.
We consider a simulated example that clarifies the abovementioned problem of the conventional EMD and provides a motivation of the proposed method. The left panel of Fig. 1 illustrates a signal of three components \(\sin (\pi t)+ \sin (2 \pi t) + \sin (6 \pi t)(t \in [0, 30])\) and sifting process. Figure 1b shows upper and lower envelopes constructed by interpolating the local maxima and minima (black points), respectively, and its mean envelope denoted by dotted line on \(t \in [15, 20]\). By subtracting the mean envelope from the original signal, a candidate IMF h is obtained as in Fig. 1c. The sifting process continues until the first IMF \(imf_1\) in Fig. 1d is extracted. On the other hand, the right panel of Fig. 1 shows the sifting process for a signal with 40% missing values denoted by red points. By observing the location of local extrema and missing values in Fig. 1e, we notice that the overall pattern of the original signal cannot be captured between two envelopes. Especially, on the area around \(t=17\) in Fig. 1f, the lower envelope is distorted due to the missing values. This phenomena eventually produces improper candidate IMF h and the first IMF \(imf_1\) as displayed in Fig. 1g, h. Thus, the conventional EMD does not work properly to decompose the three component signal with missing values, and hence, fails to extract the sinusoid component \(\sin (6 \pi t)\) effectively.
To improve EMD algorithm in the presence of missing values, we propose a new method by adapting the concept of selfconsistency that recursively imputes missing values and decomposes the imputed signal efficiently under EMD framework. For practical implementation, we provide a modified EMD algorithm which consists of two alternating steps, imputation and decomposition. In addition, we discuss some remarks of the algorithm such as the fitting method, the selection of smoothing parameter in the fitting method, the choice of initial values, and so on. Furthermore, we extend the proposed method to a twodimensional signal with missing values, so that this extension provides a meaningful influence on image decomposition.
The rest of the paper is organized as follows. In “Methods” section, we briefly review the selfconsistency principle, and propose a new method for signal decomposition in presence of missing values with a practical algorithm. To evaluate empirical performance of the proposed method, simulation studies for onedimensional signal are conducted in “Numerical study” section, and a real data example is presented in this section. Furthermore, in “Extension to image” section, the extension to twodimensional signals is discussed. Lastly, conclusions are addressed in “Conclusions” section.
Methods
Review: selfconsistency
Tarpey and Flury (1996) introduced the selfconsistency as a fundamental concept in statistics, which is inspired by Hastie and Stuetzle (1989) for developing principal curves.
Definition 1
(Tarpey and Flury 1996) A random signal f is selfconsistent for g if \(E(fg)=g\) almost surely.
As pointed out in Tarpey and Flury (1996), there is a close connection between the concept of selfconsistency and ExpectationMaximization (EM) algorithm of Dempster et al. (1977). The EM algorithm generates a sequence of selfconsistent random variables and the maximum likelihood estimator satisfies the selfconsistency condition. Further, Lee and Meng (2007) considered a selfconsistent regression estimator with incomplete data. They proposed an estimate \({\hat{f}}_{obs}\) of f given observed data \({\varvec{X}}_{obs}\) that is the solution of the following selfconsistent equation
where \({\hat{f}}_{com}\) denotes an estimate of f based on the imaginary complete data \({\varvec{X}}_{com}=({\varvec{X}}_{obs},{\varvec{X}}_{mis})\) and \({\varvec{X}}_{mis}\) is missing data. Since Eq. (1) does not depend on the method for estimation, it can be applicable for EMD approach with missing data.
Proposed algorithm
Suppose that a signal s(t) is observed at equally spaced time points, but there are some missing values at a set of time points \(t_m\). Let \(t_c=(t_o,t_m)\) be the set of time points in the complete data \(s(t_c)\), where \(t_o\) denotes the set of time points for the observed signal. Thus, it follows that \(s(t_c)=(s(t_o),s(t_m))\). Suppose for the moment that, given \(s(t_c)\), we have a decomposition by EMD as \(s(t_c)= \sum\nolimits_{i=1}^{n} {\mathrm{imf}}_{i}(t_c) + r(t_c)\). In practice, the values \(s(t_m)\) are not available, and hence, given imputed values \({\hat{s}}(t_m)\), we consider an estimated decomposition as
Thus, it is required to have a method to fit the data at the observed locations and predict \({\hat{s}}(t_m)\) at the missing locations, which are used for imputed values.
For this purpose, we consider the selfconsistent Eq. (1) under this framework as
Suppose that \({\hat{f}}(\cdot )\) satisfies the above equation. Then the imputed values can be obtained by \({\hat{s}}(t_m):={\hat{f}}(t_m)\), and hence, we finally obtain the decomposition in (2). In fact, the rationale to employ Eq. (3) for our missing problem can be explained as follows. Let \({\hat{f}}_{com}\) be the estimator based on the complete data and \({\hat{f}}\) be the estimator based on the observed data. For any \({\hat{f}}\) and a set of missing time points \(t_m\), it follows that, under the assumption that the missing pattern is random,
Hence, the minimization of \(\Vert s(t_m){\hat{f}}(t_m)\Vert ^2\) over \({\hat{f}}\) at the set \(t_m\) is equivalent to minimization of \(\Vert E[{\hat{f}}_{com}(t_m)s(t_o),{\hat{f}}(t_m)]{\hat{f}}(t_m)\Vert ^2\), which is obtained by the solution of (3).
Here we propose a simple and fast algorithm to implement (2) and (3) as follows. Given initial values \({\hat{s}}^{(0)}(t_m)\),

1.
Iterate, until convergence, the following alternating steps for \(\ell =1,2,\ldots ,\)

1.1.
Imputation Step: Fit \({\hat{f}}^{(\ell 1)}(t_o)\) at the observed locations \(t_o\), impute by prediction \({\hat{s}}^{(\ell )}(t_m):={\hat{f}}^{(\ell 1)}(t_m)\) at the missing locations \(t_m\) and construct a complete data \({\hat{s}}^{(\ell )}(t_c):=(s(t_o),{\hat{s}}^{(\ell )}(t_m))\).

1.2.
Decomposition Step: Apply EMD procedure to \({\hat{s}}^{(\ell )}(t_c)\) and obtain
$$\begin{aligned} {\hat{s}}^{(\ell )}(t_c)=\sum _{i=1}^{n} {\widehat{{\mathrm{imf}}}}_{i}^{(\ell )}(t_c) + {\hat{r}}^{(\ell )}(t_c). \end{aligned}$$ 
1.3.
The iteration stops if \(\frac{{\hat{s}}^{(\ell + 1)}{\hat{s}}^{(\ell )}}{{\hat{s}}^{(\ell )}} \le \delta\) for some tolerance level \(\delta > 0\).

1.1.

2.
Take the converged IMFs as the final IMFs.
We have some remarks regarding the aforementioned algorithm:

For the fitting method, we consider various nonparametric function estimation methods such as smoothing splines, kernel smoothing, and the local polynomial regression. The asymptotic results of the equivalent kernel described in Silverman (1984) support the fact that both a splinetype estimator and kernel smoother including local polynomial regression estimator can be written
$$\begin{aligned} {\hat{f}}(t)=\frac{1}{n}\sum _{i=1}^nw(t,t_i)s(t_i), \end{aligned}$$where \(w(t,t_i)\) denote weights at time point \(t_i\). In this study, we employ the smoothing splines with a smoothing parameter chosen by generalized crossvalidation.

In this study, we use the local mean values as \({\hat{s}}^{(0)}(t_m)\) for the choice of initial values.

Note that the imputation step does not depend on the dimension of s. Thus it can be easily extended to twodimensional image. The only modification required is to replace the 1dimensional smoothing method in the imputation step by a 2dimensional method such as thinplate smoothing splines.
Numerical study
Simulation study
Here we discuss results from a simulation study that was designed to assess the practical performance of the proposed method. In this simulation study, we compare the proposed method with two other methods:

1.
EMD.obs: the conventional EMD algorithm with observed data,

2.
EMD.self: the proposed EMD algorithm described in “Proposed algorithm” section, and

3.
EMD.com: the conventional EMD algorithm with imaginary complete data.
Note that EMD.com is used as a benchmark for comparison. We consider two test functions, (1) a composite sine function of the form \(S_1(t)= \sin (\pi t)+ \sin (2 \pi t) + \sin (6 \pi t) + 0.05t(t \in [0, 30])\), and (2) a chirp signal of the form \(S_2(t) = \exp (0.01t) \cos (\pi t/4)+0.002t(t \in [0, 500])\). Figure 2 displays two test functions and their components.
To evaluate how the proposed algorithm performs according to missing data percentage, we consider five different missing percentages: 10, 20, 30, 40 and 50%. In addition, we consider two cases of missing pattern: (a) The first one is missing at random where missing locations were randomly selected from inside 90% of the time domain and missing values do not exist near boundaries. (b) The second one is missing at random where missing locations were randomly selected over the entire domain including boundaries.
For each test function, missing percentage, and missing pattern, 100 datasets with sample size 2000 are generated. For each generated dataset, three methods above were applied to decompose the test function. To evaluate the decomposition results, we consider mean squared error (MSE) as a discrepancy measure
where g and \({\hat{g}}\) denote the true component and the corresponding extracted IMF.
Figures 3 and 4 show box plots of MSE values for two missing patterns. The proposed EMD.self is comparable to EMD.com when the missing data percentage is up to 50%. Even when missing exists near the boundary, EMD.self works well due to effective imputation by the proposed algorithm.
To access the property of the extracted IMFs, we compare the decomposition results for test signal \(S_1\) with 40% missing values. Figure 5 illustrates that the decomposition result of the panel (b) by the proposed EMD.self produces more accurate IMFs than the decomposition result of the panel (a) by EMD.obs. Figure 6 shows that the periodogram of signal \(S_1\), reconstruction by EMD.obs and reconstruction by EMD.self. The periodogram by EMD.self effectively detects three main frequencies of signal \(S_1\), while it is difficult to identify three main frequencies from the periodogram by EMD.obs. The proposed method can be applied to a noisy signal. From Fig. 7 of noisy signal \(S_1\) with the signaltonoise ratio (SNR) seven, the proposed EMD.self produces more accurate IMFs than EMD.obs does. Here, SNR is defined as \(S_1/ \sigma\) where \(\sigma\) is the standard deviation of noise. Table 1 lists MSE values for noisefree signal \(S_1\) in Fig. 5 and noisy signal \(S_1\) in Fig. 7 by EMD.obs and EMD.self. As one can see, MSEs of extracted IMFs by the proposed EMD.self are smaller than MSEs by EMD.obs, for both noisefree and noisy signal.
Furthermore, it is common that the missing locations occur in consecutive time points, for example, due to the malfunction of sensor to measure a signal. As for the third missing pattern, we consider the missing occurs in consecutive time points to form a block of missing values. Box plots of MSEs according to different length of block are given in Fig. 8 when missing percentage is 30%. From Fig. 8, the proposed method is comparable to EMD.com when the length of block is up to four.
Real data example
The signal in Fig. 9 shows clarinet sound playing the note Coctave0. The data are available in the website http://wiki.laptop.org/go/Sound_samples. We extracted 2048 samples between 0.32 and 0.41 s with 22,050 Hertz for the analysis. For comparison, we intensionally make 40% missing values from clarinet signal denoted by black points. Then we perform the aforementioned three methods for decomposition. As shown from Fig. 10, the conventional EMD does not work properly to decompose a signal under the presence of missing values especially around 0.38 s, while the proposed method produces almost the same decomposition result to the decomposition result of the complete signal.
Extension to image
Bidimensional EMD for twodimensional signals such as images has been proposed by some studies (Damerval et al. 2005; Nunes et al. 2005; Bhuiyan et al. 2008; Kim et al. 2012b). To construct the upper and lower envelopes of twodimensional signals, interpolation is done with the scattered sparse extrema. Therefore, missing aggravates insufficient sampling rate, and causes more obstacle to estimate candidate IMFs. Twodimensional extension is straightforward by recursively imputing the twodimensional missing values through thinplate spline and decomposing imputed image by an existing twodimensional EMD procedure.
Now we consider a test image f as, for \(x, y \in [0,1]\),
This image consists of two sinusoidal types of components and a trend component. Figure 11 shows test image f, which consists of the highfrequency component \(\sin (15\pi x)\sin (15\pi y)\) and the lowfrequency component \(\sin (7\pi x)\sin (7\pi y)\). The rightmost panel of Fig. 11 illustrates the image with 40% missing values. The white pixels represent the locations of the missing observations.
To evaluate the proposed method, we compare the decomposition results for the complete image and imputed image with local mean near missing. The first and second rows of Fig. 12 represent two components as the decomposition results for the complete image and imputed image with local mean near missing. As shown, the proposed method extracts the proper IMFs, and decomposes the highfrequency and lowfrequency components effectively as the decomposition for the complete image does. However the decomposition for the imputed image by mean does not extract the first IMF properly, which affects the subsequent decomposition results. There are lots of spots in the first IMF due to the improper imputation.
Conclusions
In this paper, we have proposed a modified empirical mode decomposition to deal with missing data problems. The proposed method is based on imputation using the selfconsistency principle. We have presented an effective algorithm for implementation of the proposed method. The empirical performance of the proposed method has been evaluated throughout various numerical experiments including both one and twodimensional settings. Results from these experiments illustrate the proposed method possesses good empirical properties.
References
Barnhart BL, Eichinger WE, Prueger JH (2012) Introducing an Ogive method for discontinuous data. Agric For Meteorol 162–163:58–62
Barnhart BL, Nandage HKW, Eichinger WE (2011) Assessing discontinuous data using ensemble empirical mode decomposition. Adv Adapt Data Anal 3:483–491
Bhuiyan SMA, Adhami RR, Khan JF (2008) Fast and adaptive bidimensional empirical mode decomposition using orderstatistics filter based envelope estimation. EURASIP J Adv Signal Process 2008:728356
Boudraa AO, Cexus JC (2007) EMDbased signal filtering. IEEE Trans Instrum Meas 56:2196–2202
Colominas MA, Schlotthauer G, Torres ME (2014) Improved complete ensemble EMD: a suitable tool for biomedical signal processing. Biomed Signal Process Control 14:19–29
Damerval C, Meignen S, Perrier V (2005) A fast algorithm for bidimensional EMD. IEEE Signal Process Lett 12:701–704
Daubechies I (1992) Ten lectures on wavelets. SIAM, Philadelphia
Daubechies I, Lu J, Wu HT (2011) Synchrosqueezed wavelet transforms: an empirical mode decompositionlike tool. Appl Comput Harmonic Anal 30:243–261
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood estimation from incomplete data via EM algorithm. J R Stat Soc B 39:1–38 (with discussion)
Diop EHS, Alexandre R, Boudraa AO (2010) Analysis of intrinsic mode functions: a PDE approach. IEEE Signal Process Lett 17:398–401
Hastie T, Stuetzle W (1989) Principal curves. J Am Stat Assoc 84:502–516
Huang NE, Shen Z, Long SR, Wu ML, Shih HH, Zheng Q, Yen NC, Tung CC, Liu HH (1998) The empirical mode decomposition and Hilbert spectrum for nonlinear and nonstationary time series analysis. Proc R Soc Lond A 454:903–995
Kim D, Kim KO, Oh HS (2012a) Extending the scope of empirical mode decomposition using smoothing. EURASIP J Adv Signal Process 2012:168
Kim D, Park M, Oh HS (2012b) Bidimensional statistical empirical mode decomposition. IEEE Signal Process Lett 19:191–194
Komaty A, Boudraa AO, Auiger B, DaréEmzivat D (2014) EMDbased filtering using similarity measure between probability density functions of IMFs. IEEE Trans Instrum Meas 63:27–34
Lee TCM, Meng XL (2007) Self consistency: a general recipe for wavelet estimation with irregularlyspaced and/or incomplete data. arXiv:0701196
Mallat S (2009) A wavelet tour of signal processing. Academic Press, New York
Nunes JC, Guyot S, Delchelle E (2005) Texture analysis based on local analysis of the bidimensional empirical mode decomposition. Mach Vis Appl 16:177–188
Park M, Kim D, Oh HS (2015) Quantilebased empirical mode decomposition: an efficient way to decompose noisy signals. IEEE Trans Instrum Meas 64:1802–1813
Priestley MB (1981) Spectral analysis and time series. Academic Press, New York
Silverman BW (1984) Spline smoothing: the equivalent variable kernel method. Ann Stat 12:898–916
Tarpey T, Flury B (1996) Selfconsistency: a fundamental concept in statistics. Stat Sci 11:229–243
Torres ME, Colominas MA, Schlotthauer G, Flandrin P (2011) A complete ensemble empirical mode decomposition with adaptive noise. In: Proceedings of 2011 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 4144–4147
Vidakovic B (1999) Statistical modeling by wavelets. Wiley, New York
Wu Z, Huang NE (2009) Ensemble empirical mode decomposition: a noise assisted data analysis method. Adv Adapt Data Anal 1:1–41
Xu Z, Huang B, Zhang F (2009) Improvement of empirical mode decomposition under low sampling rate. Signal Process 89:2296–2303
Yeh JR, Shieh JS, Huang NE (2010) Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method. Adv Adapt Data Anal 2:135–156
Authors' contributions
HO introduced the initial idea, and provided the theoretical background. DK set up methodological procedure, and designed numerical studies. Both authors contributed to write the final manuscript. Both authors read and approved the final manuscript.
Acknowledgements
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2016R1D1A1B03930463), and by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (2015R1D1A1A01056854 and 20110030811).
Competing interests
The authors declare that they have no competing interests.
Author information
Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Kim, D., Oh, HS. Empirical mode decomposition with missing values. SpringerPlus 5, 2016 (2016). https://doi.org/10.1186/s4006401636921
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s4006401636921
Keywords
 Empirical mode decomposition
 Imputation
 Missing
 Multiscale method
 Selfconsistency