Abstract:Symbolization is an important method for time series analysis, but choosing appropriate symbolization strategy is very difficult. Finite statistic complexity (FSC) can calculate the information quantity contained in the symbol time series, so it is evaluation criterion of symbolizing process. In this paper, several symbolization methods are analyzed including static transformation method, dynamic method, wavelet space method, etc. Eight time series are transformed into the symbol series by different methods and the FSC of all the symbol series are compared from several aspects. These time series which come from different domains are nonlinear and nonstationary. Some meaningful empirical conclusions are thus drawn. All of the analyses imply that the dynamic transformation is the best, and then the integrated one and wavelet space one. Unexpectedly, the static transformation is the most commonly used but the worst.
[1] Daw C S, Finney C E A, Tracy E R. A Review of Symbolic Analysis of Experimental Data. Review of Scientific Instruments, 2003, 74(2): 915930 [2] Kurths J, Schwarz U, Witt A,et al. Measures of Complexity in Signal Analysis // Proc of the AIP Conference on Chaotic, Fractal, and Nonlinear Signal Processing. Woodbury, USA, 1996: 3354 [3] Shi Hong, Shen Yi, Liu Zhiyan, et al. Research on Main Issues in Rough Sets and Its Application. Computer Engineering, 2003, 29(3): 13 (in Chinese) (石 红, 沈 毅, 刘志言,等.关于粗糙集理论及应用问题的研究.计算机工程, 2003, 29(3): 13) [4] Tang X Z, Tracy E R, Boozer A D, et al. Symbol Sequence Statistics in Noisy Chaotic Signal Reconstruction. Physical Review E, 1995, 51(5): 38713889 [5] Shalizi C R, Crutchfield J P. Computational Mechanics: Pattern and Prediction, Structure and Simplicity. Journal of Statistical Physics, 2001, 104(3/4): 817879 [6] Lin J, Keogh E, Loncell S, et al. A Symbolic Representation of Time Series, with Implications for Streaming Algorithms // Proc of the 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery. San Diego, USA, 2003: 211 [7] Ray A. Symbolic Dynamic Analysis of Complex Systems for Anomaly Detection. Signal Processing, 2004, 84(7): 11151130 [8] Chin S C. Real Time Anomaly Detection in Complex Dynamic Systems. Ph.D Dissertation. Park, USA: The Pennsylvania State University. Depatment of Electrical Engineering, 2004 [9] Crutchfield J P. The Calculi of Emergence: Computation, Dynamics and Induction. Physica D, 1994, 75(1/2/3): 1154 [10] Crutchfield J P, Young K. Inferring Statistical Complexity. Physical Review Letters, 1989, 63(2): 105108 [11] Perry N, Binder P M. Finite Statistical Complexity for Sofic Systems. Physical Review E, 1999, 60(1): 459463 [12] Hyndman R J. Time Series Data Library [DB/OL]. [20060406]. http://wwwpersonal.buseco.monash.edu.au/~hyndman/TSDL [13] Weigend A. The Santa Fe Time Series Competition Data [DB/OL]. [20060406]. http://wwwpsych.stanford.edu/~andreas/ TimeSeries/SantaFe.html [14] Andrzejak G, Lehnertz K, Rieke C, et al. EEG Time Series Download Page [DB/OL]. [20060406]. http://www.meb.unibonn.de/epileptologie/science/physik/eegdata.html