1. Introduction
The change-point detection has been studied in various fields including environtology [1], climatology [2], agricultural economy [3], bioinformatics [4], and public economics [5]. In this paper, the basic and canonical normal model with multiple mean change-points [6–8] is considered as follows:
A class of classical methods is to estimate the number and locations of change-points by fitting criterion, such as AIC [9] and BIC [6, 10, 11]. However, the computational complexity of these methods is very high. Braun et al. [12] and Bai and Perron [13] employed a dynamic programming algorithm to reduce the computational cost to the order of
To reduce the computational complexity, some stepwise approaches are proposed. Since being proposed, LASSO [19] has become a very popular statistical approach. After a reparametrization
Binary segmentation (BS) algorithm is another classical stepwise technique for multiple change-point detection by combining with a CUSUM statistic [24]. Due to its low computational complexity of
Recently, by using a sliding fixed window approach, Niu and Zhang [7] proposed a very efficient (the computational complexity can even reach
Although the computational complexity of SaRa is down to
In this paper, we make two contributions. First, we show that our FSSR algorithm can make the computational complexity of the algorithm far less than
The paper is organized as follows. Our motivation is described in Section 2. In Section 3, the FSSR algorithm is introduced. The performances of FSSR, SaRa, and mSaRa are compared by a simulation study in Section 4. In Section 5, the proposed methodology is used in DNA copy number variations identifying and a practical engineering task involving electric power system, and we validate the effectiveness of our FSSR algorithm.
2. Motivation
Our motivation can be shown by Figure 1. Dividing the data into several small subsegments, we find that, in most small subsegments, the data is normal white noise with no change-point. The shape of data sequence is different only in a few subsegments which cover change-points. It is important to find these subsegments that contain change-points quickly. In addition, it is easy to pick out small subsegments that do not contain a change-point. Excluding these subsegments, the rest subsegments are likely to contain a real change-points.
[figure omitted; refer to PDF]Because two adjacent subsegments which do not contain a change-point have common mean, the difference between two CUSUM statistics of these two adjacent subsegments should be small. On the other hand, if a small subsegment covers a change-point, the difference between the CUSUM statistics of this small subsegment and the adjacent subsegment will be significant. Then we can identify subsegments with change-points through a suitable threshold. Let
3. Fast Screen and Shape Recognition Algorithm
In this section, we give a brief description of the FSSR.
3.1. FSSR Algorithm
First, for a given positive integer
Second, for each pair of two adjacent subsegments, the local CUSUM statistic is defined as follows.
Third, based on the front screen, there is no change-point in the most subsegments, then we only need to search change-points in each selected subsegment
A flow chart of FSSR algorithm is given in Figure 2.
[figure omitted; refer to PDF]3.2. Robustness
The good performance of CUSUM statistic is based on the normal assumption of error. In practice, the data does not necessarily obey a normal distribution. Xiao et al. [34] used the Quantile normalization (QN) on the original intensities to seek the requirement of normality. Then, two robust processes embedded into our FSSR algorithm.
First, QN is used to make the data close to follow a normal distribution at each subsegment. In the procedure of FSSR, we rank the data in each subsegment. Then a sample with the same size as each subsegment from the standard normal distribution
Second, a single-peak recognition is used to enhance the robustness of the local maximizer. In most algorithms (such as BS, WBS, SaRa, and mSaRa), local maximum principle and threshold are used to confirm the change-point. In practice, the choice of threshold is very sensitive and has great influence on the result. From Figure 3, we can see that the local CUSUM statistic indicates a single-peak at each change-point. In this paper, to further improve the robustness of change-point detection, we define a simple single-peak principle. For any local maximum point
3.3. Computational Complexity
The time complexity in the FSSR is twofold. First, in the scan step, it is only needed to calculate
4. Simulation Study
Many papers show that SaRa and mSaRa are better than those BS-type methods, such as Niu and Zhang [7], Xiao et al. [34], and Song et al. [36]. Then, in this section, the performance of FSSR against SaRa and mSaRa should be useful to examine.
4.1. An Example
Before conducting large-scale simulation experiments, we first demonstrate the implementation process and effect of our FSSR algorithm through an example. We consider an example with
From this example, we can see that our FSSR algorithm can quickly and accurately find the change-points. In order to show more comparisons, we consider the normal error case in Section 4.3 and
4.2. Simulation Design
Before presenting the detailed comparison, we give the simulation design.
First, the generation of basic data comes from the standard normal distribution and a student
Second, the jump size of change-point
Third, we consider four sample sizes (
4.3. Performance on Normal Data
In this case, because the error is normal, the QN process is not embed into our algorithm. From Table 1, there are some observations as follows.
Table 1
Distribution of
| | |||||||||||
| | |||||||||||
| Sample Size | Method | | -2 | -1 | 0 | 1 | 2 | | BIC | Time | |
| | |||||||||||
| n=500 | N=5 | FSSR | 6 | 20 | 48 | 23 | 3 | 0 | 0 | 0.0662× | 0.0113 |
| SaRa | 0 | 1 | 5 | 14 | 22 | 32 | 26 | 0.1495 | 0.0231 | ||
| mSaRa | 48 | 30 | 11 | 10 | 0 | 1 | 0 | 0.1644 | 0.3802 | ||
| | |||||||||||
| n=3000 | N=10 | FSSR | 1 | 5 | 37 | 45 | 10 | 2 | 0 | 0.1445 | 0.0583 |
| SaRa | 0 | 0 | 2 | 13 | 7 | 2 | 76 | 0.6343 | 0.1339 | ||
| mSaRa | 31 | 8 | 11 | 9 | 12 | 8 | 21 | 0.6626 | 1.3432 | ||
| N=15 | FSSR | 20 | 22 | 29 | 24 | 5 | 0 | 0 | 0.2644 | 0.0593 | |
| SaRa | 5 | 3 | 9 | 15 | 6 | 8 | 54 | 0.9871 | 0.1101 | ||
| mSaRa | 88 | 6 | 2 | 1 | 1 | 0 | 2 | 0.8995 | 1.0129 | ||
| | |||||||||||
| n=5000 | N=10 | FSSR | 0 | 1 | 19 | 67 | 10 | 3 | 0 | 0.1357 | 0.0794 |
| SaRa | 0 | 0 | 1 | 10 | 4 | 2 | 83 | 0.9874 | 0.1869 | ||
| mSaRa | 11 | 2 | 5 | 10 | 7 | 5 | 60 | 0.8727 | 2.3090 | ||
| N=20 | FSSR | 15 | 22 | 28 | 27 | 6 | 1 | 1 | 0.3125 | 0.1280 | |
| SaRa | 7 | 3 | 4 | 24 | 6 | 6 | 50 | 1.5053 | 0.2049 | ||
| mSaRa | 86 | 1 | 2 | 2 | 1 | 0 | 8 | 1.4307 | 2.0621 | ||
| N=30 | FSSR | 44 | 6 | 9 | 16 | 7 | 9 | 9 | 0.5242 | 0.1317 | |
| SaRa | 30 | 8 | 12 | 20 | 10 | 5 | 15 | 3.1588 | 0.1973 | ||
| mSaRa | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 1.9524 | 1.4564 | ||
| | |||||||||||
| n=8000 | N=10 | FSSR | 0 | 0 | 25 | 55 | 17 | 3 | 0 | 0.1759 | 0.0920 |
| SaRa | 0 | 0 | 6 | 9 | 5 | 6 | 74 | 1.9034 | 0.3024 | ||
| mSaRa | 2 | 1 | 1 | 7 | 5 | 3 | 81 | 1.0359 | 2.3814 | ||
| N=20 | FSSR | 13 | 11 | 28 | 37 | 10 | 1 | 0 | 0.3645 | 0.2373 | |
| SaRa | 7 | 4 | 3 | 18 | 6 | 4 | 58 | 2.2859 | 0.4487 | ||
| mSaRa | 64 | 8 | 7 | 3 | 2 | 0 | 16 | 1.8060 | 3.2446 | ||
| N=30 | FSSR | 12 | 4 | 19 | 11 | 12 | 7 | 35 | 0.5457 | 0.2480 | |
| SaRa | 17 | 3 | 11 | 32 | 5 | 3 | 29 | 3.9068 | 0.3850 | ||
| mSaRa | 95 | 0 | 1 | 0 | 1 | 0 | 3 | 2.7051 | 2.5822 | ||
| N=50 | FSSR | 95 | 3 | 1 | 0 | 1 | 0 | 0 | 1.0258 | 0.3051 | |
| SaRa | 51 | 12 | 14 | 14 | 2 | 4 | 3 | 6.9394 | 0.3918 | ||
| mSaRa | 99 | 1 | 0 | 0 | 0 | 0 | 0 | 3.3326 | 3.0239 | ||
4.4. Robustness on t-Distribution
To investigate the effect of our FSSR on the thick tail errors, we set the errors to obey the
Besides the advantages similar to the normal case, we get some new discoveries in Table 2.
Table 2
Distribution of
| | |||||||||||
| | |||||||||||
| Sample Size | Method | | -2 | -1 | 0 | 1 | 2 | | BIC | Time | |
| | |||||||||||
| n=500 | N=5 | FSSR | 23 | 21 | 27 | 20 | 8 | 0 | 1 | 0.3016 | 0.0266 |
| SaRa | 9 | 11 | 37 | 21 | 18 | 2 | 2 | 0.4719 | 0.0233 | ||
| mSaRa | 39 | 23 | 26 | 11 | 1 | 0 | 0 | 0.4054 | 0.2920 | ||
| | |||||||||||
| n=3000 | N=10 | FSSR | 18 | 32 | 19 | 24 | 5 | 0 | 2 | 1.6737 | 0.1009 |
| SaRa | 37 | 9 | 5 | 12 | 3 | 12 | 22 | 2.7135 | 0.1253 | ||
| mSaRa | 47 | 11 | 8 | 8 | 8 | 3 | 15 | 2.0120 | 0.6151 | ||
| N=15 | FSSR | 55 | 28 | 12 | 2 | 2 | 1 | 0 | 1.6977 | 0.1240 | |
| SaRa | 68 | 3 | 9 | 5 | 2 | 1 | 12 | 3.1348 | 0.1283 | ||
| mSaRa | 80 | 6 | 6 | 3 | 4 | 1 | 0 | 2.1958 | 0.6368 | ||
| | |||||||||||
| n=5000 | N=10 | FSSR | 18 | 19 | 30 | 20 | 11 | 2 | 0 | 2.7748 | 0.1771 |
| SaRa | 35 | 8 | 6 | 5 | 5 | 4 | 37 | 4.3186 | 0.2001 | ||
| mSaRa | 38 | 4 | 8 | 10 | 6 | 4 | 30 | 3.3002 | 0.8318 | ||
| N=20 | FSSR | 66 | 23 | 9 | 7 | 1 | 0 | 0 | 2.7637 | 0.1921 | |
| SaRa | 77 | 4 | 4 | 1 | 6 | 2 | 6 | 5.0548 | 0.1948 | ||
| mSaRa | 91 | 2 | 4 | 0 | 2 | 0 | 1 | 3.5509 | 0.8705 | ||
| N=30 | FSSR | 99 | 1 | 0 | 0 | 0 | 0 | 0 | 2.8936 | 0.2050 | |
| SaRa | 90 | 0 | 4 | 1 | 0 | 2 | 3 | 6.3680 | 0.1879 | ||
| mSaRa | 99 | 1 | 0 | 0 | 0 | 0 | 0 | 4.2754 | 0.8828 | ||
| | |||||||||||
| n=8000 | N=10 | FSSR | 10 | 15 | 37 | 27 | 9 | 2 | 0 | 4.4204 | 0.2831 |
| SaRa | 17 | 5 | 3 | 5 | 6 | 5 | 59 | 6.8527 | 0.3153 | ||
| mSaRa | 15 | 7 | 6 | 4 | 4 | 9 | 55 | 4.9348 | 1.3940 | ||
| N=20 | FSSR | 30 | 30 | 28 | 10 | 2 | 0 | 0 | 4.3621 | 0.3262 | |
| SaRa | 61 | 5 | 5 | 4 | 4 | 5 | 16 | 8.6200 | 0.3047 | ||
| mSaRa | 76 | 4 | 4 | 5 | 2 | 1 | 8 | 5.6735 | 1.2837 | ||
| N=30 | FSSR | 93 | 5 | 2 | 0 | 0 | 0 | 0 | 4.4635 | 0.3484 | |
| SaRa | 90 | 2 | 4 | 1 | 1 | 0 | 2 | 10.5598 | 0.3003 | ||
| mSaRa | 95 | 2 | 0 | 0 | 2 | 0 | 1 | 6.6622 | 1.1897 | ||
| N=50 | FSSR | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 4.7080 | 0.3608 | |
| SaRa | 98 | 1 | 0 | 1 | 0 | 0 | 0 | 12.0999 | 0.3109 | ||
| mSaRa | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 7.3868 | 1.2347 | ||
5. Real Data
5.1. Application to Coriel Data
Several methods based on change-point (e.g., [7, 26]) have been widely studied and applied in copy number variation (CNV) detection.
Generally, as a new source of genetic variation, copy number variation (CNV) plays an important role in phenotypic diversity and evolution. Moreover, many studies have shown that CNV is related to the pathogenicity mechanism of some diseases, including cancer, schizophrenia, and so on [37–40]. Compared with a reference genome assembly [41], CNV usually refers to the deletion or amplification of a region of DNA sequences. Recently, with the significant advances in DNA array technology to detect DNA CNV, various techniques and platforms have been developed for analyzing DNA copy number, including array comparative genomic hybridization (aCGH), single nucleotide polymorphism (SNP) genotyping platforms, and next-generation sequencing, which provided lots of data. The goal of analyzing of DNA copy number data is to divide the whole genome into segments where copy number vary between contiguous segments and then quantify for each segment. Hence, the target of change-point based methods is to identify the exact locations of copy number changes.
For demonstrating the high efficiency and precision of FSSR, we use the FSSR to analyze the Coriel data set (Download at http://www.nature.com/ng/journal/v29/n3/suppinfo/ng754S1.html), which is firstly studied by Snijders et al. [42]. The well-known data set has been widely used in evaluating CNV detection algorithms ([7, 11, 20, 23, 26, 43, 44] and among others). The data sets consist of a logarithmic ratio of normalized intensities from the disease versus control samples, which are indexed by the physical location of the probes on the genome. The goal is to identify segments of concentrated high or low log ratios. The experiment on 15 fibroblast cell lines makes up the data sets. Each fibroblast cell line contains measurements for 2700 BACs (bacterial artificial chromosome) spotted in triplicate. There are 15 chromosomes with partial alterations and 8 whole chromosomal alterations. All of these alterations but one (Chromosome 15 on GM07801) were confirmed by spectral karyotyping. As shown in Figure 5, we apply FSSR to four chromosomes. They are Chromosome 1 of GM13330, Chromosome 7 of GM07081, Chromosome 11 of GM05296, and Chromosome 14 of GM01750. In the diagram, the points are normalized log ratios, and the dashed lines are locations of change-points detected by our proposed method. As the results show, FSSR identifies all. The results of SaRa or some other methods applied in this real data can consult references ([7, 44] and among others).
[figure omitted; refer to PDF]5.2. Application to Electric Power System
In this section, we apply the proposed FSSR approach in a real industry application to the electric power system. In the data analysis, the FSSR algorithm can be seen to overperform the SaRa and mSaRa algorithms.
In recent years, the electric distribution network (DN) faces a new challenge to the integration of distributed generations (DGs), after access of distributed scenario energy in the power system. A reasonable and appropriate plan needs to be considered to secure DN for future years. However, in order to save cost, few typical scenarios, which are used to guide in future years, are required to extract from existing massive scenarios. The power load data in the electric power system is typically time series, so the typical scenario reduction can be treated as a problem of detecting change-points.
The real data are collected from the 220kv grade DN of Sichuan province in China. Because the real data can only store for three months in practice, so we intercept data from April 20, 2016, 0:04:00 am, to May 31, 2016, 23:59:00 pm. An observation is recorded every 5 minutes; therefore the sample size is
We apply FSSR, SaRa, and mSaRa algorithms to the time series of the power data on two transformers, respectively. The results of active power and reactive power are presented in Figures 6 and 7, respectively. In Figures 6 and 7, the vertical line represents the location of change-point given by the algorithms.
[figure omitted; refer to PDF] [figure omitted; refer to PDF]Tables 3 and 4 show the fitting effect, number of change-points selected, and running time of three algorithms. The BIC value of FSSR is lowest and the number of change-points given by FSSR is smallest, while the running time of FSSR is almost as short as SaRa and is obviously shorter than mSaRa.
Table 3
The performance of FSSR, SaRa, and mSaRa on transformer 1.
| BIC | Number of Change-points | Time | |
|---|---|---|---|
| FSSR | 1.5014 | 44 | 0.3148 |
| SaRa | 1.6664 | 138 | 0.3055 |
| mSaRa | 1.5255 | 119 | 5.2266 |
Table 4
The performance of FSSR, SaRa, and mSaRa the transformer 2.
| BIC | Number of Change-points | Time | |
|---|---|---|---|
| FSSR | 7.0256 | 42 | 0.2950 |
| SaRa | 7.2773 | 148 | 0.2214 |
| mSaRa | 7.7619 | 115 | 7.2700 |
6. Concluding Remarks
For the multiple change-point detection problems, an optimal method is mainly evaluated with two aspects: the detecting criterion of change-point and the design of algorithm.
For the criterion of detecting change-points, most of the existing methods are based on the maximization criterion of global CUSUM statistic (such as BS and CBS) or local CUSUM statistic (such as SaRa and mSaRa). From Figure 3, we note that a change-point not only is the local maximum but also should be the local single-peak of the CUSUM statistic distribution. Therefore our FSSR algorithm based on single-peak recognition is more robust than the traditional one by the maximization of the CUSUM statistic. In addition, we use QN on raw data to further enhance robustness.
During the algorithm design, a fast and efficient screening process is considered. We can select the approximate subsegments including change-points at very low computational cost.
Finally, the proposed FSSR has a good performance compared to the comparable existing algorithms according to our simulation and practical application results.
Conflicts of Interest
The author declares that there are no conflicts of interest regarding the publication of this paper.
Authors’ Contributions
Youbo Liu gives the practical motivation on the change-point detecting and offers the real data of application in electric power system. Moreover, he provides many good suggestions to revise the manuscript.
[1] D. Jarušková, "Change-point detection methods to environmental data," Environmetrics, vol. 8 no. 5, pp. 469-483, DOI: 10.1002/(SICI)1099-095X(199709/10)8:5<469::AID-ENV265>3.0.CO;2-J, 1997.
[2] Q. Lu, R. Lund, T. C. Lee, "An MDL approach to the climate segmentation problem," The Annals of Applied Statistics, vol. 4 no. 1, pp. 299-319, DOI: 10.1214/09-AOAS289, 2010.
[3] H. J. Jin, D. Miljkovic, "An analysis of multiple structural breaks in US relative farm prices," Applied Economics, vol. 42 no. 25, pp. 3253-3265, DOI: 10.1080/00036840802600368, 2010.
[4] F. Caron, A. Doucet, R. Gottardo, "On-line changepoint detection and parameter estimation with application to genomic data," Statistics and Computing, vol. 22 no. 2, pp. 579-595, DOI: 10.1007/s11222-011-9248-x, 2012.
[5] G. B. Pezzatti, T. Zumbrunnen, M. Bürgi, P. Ambrosetti, M. Conedera, "Fire regime shifts as a consequence of fire policy and socio-economic development: An analysis based on the change point approach," Forest Policy and Economics, vol. 29,DOI: 10.1016/j.forpol.2011.07.002, 2013.
[6] Y.-C. Yao, S. T. Au, "Least-squares estimation of a step function," Sankhya: The Indian Journal of Statistics, Series A, vol. 51 no. 3, pp. 370-381, 1989.
[7] Y. S. Niu, H. Zhang, "The screening and ranking algorithm to detect DNA copy number variations," The Annals of Applied Statistics, vol. 6 no. 3, pp. 1306-1326, DOI: 10.1214/12-AOAS539, 2012.
[8] P. Fryzlewicz, "Wild binary segmentation for multiple change-point detection," The Annals of Statistics, vol. 42 no. 6, pp. 2243-2281, DOI: 10.1214/14-AOS1245, 2014.
[9] Y. Ninomiya, "Information criterion for Gaussian change-point model," Statistics & Probability Letters, vol. 72 no. 3, pp. 237-247, DOI: 10.1016/j.spl.2004.10.037, 2005.
[10] Y. C. Yao, "Estimating the number of change-points via schwarzs criterion," Statistics & Probability Letters, vol. 6 no. 3, pp. 181-189, 1988.
[11] N. R. Zhang, D. O. Siegmund, "A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data," Biometrics, vol. 63 no. 1, pp. 22-32, DOI: 10.1111/j.1541-0420.2006.00662.x, 2007.
[12] J. V. Braun, R. K. Braun, "Multiple changepoint fitting via quasilikelihood, with application to DNA sequence segmentation," Biometrika, vol. 87 no. 2, pp. 301-314, DOI: 10.1093/biomet/87.2.301, 2000.
[13] J. Bai, P. Perron, "Computation and analysis of multiple structural change models," Journal of Applied Econometrics, vol. 18 no. 1,DOI: 10.1002/jae.659, 2003.
[14] B. Jackson, J. D. Scargle, D. Barnes, S. Arabhi, A. Alt, P. Gioumousis, E. Gwin, P. Sangtrakulcharoen, L. Tan, T. T. Tsai, "An algorithm for optimal partitioning of data on an interval," IEEE Signal Processing Letters, vol. 12 no. 2, pp. 105-108, DOI: 10.1109/LSP.2001.838216, 2005.
[15] J. Antoch, D. Jaruskova, "Testing for multiple change points," Computational Statistics, vol. 28 no. 5, pp. 2161-2183, DOI: 10.1007/s00180-013-0401-1, 2013.
[16] R. Killick, I. A. Eckley, "Changepoint: An R package for changepoint analysis," Journal of Statistical Software, vol. 58 no. 3, 2014.
[17] R. Maidstone, T. Hocking, G. Rigaill, P. Fearnhead, "On optimal multiple changepoint algorithms for large data," Statistics and Computing, vol. 27 no. 2, pp. 519-533, DOI: 10.1007/s11222-016-9636-3, 2017.
[18] M. Maciak, I. Mizera, "Regularization techniques in joinpoint regression," Statistical Papers, vol. 57 no. 4, pp. 939-955, DOI: 10.1007/s00362-016-0823-2, 2016.
[19] R. Tibshirani, "Regression shrinkage and selection via the lasso," Journal of the Royal Statistical Society Series B, vol. 58 no. 1, pp. 267-288, 1996.
[20] T. Huang, B. Wu, P. Lizardi, H. Zhao, "Detection of DNA copy number alterations using penalized least squares regression," Bioinformatics, vol. 21 no. 20, pp. 3811-3817, DOI: 10.1093/bioinformatics/bti646, 2005.
[21] R. Tibshirani, P. Wang, "Spatial smoothing and hot spot detection for CGH data using the fused lasso," Biostatistics, vol. 9 no. 1, pp. 18-29, DOI: 10.1093/biostatistics/kxm013, 2008.
[22] J. Shen, C. M. Gallagher, Q. Lu, "Detection of multiple undocumented change-points using adaptive Lasso," Journal of Applied Statistics, vol. 41 no. 6, pp. 1161-1173, DOI: 10.1080/02664763.2013.862220, 2014.
[23] Q. Li, L. Wang, "Robust change point detection method via adaptive LAD-LASSO," Statistical Papers, vol. 1, 2017.
[24] E. S. Venkatraman, Consistency Results in Multiple Change-Point Situations, [Ph.D. thesis], 1992.
[25] J. Chen, A. K. Gupta, "Statistical inference on covariance change points in Gaussian model," Statistics. A Journal of Theoretical and Applied Statistics, vol. 38 no. 1, pp. 17-28, DOI: 10.1080/0233188032000158817, 2004.
[26] A. B. Olshen, E. S. Venkatraman, R. Lucito, M. Wigler, "Circular binary segmentation for the analysis of array-based DNA copy number data," Biostatistics, vol. 5 no. 4, pp. 557-572, DOI: 10.1093/biostatistics/kxh008, 2004.
[27] W. R. Lai, M. D. Johnson, R. Kucherlapati, P. J. Park, "Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data," Bioinformatics, vol. 21 no. 19, pp. 3763-3770, DOI: 10.1093/bioinformatics/bti611, 2005.
[28] A. Batsidis, "Robustness of the likelihood ratio test for detection and estimation of a mean change point in a sequence of elliptically contoured observations," Statistics, vol. 44 no. 1, pp. 17-24, DOI: 10.1080/02331880902758029, 2010.
[29] H. Cho, P. Fryzlewicz, "Multiple-change-point detection for high dimensional time series via sparsified binary segmentation," Journal of the Royal Statistical Society: Series B, vol. 77 no. 2, pp. 475-507, DOI: 10.1111/rssb.12079, 2015.
[30] N. Hao, Y. S. Niu, H. Zhang, "Multiple change-point detection via a screening and ranking algorithm," Statistica Sinica, vol. 23 no. 4, pp. 1553-1572, 2013.
[31] Z. Chen, Y. Hu, "Cumulative sum estimator for change-point in panel data," Statistical Papers, vol. 58 no. 3, pp. 707-728, DOI: 10.1007/s00362-015-0722-y, 2017.
[32] J. Cabrieto, F. Tuerlinckx, P. Kuppens, F. H. Wilhelm, M. Liedlgruber, E. Ceulemans, "Capturing correlation changes by applying kernel change point detection on the running correlations," Information Sciences, vol. 447, pp. 117-139, DOI: 10.1016/j.ins.2018.03.010, 2018.
[33] C.-S. J. Chu, "Time series segmentation: A sliding window approach," Information Sciences, vol. 85 no. 1-3, pp. 147-173, DOI: 10.1016/0020-0255(95)00021-G, 1995.
[34] F. Xiao, X. Min, H. Zhang, "Modified screening and ranking algorithm for copy number variation detection," Bioinformatics, vol. 31 no. 9, pp. 1341-1348, DOI: 10.1093/bioinformatics/btu850, 2015.
[35] C. Y. Yau, Z. Zhao, "Inference for multiple change points in time series via likelihood ratio scan statistics," Journal of the Royal Statistical Society Series B, vol. 78 no. 4, pp. 895-916, DOI: 10.1111/rssb.12139, 2016.
[36] C. Song, X. Min, H. Zhang, "The screening and ranking algorithm for change-points detection in multiple samples," The Annals of Applied Statistics, vol. 10 no. 4, pp. 2102-2129, DOI: 10.1214/16-AOAS966, 2016.
[37] S. J. Diskin, C. Hou, J. T. Glessner, E. F. Attiyeh, M. Laudenslager, K. Bosse, K. Cole, Y. P. Mossé, A. Wood, J. E. Lynch, K. Pecor, M. Diamond, C. Winter, K. Wang, C. Kim, E. A. Geiger, P. W. McGrady, A. I. F. Blakemore, W. B. London, T. H. Shaikh, J. Bradfield, S. F. A. Grant, H. Li, M. Devoto, E. R. Rappaport, H. Hakonarson, J. M. Maris, "Copy number variation at 1q21.1 associated with neuroblastoma," Nature, vol. 459 no. 7249, pp. 987-991, DOI: 10.1038/nature08035, 2009.
[38] G. Kirov, "The role of copy number variation in schizophrenia," Expert Review of Neurotherapeutics, vol. 10 no. 1, pp. 25-32, DOI: 10.1586/ern.09.133, 2010.
[39] P. Ibáñez, A.-M. Bonnet, B. Débarges, E. Lohmann, F. Tison, P. Pollak, Y. Agid, A. Dürr, P. A. Brice, "Causal relation between α -synuclein gene duplication and familial Parkinson's disease," The Lancet, vol. 364 no. 9440, pp. 1169-1171, DOI: 10.1016/s0140-6736(04)17104-3, 2004.
[40] J. A. Lee, C. M. B. Carvalho, J. R. Lupski, "A DNA Replication Mechanism for Generating Nonrecurrent Rearrangements Associated with Genomic Disorders," Cell, vol. 131 no. 7, pp. 1235-1247, DOI: 10.1016/j.cell.2007.11.037, 2007.
[41] R. Redon, S. Ishikawa, K. R. Fitch, L. Feuk, G. H. Perry, T. D. Andrews, H. Fiegler, M. H. Shapero, A. R. Carson, W. Chen, E. K. Cho, S. Dallaire, J. L. Freeman, J. R. González, M. Gratacòs, J. Huang, D. Kalaitzopoulos, D. Komura, J. R. MacDonald, C. R. Marshall, R. Mei, L. Montgomery, K. Nishimura, K. Okamura, F. Shen, M. J. Somerville, J. Tchinda, A. Valsesia, C. Woodwark, F. Yang, J. Zhang, T. Zerjal, J. Zhang, L. Armengol, D. F. Conrad, X. Estivill, C. Tyler-Smith, N. P. Carter, H. Aburatani, C. Lee, K. W. Jones, S. W. Scherer, M. E. Hurles, "Global variation in copy number in the human genome," Nature, vol. 444 no. 7118, pp. 444-454, DOI: 10.1038/nature05329, 2006.
[42] A. M. Snijders, N. Nowak, R. Segraves, S. Blackwood, N. Brown, J. Conroy, G. Hamilton, A. K. Hindle, B. Huey, K. Kimura, S. Law, K. Myambo, J. Palmer, B. Ylstra, J. P. Yue, J. W. Gray, A. N. Jain, D. Pinkel, D. G. Albertson, "Assembly of microarrays for genome-wide measurement of DNA copy number," Nature Genetics, vol. 29 no. 3, pp. 263-264, DOI: 10.1038/ng754, 2001.
[43] J. Fridlyand, A. M. Snijders, D. Pinkel, D. G. Albertson, A. N. Jain, "Hidden Markov models approach to the analysis of array CGH data," Journal of Multivariate Analysis, vol. 90 no. 1, pp. 132-153, DOI: 10.1016/j.jmva.2004.02.008, 2004.
[44] X.-L. Yin, J. Li, "Detecting copy number variations from array cgh data based on a conditional random field model," Journal of Bioinformatics and Computational Biology, vol. 8 no. 2, pp. 295-314, DOI: 10.1142/S021972001000480X, 2010.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2018 Dan Zhuang and Youbo Liu. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
A Fast Screen and Shape Recognition (FSSR) algorithm is proposed with complexity down to
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer






