Mining precise cause and effect rules in large time series data of socio-economic indicators

Discovery of cause–effect relationships, particularly in large databases of time-series is challenging because of continuous data of different characteristics and complex lagged relationships. In this paper, we have proposed a novel approach, to extract cause–effect relationships in large time series data set of socioeconomic indicators. The method enhances the scope of relationship discovery to cause–effect relationships by identifying multiple causal structures such as binary, transitive, many to one and cyclic. We use temporal association and temporal odds ratio to exclude noncausal association and to ensure the high reliability of discovered causal rules. We assess the method with both synthetic and real-world datasets. Our proposed method will help to build quantitative models to analyze socioeconomic processes by generating a precise cause–effect relationship between different economic indicators. The outcome shows that the proposed method can effectively discover existing causality structure in large time series databases.

between socioeconomic processes is challenging due to the presence of various complex dependencies in the data. This dependency among the various parameters has enabled us to identify relationships among different domain parameters in time series data (Madsen 2007;Geweke 1984). The cause-effect relationship for time series prediction is a step towards extracting the various existing causal relations between different domain, such as employment, education, agriculture and rural development etc. Causal discovery has been used in various fields with great success as bioinformatics (Needham et al. 2007), biology (Shipley 2002), earth sciences, etc. to identify protein interactions (Sachs et al. 2005;Chen et al. 2010), gene regulatory networks (Pinna et al. 2010;Friedman et al. 2007) and to study atmospheric teleconnections (Chu et al. 2005). It has also emerged in economics and social sciences (Spirtes et al. 2000;Neapolitan 2004) such as to improve the economic development (Easterly and Levine 2003) and growth (Asafu-Adjaye 2000) of a country and to study the impact of climate change Deng and Ebert-Uphoff 2014). Before describing the proposed method to extract various causal rules, we explain the following example ( Fig. 1) to show the motivation of our research.
Suppose we have set of indicators such as exercise, weight, diseases, calcium, alcohol, and bone growth etc. Various causal relationships can exists among them. An indicator may affect other instantly or after some time. For example, if a person takes alcohol he may feel a lack of energy (lethargy) instantly or after some time (Fig. 1a). If he takes alcohol frequently, the changes can be observed and it can be concluded that alcohol is one of the causes behind tiredness. We could identify the time between alcohol was taken and occurrence of lethargy and can also identify the amount of alcohol dose tends to cause the lethargy. More relationship like transitive can be analyzed between set of indicators (shown in Fig. 1), such as lack of exercise increases weight, which increases the chance of diseases (Fig. 1b, c). Many to one, shows the relationship such as if a person is taking the proper dose of calcium and vitamin D, it will help in bone growth i.e. bone growth requires both calcium and vitamin D. Figure 1d describes the cyclic relationship mean properties affecting each other in a cyclic manner, for example, lethargy increases weight which in turn also increases lethargy. These extracted relationships are referred as binary, transitive, many to one and cyclic respectively. of the time, two or more parameters may enhance the strength of effects. Even when individual parameter does not cause more effect, together they may do. We noticed that discovering causal structures in observational data only is insufficient. So, the discovered relationships have to be verified with time series data and controlled experiments. Still, it is acceptable to remove noncausal relationships discovered from data. Causeeffect relationship discovery is to find a brief list of rules that are probably causal. These causal rules provide a set of statistically decisive relationships which are acceptable to embed cause-effect relationships. This differentiates between the causal and normal rule discovery.
Association rule mining (Agrawal et al. 1993) has an efficient and versatile means for discovering relationships in data (Han et al. 2011). Authors (Jin et al. 2012;Li et al. 2013;Ma et al. 2016) use the advantage of association rule mining for causality discoveries. Jin et al. (2012) discovers the causal relationships with multiple cause variables in large databases of binary variables and excludes non-causal associations. Researchers (Li et al. 2013;Ma et al. 2016) discover potential causal rules using cohort study (Euser et al. 2009;Fleiss et al. 2003) and capable to generate combine causal rules in observational data. Author (Li et al. 2015) presented four approaches PC, HITON-PC, CR-PA and CR-CS for causality detection around a given target variable and discuss their efficiency. The PC and HITON-PC methods are based on Bayesian network learning theory and use conditional independence tests to eliminate non persistent associations, CR-PA use association rule and partial association and CR-CS uses the concept of a cohort study.
These proposed methods are able to find single and combined causal rules effectively in small and large database with low and high dimensional data, but they are restricted to discrete data and unable to extract the cyclic relationships and strength of relationships, although causality can be observed in various hidden relationships. However, statistically predictable associations do not illustrate cause-effect relationships, although mostly causality is usually observed as an association in the dataset. Therefore, in this paper, initially we use the concept of temporal association (Ji et al. 2011) and odds ratio (Fleiss et al. 2003) to extract binary causal relationship and further other relationships are extracted.
To the best of our knowledge, there is no previous work on discovering cyclic and transitive causal relationships with properties as the rate of change of parameters and their relationship strength in time series data. We should observe that discovering causal relationships in observational and constraint-based data only are insufficient.
The contributions of this work are listed in the following: • First, we present a method to extract cause-effect relationships like binary, transitive, many to one and cyclic in large time series database. • Second, we define the concept of temporal association lag rule and temporal odds ratio to extract cause-effect relationships between various parameters. • Third, we are generating more specific cause-effect rules like binary, transitive, many to one and cyclic with their relationship strength which is useful for strategic decisions.
Our proposed method is useful to extract time lagged relationships across different field indicators that can be used to understand the lagged response of one indicator on another and various relationships such as binary, cyclic, many to one and transitive. We show the utility of our approach by extracting some relationships between different field indicators. For example, the rule (Cereal production, D, 2 %, 2) ⇒ (Agricultural raw materials exports, 3 %), indicates a causal rule that cereal production is directly related to agricultural raw materials exports and if it is changed by 2 %, it affects the export of agricultural raw material by 3 % after 2 years. The proposed approach can be broadly applied to other problems in the temporal domain to extract various time lagged relationships.

Preliminaries
In this section, first we define the terms used in this paper. Then we define the concepts for describing proposed cause-effect relationship extraction method. Finally, we describe the formal definition of various cause-effect relationships, discovering such causal relationships is the aim of this paper. This paper deals with continuous parameters. Since all the parameters are having different ranges and we are interested in finding relationships. So instead of taking the absolute value of parameters, the rate of change is used to extract the effect of change of one parameter on another parameter, each time series value is categorized as a positive rate of change (U), a negative rate of change (D) and no rate of change (Q). To find an association between two parameters temporal association rule is used and defined using following terms: (1) Based on above structure of time series, the relationship between two parameters P i and P j for lag l is defined using following terms: j,k,l Parameters indicate direct relationship, defined as: i.e. the rate of change of P i matches with the rate of change of P j after time period l S D (P i , P j , l) Support count of direct relationship, defined as: α D (P i , P j , l) Support percent of direct relationship, defined as: Parameters indicate inverse relationship, defined as: i.e. the rate of change of P i is opposite to rate of change of P j after time period l S I (P i , P j , l) Support count of inverse relationship, defined as: α I (P i , P j , l) Support percent of inverse relationship, defined as: Θ R Strength of relationship. It indicates toughness of relationship exists between parameters. The relationship between P i and P j is calculated as: With our approach, we first consider the temporal association between indicators P i and P j since an association is needed for a cause-effect relationship. User defined support count threshold are defined as follows: α 1 Support count threshold for all causal relationships (considered as 70 % for experimentation). β Threshold for temporal odds ratio (considered as 3 for experimentation)Since α 1 is set 70 %, β is set to 3. (3) Definition 1 (Temporal association) Using direct or indirect relationship [Eqs.
Temporal direct association Temporal direct association between two parameters P i and P j for time lag l is defined as P i l → P j if α D (P i , P j , l) ≥ α 1 .
Temporal inverse association Temporal inverse association between two parameters P i and P j for time lag l is defined as P i l → P j if α I (P i , P j , l) ≥ α 1 .
Next, we define the terms to calculate the temporal odds ratio of temporally associated parameters to check whether the temporal association rule P i l → P j is also causal rule or not.
C E (P i , P j , l) = Count of the number of pairs when no rate of change in P i is associated with positive or negative rate of change in P j after time period l, defined as: where E i,j,k,l = Parameters indicate neutral-change relationship, defined as: C F (P i , P j , l) = Count of the number of pairs when the positive or negative rate of change in P i is associated with no rate of change in P j after time period l, defined as: where F i,j,k,l = Parameters indicate change-neutral relationship, defined as: C N (P i , P j , l) = Count of the number of pairs when no rate of change in P i is associated with no rate of change in P j after time period l, defined as: where N i,j,k,l = Parameters indicate neutral relationship, defined as: Definition 2 (Temporal odds ratio) It quantifies how strongly the presence or absence of change in value of parameter P i effecting change in value of parameter P j . Using above terms [Eqs. (11)-(16)] temporal odds ratio is defined as follows. (10) Temporal direct odds ratio Temporal direct odds ratio between two parameters P i and P j for time lag l is defined as: Temporal inverse odds ratio Temporal inverse odds ratio between two parameters P i and P j for time lag l is defined as: In our experimentation, if the value of C N (P i , P j , l) or C E (P i , P j , l) or C F (P i , P j , l) between parameters is zero, we considered it as 1 to avoid infinite temporal odds ratio.
Further causal rules are defined using terms define in Definitions 1 and 2.
Definition 3 (Binary rule) A binary causal rule (P i , D, l) ⇒ (P j ) , exists between P i and P j if there is temporal association rule P i l → P j and Oddratio D (P i , P j , l) ≥ β or In experimentation results, we represent direct causal rule by (P i , D, l) ⇒ (P j ) and inverse by (P i , I, l) ⇒ (P j ).
This rule will serve as a forward pruning criterion where all parameters which are not associated with another parameter with non-zero lag value are excluded from the combination of future search. The minimum required support makes the search space manageable.
Definition 4 (Precise binary rule) A precise binary rule (P i , D, δ 1 , l) ⇒ (P j , δ 2 ), exists between P i and P j if there is binary rule (P i , D, l) ⇒ (P j ) and (δ = δ 1 ), i.e. minimum growth rate of change of P i and (δ = δ 2 ), i.e. minimum growth rate of change of P j and the rule will not hold either δ > δ 1 for P i or δ > δ 2 for P j .
Based on binary causal rule, we try to extract other causal relationships as transitive, many to one (combined cause) and cyclic. We define these relationships as follows.

Proposed method
In this section, we described an algorithm based on the definitions. The algorithm is explained in five steps.
Step 1 generates the binary causal rule.
Step 2 generates more precise rules of binary causal rules. Steps 3, 4, and 5 generate the transitive, many to one and cyclic rules. Further, we give the explanation of each step of an algorithm. Table 1 represents the abbreviations used in the algorithm and in this paper. Let P be a time series database in discrete form and P i is a time series of parameter P i have U, D, and Q values as mentioned in the definitions, z is a number of parameters in database P.
Explanation To describe this step, we consider the time series using rate of change as positive (U) or negative (D) of two parameters say P i and P j for a time period (91-97). , 1992, 1993, 1994, 1995, 1996, 1997} P Here we calculate support value α for lag value = 1. Support value for lag value 1 α D (P i , P j , 1) = 83 % and temporal odd ratio (TOR), Oddratio D (P i , P j , 1) = 5.
Relationship strength [using Eq. (10)] of this rule is, 70.13. If time series data are given for some parameters, we can calculate α D and TOR between parameters and rules can be extracted. So with the help of the above algorithm, we would be able to extract all two-variable causal relationships between parameters for a time series data set.
Step 2: Specific rules generation In this step, we calculated the specific rule for binary causal rules generated in the above algorithm.
Let γ i and γ j are the rate of change of parameters P i and P j and parameters have a direct relationship.
Letδ i max = maximum value of the rate of change of P i , δ j max = maximum value of the rate of change of P j , δ i min = minimum value of the rate of change P i , δ j min = minimum value of the rate of change P j .
Explanation To understand this, we consider the time series of three parameters P i , P j , and P k as follows.
Let TOR > 3 and δ 1 , δ 2 , δ 3 is the rate of change of parameters P i , P j , P k . Calculate support values from Table 2 is: Support value of P i (U ) and P j (D), α ij P i , P j , 1 = 77.7. Support value of P j (D) and P k (D), α jk P j , P k , 1 = 88.8, Support value of P i (D) and P k (D), α ik (P i , P k , 2) = 75, Since α ij > α 1 , α jk > α 1 , α ik > α 1 , generated binary causal rules are
Explanation Let we have the following values for parameters P i , P j , and P k .
Let TOR > 3, δ 1 , δ 2 , δ 3 is the rate of change of parameters P i , P j , P k . Calculate support values from Table 3 as: Support value of α ik (P i , P k , 1) = 77.7 %, Support value of α jk P j , P k , 1 = 88.8%.
Explanation To understand this rule, we consider two parameters say P i , and P j , for a time period 1998-2015. Let δ 1 and δ 2 are rate of change for parameters P i , P j which have the following values.

Experiments
We implemented our method using Java programming language with Net Beans IDE 7.3. The computation time to check the causal relationship between parameters is high using serialized programming. So we use a parallelization approach in our program using threads in Java on a machine with configuration Dual-Core CPU contains 12-Cores, 8 GB RAM, and 64-bit Windows 7 Operating System. Our goal is to discover various causal relationships between the different economic parameters. Firstly, we find all the binary causal rules (i.e. one cause and one effect parameter) and then other causality rules are discovered using proposed method. For experimentation, minimum support threshold α 1 is set 70 % and β is set 3.

Table 3 Parameter time series
Italic letters indicate the temporal association between parameters for given time. For example, P i and P j are associated for lag 0 in 1991 and (P i , P j ) are associated with P k at lag 1. So, P i and P j values are italic at 1991 and P k at 1992

Dataset
The approach is discussed using 2 synthetic and 3 real-world dataset. Table 5 shows the summary of data sets. The synthetic dataset is generated using R software based on Bayesian network (BN

Results
This section presents the various extracted causal relationships for World Bank data sets. Results on other datasets are shown in "Comparison" section. To save space, at below, we omitted all relationships and consider only those relationships which are present in multiple countries and displaying some of them. The discovered causal rules with our approach are shown in Table 6 for south-Asian countries. In Table 6 causal relationship between parameters is described with its support, strength and rate of change of indicators. For example, a rule (Cereal production, D, 3 %, 1) ⇒ (Crop production index, 1 %), indicates direct relationship, i.e. increase in cereal production by 3 %, will increase the crop production index by 1 % after 1 year. This rule is discovered in four countries Srilanka, Nepal, Pakistan and India with different strength and support values. On the basis of support and strength value, we can say that this rule is more valid for Nepal rather than the other three countries. We can also identify a rule which has more valid for a country. In Table 6 from the binary causal rule, we can observe that three rules are present in India and above discussed rule is more valid than other rules in India. The transitive causal rules: (Rural population, D, 1 %, 1) ⇒ Population density, D, 0.33 %, 1) ⇒ Population, total, 0.68 %) can be described as, a 1 % increase in rural population increase population density by 0.33 % after 1 year, which tends to increase the total population by 0.68 % after a year. This rule is present in four countries, Afghanistan, India, Maldives, and Nepal. The rule is having more impact on India. As compared to binary and transitive causal rules, the algorithm extracts the less number of causal rules for many to one (combined causal) and cyclic. The many to one causal rule: {(Forest rents, I, 5 %, 2), (Foreign direct investment, D, 3 %, 1)} ⇒ (Crop production index, 7 %) indicates that the decrease in forest rent by 5 % and increase in foreign direct investment by 3 % would tend to increase the crop production index by 7 %. The cyclic causal rule: (Gross domestic savings, D, 1 %, 1) ⇔ (Cereal yield, D, 0.5 %, 2) can be described as, a 1 % increase in gross domestic savings increase cereal yield by 0.5 % after a year and increases in cereal yield would again increase gross domestic savings after 2 years. Similarly, other rules in all causal relationships can be analyzed.

Prediction effectiveness
The rules can be validated by calculating the mutual information (Meyer 2014) between indicators and the conditional entropy (Marsh 2013;Meyer 2014) change of the indicator before and after applying the rule. It is shown in Table 7 that the indicators are mutually related and the entropy of the indicator is decreased after applying the rule. Table 7 results show that the target indicator entropy is decreased after the rule is applied, which represents that indicator value is more uncertain when it is considered alone. For example, the large value of mutual information between CP and ARME, indicates that the two indicators are related and the entropy of ARME is decreased after the rule CP → ARME is applied. So it can be concluded that the proposed method achieves high prediction effectiveness. We validated all the generated causal rules using the concept of decrease in entropy and mutual information to check their prediction effectiveness. Generated causal rules can also be validated using time series graphs shown in "Appendix".

Scalability
Further, we do experimentation to evaluate the scalability of the algorithm with the involved years and the number of indicators. Considering Figs. 2 and 3, it could be seen that, the proposed cause-effect discovery method scales up with the number of indicators. We examine the performance degradation of the algorithm on the basis of various causal rule discoveries for nine different scales (number of indicators): 50, 75, 100, 125,  Fig. 2, the extraction time increases squarely with the number of indicators. More important, the curve is parabolic, which means that the performance of our algorithm is non-linearly related to the increase of number of indicators in binary causal rules. Though the time for generation of the binary causal rule is increasing squarely with a number of indicators, time for generation of other rules is not non-linear because the generation of other rules uses the result of binary rule generation (in Fig. 3).
The proposed method is able to extract nonlinear relationship from extracted causal rules because we are dealing with change of values as the rate of change and this change can be linear or nonlinear.

Comparison
To assess the efficiency of the proposed method, we compared proposed method with both statistical and non statistical methods. Statistical (Granger causality, Bayesian network) methods comparison is performed using R software packages as lmtest (Hothorn et al. 2015) for GC and bnlearn (Scutar 2016) for BN. In BN we calculate the results using constraint based local discovery algorithm hiton.pc (Aliferis et al. 2003). For nonstatistical approaches, we implemented the methods (Silverstein et al. 2000;Jin et al. 2012;Li et al. 2013) in Java for causal rule discovery.
First, we compared proposed method with GC and BN. GC is the base method to detect lag relationship in stationary time series data set. We run GC for different lag values with significance level, α = 0.05. HITON-PC is an effective algorithm of BN to extract parent-child relationship. So we considered both statistical methods as a benchmark for accuracy comparison. Tables 8 and 9 describe that all the binary rules which are generated in all the datasets by other methods are also generated by the proposed method. For example in the synthetic-2 dataset, we described the rule related to indicator I 7 and I 8 . In the statistical approach from Table 8, we can observe that the GC can discover only binary causal rules while BN can discover transitive as well as binary rules between indicators. For example, in a BN graph like I 1 → I 3 → I 6 can be generated, but I 1 and I 6 are independent, i.e. I 1 and I 6 may or may not be dependent. In proposed method I 1 and I 6 are conditionally dependent or I 1 is an indirect cause of I 6 .
Second, we compared our method with non-statistical methods. From Table 9 it can observe that binary and combined (many to one) causal relationship can be discovered by Jin et al. (2012) and Li et al. (2013) in all datasets. Silverstein et al. (2000) can also detect many to one rule but independently. For example, if we consider the rule (I 2 , I 4 ) → I 5 in the synthetic-1 dataset it would be considered as I 2 → I 5 ← I 4 , i.e. I 2 and I 4 affect I 5 independently, so we have not considered the many to one rule generated in a method (Silverstein et al. 2000). A transitive relationship is extracted by Silverstein et al. (2000) and proposed method. Relationships extracted by various methods are shown in Tables 8 and 9.
Based on the experimental results, it is reasonable to conclude that proposed method is capable to extract various causal relationships and causal rules like cyclic and the transitive causal rule cannot be extracted by other methods. Although non-statistical methods can generate combined causal rules, but are not generating specific rule and relationship strength. One more advantage of our method is that it also generates more specific rule and their strength between indicators. For example, when we run our algorithm on the synthetic-1 dataset, rules are extracted with various properties as lag value (time period after which one affects another indicator), strength and the rate of change of indicators i.e. positive or negative percent change. Actually, the rule I 1 → I 3 is extracted as (I 1 , I, 2%, 1) ⇒ (I 3 , 1%), 113.6, which indicates 2 % change in I 1 inversely effect 1 % change in I 3 after 1 year with 113.6 relationship strength. The results of proposed method are also demonstrated with real world data sets, as described in the following.
To investigate various causal rules in the real world cases, we run the proposed algorithm on the three real world data sets shown in Table 5 for performance evaluation. The proposed algorithm generates various binary, many to one, transitive and cyclic rules, some of the causal rules are reasonable as judged by common sense, shown in Table 8. For example, from the IMF data set, it is found that increases in general government revenue would also increase the volume of exports of goods, increase in growth of general government revenue and gross national saving effect to increase in total investment, and a decrease in government revenue can lead to decreased exports of goods too. Some interesting causal relationships are also extracted in the WTO and World Bank dataset. For example, if crop production of a country is increased, it effects to increase the export of agriculture raw material which helps to improve the economic growth of a country.

Performance evaluation
This section presents measures for assessing how accurately our proposed method can generate causal rules. The used accuracy measures (Han et al. 2011) are Precision, Recall, Specificity, F-score, Accuracy (recognition rate) and Misclassification rate. We evaluated all measures for proposed, statistical and non-statistical methods compared previously. Binary rules are considered to predict accuracy because this can be generated by all compared methods. Initially we classify the results in two classes as a causal rule (CR) and non-causal rule (NCR). Then, based on the CR and NCR results confusion matrix (TP, TN, FP, FN) is created to evaluate measures shown in "Appendix". Finally accuracy measures are calculated using TP, TN, FP and FN values. Performance of various methods is evaluated in real world, World Bank dataset for five different scales (numbers of indicators): 10, 20, 30, 40 and 50. Number of target indicators is set to 5 and remain same for all different scales. In Table 10, WBD-10 represents that 10 indicators are considered for causal rule extraction similarly others can be interpreted. Causal rules (some of them) extracted by most of the compared methods are shown in "Appendix". To indicate extracted causal rules significance appropriate references from previous literatures and documents are given. In Table 10, we can see that the proposed method can achieve higher accuracy and less error rate than all other statistical and non-statistical method for different scales of World Bank dataset. The accuracy curve for proposed method and the compared methods is shown in Fig. 4. The proposed method can extract causal rules more accurately and performs the best in all different scales. We can also notice when the dataset size increases; the statistical method performance degrades more than non-statistical methods. We regard our proposed method has a stable and good performance accuracy in comparison with the other compared methods.
In summary the comparison results show that the proposed method has high performance and also performs well in terms of all accuracy measures as compare to other compared methods.

Complexity
The steps defined in an algorithm to make minimum passes over the data. In the first pass, we calculate the growth rate of parameters and its positive, negative or neutral growth rate change value U, D, and Q are assigned to each parameter to perform the next steps. In the second pass, we calculate the support value and an odds ratio of all the individual parameters together with other parameters for different lag values. Nonzero lag value associations identified from the tests are considered. Associations with insufficient support and odds ratio will be eliminated directly. The cause-effect rules in current pairs can be determined from temporal associations and temporal odds ratio for nonzero lag value. At the end, causal pairs found previously are combined for the next steps to generate transitive, many to one and cyclic rule using basic causal binary rule. To achieve efficiency, all the combinations are not considered as a condition during the generation of other causality rules. Instead, we only investigate the combinations appearing in the data which are related to non-zero lag value. Since such combinations are very small as compared to total combinations, the cost of computation is reduced.
To analyze the performance of the algorithm with respect to time and space complexity, and the number of passes over the data set, we denote the set of parameter S, the number of parameters n, the length of the time series t, the number of extracted pairs m and the lag value l. The complexity of the method is discussed based on the extraction of binary causal rules in the form of P 1 → P 2 for lag value l.
The single parameters are paired and the support is calculated with O(n) passes over the data set. Each pair combination needs to test for l lag values to determine the association and causality, which requires O(n * l) passes. In the process of extracting binary causal relationships, a causal association will be examined on all combinations. The total number of possible pair combinations P is: So the data set needs to scan as many as O (Pnl) times. This way we can conclude the passes over the data set is O (Pnl), and the time it takes is O (Pnlt). Complexity will be substantially reduced by firstly applying the pruning step1 (binary rule generation) before extraction of other relationships.

Conclusion
This paper proposed a novel method to extract various types of causal relationship like binary, transitive, many to one and cyclic in large time series database. The proposed method is generating more specific rules and their strength which are useful for strategic information. We also defined the concept of temporal odds ratio to categorize temporal association as a causal rule. Experiments have shown that the proposed algorithm can extract single, transitive, combined and cyclic causes from large time series data sets. Additionally, the extracted rules are validated to prove their accuracy and the algorithms have been shown to scale up well with respect to the number of indicators on time series data.
In future, the efficiency of the method can be improved by using fast algorithms of mining association rule. The concept of the algorithm can also be extended to other types of time series. The proposed method can be applied in various social, economic, agriculture domains to generate strategic rules for decision making. The method is also useful to detect the exact cause of fault for the large mechanical system which is monitored by various sensors generating time series data.

Fig. 4 Accuracy curve of various methods on different scales
We can also examine the accuracy of the proposed method through by plotting time series graph between indicators. We have shown time series for four causal relationships. Table 11 shows the growth rate change of parameters for the time period 1972-2009. It represents a value with a lag difference. For example, consider a binary rule CP-(2) → ARME, indicates CP effect ARME after 2 years. In Table 11 value 5.93, shows the growth rate of change of CP in 1972 and 4.35 in the same row shows the growth rate of change of ARME in 1974. Italic values represent the pairs which follow the relationship for a rule. Similarly, we can interpret all entries of other indicators. All time series graphs are generated based on the values given in Table 11. Figure 5 shows the time lagged relations between Cereal Production (CP) and Agriculture raw material exports (ARME) with lag 2. A time period where indicators follow the direct relationship for given rule are: {1972, 1973{1972, , 1974{1972, , 1975{1972, , 1976{1972, , 1978{1972, , 1980{1972, , 1981{1972, , 1982{1972, , 1983{1972, , 1986{1972, , 1988{1972, , 1990{1972, , 1991{1972, , 1993{1972, , 1995{1972, , 1996{1972, , 1997{1972, , 1998{1972, , 1999{1972, , 2000{1972, , 2001{1972, , 2002{1972, , 2004{1972, , 2006{1972, , 2008{1972, , 2009}. For each time period (say 1974 rule would be interpreted as if the growth of CP has increased in the year 1974 it will increase ARME in 1976. In Fig. 5, time series graph we can observe that parameter satisfied the minimum support and odds ratio which indicates that indicators (CP and ARME) are causally related. Since most of the time, an increase in CP increases ARME, this relationship can be considered as a binary causal (U-U) direct relationship (Table 11). Figure 6 shows the time lagged transitive causal relationship between AR, AG, and CO2 with lag 1 and 3. Time where indicators follow the relationship for given rule are: {1974, 1975, 1976, 1977, 1978, 1980, 1981, 1982, 1984, 1987, 1988, 1990, 1991, 1992, 1994, 1995, 1996, 1998, 2000, 2002, 2004, 2005, 2006}. For each time period (say 1974) rule would be interpreted as increase in AR in 1974 will increase AG in 1975 which again increases CO2 in 1978. From Fig. 6, we can conclude that the rule satisfied the minimum support and indicators (AR, AG, and CO2) are causally related. Most of the time increase in AR increases AG after 1 year, which again increases the CO2 after 3 years. This rule can be considered as a transitive causal (U-U-U) direct relationship (Table 11). Figure 7 shows the time lagged many to one relation between (FDI, FR) and CPI with lag 1. Time period follows this relationship can be seen in many to one rule in Table 11 as italic values. In this relation, both indicators FDI and FR together affect CPI after 1 year. In Fig. 7, we can observe that if FDI increases and FR decreases they tend to increase the CPI, i.e. FDI and FR both are the cause of CPI. Indicators follow many to one (combined) causal (U,D)-U relationship. Table 12, shows confusion matrix (TP, TN, FP, FN) values to evaluate accuracy measures and Table 13, represents the causal rules extracted by most of the compared methods.  Figure 8 shows the time lagged cyclic relations between GDP and CY with lag 2 and 1. Time period follows this relationship can be seen in the cyclic rule in Table 11 as italic values. In this cyclic relation, one more indicator GDP1 values are given which is nothing but the value of GDP after 3 years. Here GDP effect CY after 2 years, which again