Support is defined on an itemset as the proportion of transactions that contain the attribute
where
For an association rule, support is defined as the support of all the attributes in the rule.
Range:
Reference: Michael Hahsler, A Probabilistic Comparison of Commonly Used Interest Measures for Association Rules, 2015, URL: https://mhahsler.github.io/arules/docs/measures
Confidence of the rule, defined as the proportion of transactions that contain the consequent in the set of transactions that contain the antecedent. This proportion is an estimate of the probability of seeing the consequent, if the antecedent is present in the transaction.
Range:
Reference: Michael Hahsler, A Probabilistic Comparison of Commonly Used Interest Measures for Association Rules, 2015, URL: https://mhahsler.github.io/arules/docs/measures
Lift measures how many times more often the antecedent and the consequent Y occur together than expected if they were statistically independent.
Range:
Reference: Michael Hahsler, A Probabilistic Comparison of Commonly Used Interest Measures for Association Rules, 2015, URL: https://mhahsler.github.io/arules/docs/measures
Coverage, also known as antecedent support, is an estimate of the probability that the rule applies to a randomly selected transaction. It is the proportion of transactions that contain the antecedent.
Range:
Reference: Michael Hahsler, A Probabilistic Comparison of Commonly Used Interest Measures for Association Rules, 2015, URL: https://mhahsler.github.io/arules/docs/measures
Support of the consequent.
Range:
Reference: Michael Hahsler, A Probabilistic Comparison of Commonly Used Interest Measures for Association Rules, 2015, URL: https://mhahsler.github.io/arules/docs/measures
Conviction can be interpreted as the ratio of the expected frequency that the antecedent occurs without the consequent.
Range:
Reference: Michael Hahsler, A Probabilistic Comparison of Commonly Used Interest Measures for Association Rules, 2015, URL: https://mhahsler.github.io/arules/docs/measures
Inclusion is defined as the ratio between the number of attributes of the rule and all attributes in the database.
where
Range:
Reference: I. Fister Jr., V. Podgorelec, I. Fister. Improved Nature-Inspired Algorithms for Numeric Association Rule Mining. In: Vasant P., Zelinka I., Weber GW. (eds) Intelligent Computing and Optimization. ICO 2020. Advances in Intelligent Systems and Computing, vol 1324. Springer, Cham.
Amplitude measures the quality of a rule, preferring attributes with smaller intervals.
where
Range:
Reference: I. Fister Jr., I. Fister A brief overview of swarm intelligence-based algorithms for numerical association rule mining. arXiv preprint arXiv:2010.15524 (2020).
Interestingness of the rule, defined as:
Here, the first part gives us the probability of generating the rule based on the antecedent, the second part gives us the probability of generating the rule based on the consequent and the third part is the probability that the rule won't be generated. Thus, rules with very high support will be deemed uninteresting.
Range:
Reference: I. Fister Jr., I. Fister A brief overview of swarm intelligence-based algorithms for numerical association rule mining. arXiv preprint arXiv:2010.15524 (2020).
Comprehensibility of the rule. Rules with fewer attributes in the consequent are more comprehensible.
Range:
Reference: I. Fister Jr., I. Fister A brief overview of swarm intelligence-based algorithms for numerical association rule mining. arXiv preprint arXiv:2010.15524 (2020).
The netconf metric evaluates the interestingness of association rules depending on the support of the rule and the support of the antecedent and consequent of the rule.
Range:
Reference: E. V. Altay and B. Alatas, "Sensitivity Analysis of MODENAR Method for Mining of Numeric Association Rules," 2019 1st International Informatics and Software Engineering Conference (UBMYK), 2019, pp. 1-6, doi: 10.1109/UBMYK48245.2019.8965539.
The Yule's Q metric represents the correlation between two possibly related dichotomous events.
Range:
Reference: E. V. Altay and B. Alatas, "Sensitivity Analysis of MODENAR Method for Mining of Numeric Association Rules," 2019 1st International Informatics and Software Engineering Conference (UBMYK), 2019, pp. 1-6, doi: 10.1109/UBMYK48245.2019.8965539.
Zheng's metric measures the strength of association (positive or negative) between the antecedent and consequent, taking into account both their co-occurrence and non-co-occurrence.
Range:
Reference: T. Zhang, “Association Rules,” in Knowledge Discovery and Data Mining. Current Issues and New Applications, 2000, pp. 245–256. doi: 10.1007/3-540-45571-X_31.
Leverage metric is difference between the frequency of antecedent and the consequent appearing together and the expected frequency of them appearing separately based on their individual support
Range: [-1, 1] (-1 reflects total negative association, 1 reflects perfect positive association
and 0 reflects independence)
Reference: Gregory Piatetsky-Shapiro. 1991. Discovery, Analysis, and Presentation of Strong Rules. In Knowledge Discovery in Databases, Gregory Piatetsky-Shapiro and William J. Frawley (Eds.). AAAI/MIT Press, 229–248.