Synopsis
Performs an ANOVA significance test for a single numerical attribute based on the groups defined by another (nominal) attribute.
Description
This operator creates groups of the input example set based on the defined grouping attribute. For each of the groups the mean and variance of another attribute (the anova attribute) is calculated and an ANalysis Of VAriance (ANOVA) is performed. The result will be a significance test result for the specified significance level indicating if the values for the attribute are significantly different between the groups defined by the grouping attribute.
Input
- example set: expects: ExampleSetMetaData: #examples: = 0; #attributes: 0
, expects: ExampleSet, expects: ExampleSet
Output
- significance:
- example set:
Parameters
- anova attribute: Calculate the ANOVA for this attribute based on the groups defines by group_by_attribute.
- group by attribute: Performs a grouping by the values of the attribute with this name.
- significance level: The significance level for the ANOVA calculation.
- only distinct: Indicates if only rows with distinct values for the aggregation attribute should be used for the calculation of the aggregation function.