If the results of an ANOVA test are significant, ________ tests are often used to identify specifically which groups' means are different.

Interaction
Post-hoc
Pre-hoc
Tukey

If the results of an ANOVA test are significant, post-hoc tests are often used to identify specifically which groups' means are different. These tests are performed after the ANOVA and help to avoid type I errors when making multiple comparisons.

Discuss it

What is the primary objective of cluster analysis?

To classify variables into different groups
To group similar instances into clusters
To predict the output variable
To visualize high-dimensional data

The primary objective of cluster analysis is to group similar instances (observations, data points, etc.) into clusters.

Discuss it

In ANOVA, if the F statistic is significantly high, it suggests that the null ________ should be rejected.

Distribution
Hypothesis
Model
Theory

If the F statistic in an ANOVA is significantly high, it suggests that the null hypothesis should be rejected. The null hypothesis in ANOVA is typically that all group means are equal.

Discuss it

If the population standard deviation is unknown, we use the sample standard deviation to estimate the ________ of the mean.

Confidence interval
Range
Standard error
Variability

If the population standard deviation is unknown, the sample standard deviation is used to estimate the standard error of the mean. The standard error is a measure of how much the sample mean is expected to vary from the true population mean.

Discuss it

What is the purpose of Pearson's Correlation Coefficient?

To compute the standard deviation of a dataset
To determine the linear relationship between two variables
To find the mean of a set of values
To transform qualitative data into quantitative data

Pearson's correlation coefficient (denoted as r) is a measure of the strength and direction of association that exists between two continuous variables. It measures the degree to which pairs of data for these two variables lie on a line. The values lie between -1 and 1, where 1 indicates a perfect positive correlation, -1 a perfect negative correlation, and 0 no correlation at all.

Discuss it

Quantitative data represents quantities and can be measured on a ________ scale.

Categorical
Nominal
Numerical
Ordinal

Quantitative data represents quantities and can be measured on a Numerical scale. It includes both discrete data (e.g., the number of students in a class) and continuous data (e.g., the weight of a person).

Discuss it

How does standard deviation differ from the mean absolute deviation?

Mean absolute deviation is always greater
Standard deviation is always greater
Standard deviation squares the deviations while mean absolute deviation takes absolute values
They are the same

The standard deviation and mean absolute deviation both measure the dispersion in a dataset. The key difference lies in how they treat deviations from the mean: standard deviation squares the deviations before averaging them, while mean absolute deviation takes the absolute value of deviations before averaging. As a result, standard deviation is more sensitive to extreme values than the mean absolute deviation.

Discuss it

In the presence of multicollinearity, the estimated regression coefficients are _______.

biased
equal to zero
negative
unbiased

Even in the presence of multicollinearity, the least squares estimates of the regression coefficients are still unbiased. However, they are less precise and have high standard errors.

Discuss it

When two or more predictors in a multiple linear regression model are highly correlated, it is known as __________.

Autocorrelation
Homoscedasticity
Multicollinearity
Overfitting

Multicollinearity is a phenomenon in which one predictor variable in a multiple regression model can be linearly predicted from the others with a substantial degree of accuracy. This can lead to unstable estimates of the coefficients.

Discuss it

How would you interpret the result of a Kruskal-Wallis Test?

As a measure of correlation
As a measure of dependence
As a measure of difference between groups
As a measure of variance

The result of a Kruskal-Wallis Test is interpreted as a measure of difference between groups. If the test is significant, it suggests that at least one of the groups differs from the others.

Discuss it

The geometric mean is particularly useful when comparing different items with very different ________.

Mean values
Median values
Mode values
Ranges

The geometric mean is particularly useful when comparing different items with very different ranges. It is used in various kinds of growth rates, like population growth or financial growth, where each year's value is relative to the previous year's value.

Discuss it

What is a uniform distribution?

A bell-shaped distribution
A distribution with different probabilities for different outcomes
A distribution with the same probability for all outcomes
A skewed distribution

A uniform distribution, also called a rectangular distribution, is a type of probability distribution in which all outcomes are equally likely. Each interval of equal length on the distribution's support has the same probability.

Discuss it