The Mann-Whitney U test is a ________ test.

Chi-square
correlation
non-parametric
parametric

The Mann-Whitney U test is a non-parametric test, meaning it does not assume that the underlying data follows a specific distribution.

Discuss it

How does the concept of "updating" apply in Bayesian statistics?

It means changing the data after analysis
It means replacing old hypotheses with new ones
It refers to modifying the statistical model
It refers to the process of using new evidence to update a prior probability

Updating in Bayesian statistics refers to the process of using new evidence to update a prior probability to a posterior probability. Bayes' theorem provides the mathematical framework for this updating process.

Discuss it

What are the implications of having a small expected frequency in a Chi-square test for goodness of fit?

It can cause the Chi-square distribution approximation to be inaccurate
It increases the degrees of freedom
It leads to a higher power of the test
It leads to a smaller Chi-square statistic

If the expected frequency in any category is too small (common rule of thumb is less than 5), the Chi-square distribution approximation may be inaccurate, leading to incorrect conclusions.

Discuss it

If the calculated Chi-square statistic is greater than the critical Chi-square value, we ________ the null hypothesis.

accept
adjust
reject
retain

If the calculated Chi-square statistic is greater than the critical Chi-square value (based on the chosen significance level and the degrees of freedom), we reject the null hypothesis. This means the observed distribution significantly differs from the expected distribution.

Discuss it

Factor analysis reduces the dimensions of data by combining similar _______ into groups or factors.

eigenvalues
factors
observations
variables

Factor analysis reduces the dimensions of data by combining similar variables into groups or factors.

Discuss it

The ________ distribution is symmetric and its mean, median and mode are equal.

Binomial
Normal
Poisson
Uniform

The normal distribution, also known as the Gaussian distribution, is symmetric, and its mean, median, and mode are all equal. It is shaped like a bell curve, with the data evenly distributed about the mean.

Discuss it

What are the key assumptions for applying the Sign Test?

Data must be at least ordinal
Data must be categorical
Data must be continuous
Data must be normally distributed

The key assumption for applying the Sign Test is that the data must be at least ordinal. The Sign Test is a non-parametric test and does not require the assumption of normality.

Discuss it

The optimal number of clusters in K-means clustering is often determined using the ________ method.

elbow
foot
hand
knee

The optimal number of clusters in K-means clustering is often determined using the elbow method. This involves plotting the explained variation as a function of the number of clusters and picking the elbow of the curve as the number of clusters to use.

Discuss it

The _______ test compares the means of two independent groups.

Chi-square
Independent t
Paired t
Z

An Independent t-test (or two sample t-test) compares the means of two independent groups.

Discuss it

How does a higher R-squared value impact the inference in multiple linear regression?

It decreases the number of observations
It improves the interpretability of the model
It increases the residuals
It makes the model more complex

The R-squared value measures the proportion of the variance in the dependent variable that is predictable from the independent variables. A higher R-squared value, closer to 1, implies a higher proportion of variability in the response variable is explained by the predictors, improving the model's interpretability and predictive power.

Discuss it