What are the assumptions made by the Spearman’s Rank Correlation test?

The data is continuous and the relationship is monotonic
The data is normally distributed and linear
The data is ordinal and the relationship is linear
The data is ordinal or continuous and the relationship is monotonic

The Spearman’s Rank Correlation test assumes that the variables are ordinal or continuous and that the relationship between them is monotonic. It does not require the relationship to be linear or the data to be normally distributed.

Discuss it

What are the limitations of using mean as a measure of central tendency?

It can't be used with large data sets
It can't be used with small data sets
It is difficult to calculate
It is highly sensitive to outliers

The main limitation of the mean as a measure of central tendency is that it is highly sensitive to outliers or extreme values. An outlier can skew the mean and make it a less accurate representation of the data. Moreover, mean does not describe the middle value or most common value in the dataset, which are often important characteristics.

Discuss it

What is the effect of monotonic transformations on Spearman’s rank correlation coefficient?

They decrease the coefficient
They don't affect the coefficient
They increase the coefficient
They make the coefficient negative

Monotonic transformations do not affect the Spearman’s rank correlation coefficient. This is because Spearman's correlation is based on the rank order of data, and monotonic transformations preserve this order.

Discuss it

What's the difference between a histogram and a bar plot?

Bar plots are for continuous data, histograms for categorical data
Both are for continuous data only
Histograms are for continuous data, bar plots for categorical data
There is no difference

The main difference between a histogram and a bar plot is the type of data they represent. A histogram is used for continuous data, where the bins represent ranges of data, while a bar plot is used for categorical data to compare the frequency or count of different categories.

Discuss it

What is the error term in a simple linear regression model?

It is the dependent variable
It is the difference between the observed and predicted values
It is the independent variable
It is the slope of the regression line

The error term in a simple linear regression model is the difference between the observed and predicted values. It captures the variability in the dependent variable that is not explained by the independent variable in the model.

Discuss it

What can be inferred if the residuals are not randomly distributed in the residual plot?

The data has no outliers
The data is perfectly linear
The linear regression model is a perfect fit for the data
The linear regression model is not a good fit for the data

If the residuals are not randomly distributed (e.g., if they form a pattern), it suggests that the linear regression model is not a good fit for the data. This could be because the relationship between the variables is not linear, or because the data exhibits heteroscedasticity (unequal variances of errors), among other reasons.

Discuss it

What type of data is used in the Chi-square test for goodness of fit?

Categorical data
Continuous data
Interval data
Ordinal data

The Chi-square test for goodness of fit is used with categorical data. It compares the observed frequencies in each category with the frequencies we would expect to see if the data followed the theoretical distribution.

Discuss it

What is the null hypothesis in the Mann-Whitney U test?

The groups have different variances
The groups have equal variances
There is a significant difference between the groups
There is no significant difference between the groups

In the Mann-Whitney U test, the null hypothesis is that there is no significant difference between the groups. More specifically, it states that the probability that a randomly selected value from the first group is greater than a randomly selected value from the second group is equal to 0.5.

Discuss it

How does sample size affect the width of a confidence interval?

Increasing the sample size decreases the width of the confidence interval
Increasing the sample size has no effect on the width of the confidence interval
Increasing the sample size increases the width of the confidence interval
The relationship between sample size and the width of the confidence interval is unpredictable

Increasing the sample size decreases the width of the confidence interval. The larger the sample size, the more information you have, and thus the less uncertainty (which translates into a smaller standard error and narrower confidence interval).

Discuss it

If A and B are independent events, the probability of both occurring is ________.

P(A + B)
P(A / B)
P(A ∩ B)
P(A ∪ B)

If A and B are independent events, the probability of both occurring is P(A ∩ B) which is equal to P(A) * P(B). This is the fundamental characteristic of independent events in probability.

Discuss it