What does a 95% confidence interval estimate?

The mean of the sample
The range within which 95% of the data points lie
The standard deviation of the population
The true population parameter with a 95% level of confidence

A 95% confidence interval estimates the range within which we are 95% confident that the true population parameter lies. It is not about the range of the data or the mean of the sample.

Discuss it

What are the limitations of using mean as a measure of central tendency?

It can't be used with large data sets
It can't be used with small data sets
It is difficult to calculate
It is highly sensitive to outliers

The main limitation of the mean as a measure of central tendency is that it is highly sensitive to outliers or extreme values. An outlier can skew the mean and make it a less accurate representation of the data. Moreover, mean does not describe the middle value or most common value in the dataset, which are often important characteristics.

Discuss it

What are the assumptions made by the Spearman’s Rank Correlation test?

The data is continuous and the relationship is monotonic
The data is normally distributed and linear
The data is ordinal and the relationship is linear
The data is ordinal or continuous and the relationship is monotonic

The Spearman’s Rank Correlation test assumes that the variables are ordinal or continuous and that the relationship between them is monotonic. It does not require the relationship to be linear or the data to be normally distributed.

Discuss it

_______ regression is a method used to handle multicollinearity by adding a degree of bias to the regression estimates.

Logistic
Polynomial
Ridge
Simple linear

Ridge regression handles multicollinearity by introducing a degree of bias to the regression estimates, reducing their variance, and making them more reliable.

Discuss it

What is the primary goal of random sampling?

To always select the same individuals
To ensure that every member of the population has an equal chance of being selected
To select individuals who are likely to give the desired results
To select the individuals who are easiest to reach

The primary goal of random sampling is to ensure that every member of the population has an equal chance of being selected. This helps to reduce bias and increase the likelihood that the sample is representative of the population, which makes the results more valid and generalizable.

Discuss it

The conditional probability of A given B is denoted as ________.

P(A + B)
P(A / B)
P(A B)
P(A ∩ B)

The conditional probability of A given B is denoted as P(A

Discuss it

What is the effect of monotonic transformations on Spearman’s rank correlation coefficient?

They decrease the coefficient
They don't affect the coefficient
They increase the coefficient
They make the coefficient negative

Monotonic transformations do not affect the Spearman’s rank correlation coefficient. This is because Spearman's correlation is based on the rank order of data, and monotonic transformations preserve this order.

Discuss it

In the context of cluster analysis, what is the 'centroid'?

The average distance between clusters
The geometric center of a cluster
The largest point in a cluster
The smallest point in a cluster

The centroid is the geometric center of a cluster. In other words, it's the mean value of all the points in a specific cluster.

Discuss it

_______ is a measure of how spread out the numbers in a dataset are around the mean.

Median
Range
Standard Deviation
Variance

Standard deviation is a measure of how spread out the numbers in a dataset are around the mean. It measures the average distance between each data point and the mean. The higher the standard deviation, the more spread out the data is.

Discuss it

Why is it important to consider the power of a test when designing a study?

To ensure the study can detect an effect if it exists
To ensure the study does not detect an effect if it does not exist
To maximize the chance of a Type I error
To minimize the chance of a Type I error

The power of a test is the ability of the test to detect an effect if it truly exists. It's the probability that the test correctly rejects a false null hypothesis. High power is desirable because it means the test is less likely to make a Type II error (false negative). When designing a study, it's important to choose a sample size and significance level that will provide enough power to detect an effect if one exists.

Discuss it

If A and B are independent events, the probability of both occurring is ________.

P(A + B)
P(A / B)
P(A ∩ B)
P(A ∪ B)

If A and B are independent events, the probability of both occurring is P(A ∩ B) which is equal to P(A) * P(B). This is the fundamental characteristic of independent events in probability.

Discuss it

How does sample size affect the width of a confidence interval?

Increasing the sample size decreases the width of the confidence interval
Increasing the sample size has no effect on the width of the confidence interval
Increasing the sample size increases the width of the confidence interval
The relationship between sample size and the width of the confidence interval is unpredictable

Increasing the sample size decreases the width of the confidence interval. The larger the sample size, the more information you have, and thus the less uncertainty (which translates into a smaller standard error and narrower confidence interval).

Discuss it