What is the main difference between a population and a sample?

A population can only consist of people
A population is always smaller than a sample
A sample is a subset of a population
A sample is always larger than a population

The main difference between a population and a sample is that a sample is a subset of a population. A population refers to the entire group of individuals or observations that we're interested in, while a sample is a smaller group that's been selected from that population.

Discuss it

In the Kruskal-Wallis Test, if the p-value is less than the chosen significance level, we ________ the null hypothesis.

accept
consider
ignore
reject

If the p-value is less than the chosen significance level in the Kruskal-Wallis Test, we reject the null hypothesis. It means there is enough evidence to suggest that at least one of the groups is different from the others.

Discuss it

When is it more appropriate to use the Mann-Whitney U test than a t-test?

When data is normally distributed
When data is not normally distributed
When sample sizes are equal
When the variances of the two groups are equal

The Mann-Whitney U test is more appropriate to use than a t-test when the data is not normally distributed. This test is a non-parametric alternative to the independent t-test and does not assume normality.

Discuss it

Can the Mann-Whitney U test be used for paired samples?

No
Only if the data is normally distributed
Only if the variances are equal
Yes

No, the Mann-Whitney U test is not used for paired samples. It is designed for two independent samples. For paired samples, a different test, such as the Wilcoxon signed-rank test, would be more appropriate.

Discuss it

What does a Principal Component represent in a dataset?

A combination of original features
A feature of the dataset
A group of similar data points
A target variable

A Principal Component is a linear combination of the original features in a dataset. Each principal component is orthogonal to each other, meaning they are uncorrelated and each represents a different direction in which the data varies.

Discuss it

What are the characteristics of a Poisson distribution?

All outcomes are equally likely
It describes the distribution of non-overlapping events in an interval
It describes the distribution of rare events
It describes the events that are not independent

The Poisson distribution is used for describing the distribution of rare events in a large population or time/space interval. It also describes events that are independent, meaning the occurrence of one event doesn't affect the occurrence of another.

Discuss it

What is the role of interaction effects in a two-way ANOVA?

They calculate the variance within each group
They correct for multiple comparisons
They show how the levels of one independent variable affect the effect of the other variable on the dependent variable
They show the distribution of residuals

In a two-way ANOVA, interaction effects show how the levels of one independent variable affect the effect of the other variable on the dependent variable. Essentially, it shows whether the effect of one independent variable depends on the level of the other independent variable.

Discuss it

In which situations would you use the Kruskal-Wallis Test instead of ANOVA?

When data is normally distributed
When sample sizes are large
When the assumptions of ANOVA are violated
When there is only one independent variable

You would use the Kruskal-Wallis Test when the assumptions of ANOVA (like normality or equal variances) are violated.

Discuss it

The ___________ correlation is a non-parametric measure of correlation based on data rank.

Kendall's
Pearson's
Point-biserial
Spearman's

Spearman's correlation is a non-parametric measure of rank correlation. It assesses how well the relationship between two variables can be described using a monotonic function. This makes it suitable for both continuous and discrete ordinal variables.

Discuss it

When two events are mutually exclusive, what is the probability that both will occur?

0
0.5
1
The sum of the probabilities of the two events

When two events are mutually exclusive, it means they cannot occur at the same time. Therefore, the probability that both will occur is 0.

Discuss it

Data that can be divided into categories but has no order or priority is known as ________ data.

Continuous
Discrete
Nominal
Ordinal

Nominal data is data that can be divided into categories but has no order or priority. It is a type of categorical data that simply allows us to classify or categorize. Examples include types of cuisine (Italian, Chinese, Mexican, etc.), hair color, or city of residence.

Discuss it

A distribution that is symmetric and bell-shaped is known as a _______ distribution.

Bimodal
Normal
Skewed
Uniform

A normal distribution, also known as Gaussian distribution, is symmetric and bell-shaped. It is characterized by its mean and standard deviation. The mean, mode and median are all equal and are located at the center of the distribution.

Discuss it