When is it appropriate to use quantitative data over qualitative data?

Never
When both measuring and categorizing are required
When categorizing or describing is required
When measuring or counting is required

Quantitative data is appropriate to use when measuring or counting is required, or when the data can be numerically quantified. This data type allows for statistical analysis and can provide a more objective and precise understanding than qualitative data. For example, it's appropriate to use quantitative data when you want to know how many people visited a website, how much customers are willing to pay for a product, or how often a certain event occurs.

Discuss it

What kind of hypothesis is tested in the Sign Test?

The means of two groups are equal
The medians of two groups are equal
The proportions of two groups are equal
The variances of two groups are equal

The Sign Test tests the null hypothesis that the medians of two groups are equal.

Discuss it

PCA assumes that the data follows a _______ distribution.

Poisson
binomial
normal
uniform

PCA makes the assumption that the data follows a multivariate normal distribution. This means that all linear combinations of the original variables also follow a normal distribution.

Discuss it

What happens to the width of a confidence interval as the confidence level increases?

It decreases
It fluctuates unpredictably
It increases
It stays the same

The width of a confidence interval increases as the confidence level increases. A higher confidence level means that you want to be more sure that you are capturing the true population parameter, which requires a wider interval.

Discuss it

What is the Central Limit Theorem and how does it relate to point and interval estimation?

It implies that every data set is symmetrically distributed, which affects the reliability of point and interval estimations
It suggests that all data has a central tendency and this affects the point and interval estimations
It suggests that as sample size increases, the distribution of sample means approaches a normal distribution, which affects how we estimate population parameters
It suggests that every large enough dataset is normally distributed, which is the foundation of point and interval estimations

The Central Limit Theorem states that when you have a sufficiently large sample, the distribution of the sample mean approximates a normal distribution, regardless of the shape of the population distribution. This allows us to make inferences about the population parameters using the sample mean and the standard error, which form the basis of point and interval estimation.

Discuss it

An event that cannot possibly occur has a probability of ________.

-1
0
0.5
1

An event that cannot possibly occur is said to be impossible and has a probability of 0. This is in line with the definition of probability as a measure that takes values between 0 and 1, inclusive.

Discuss it

Bayes' theorem combines our prior knowledge about an event with evidence from data to provide a ________ probability.

joint
marginal
posterior
prior

The theorem combines our prior knowledge (the prior probability) and evidence (the likelihood) to provide a new, updated probability of an event (the posterior probability).

Discuss it

What are the components of a confidence interval?

The population mean, the margin of error, and the level of confidence
The population mean, the sample size, and the standard error
The sample mean, the margin of error, and the level of confidence
The sample mean, the population size, and the standard deviation

A confidence interval is composed of three parts: a point estimate (the sample mean), a margin of error (which depends on the standard error and the Z-value or T-value), and the level of confidence (which indicates the probability that the interval estimate contains the population parameter).

Discuss it

What does it mean when we say a non-parametric test makes fewer assumptions about the data distribution?

The data distribution must be known
The data does not have to follow a specific distribution, such as normal
The data must be normally distributed
The data must be uniformly distributed

When we say a non-parametric test makes fewer assumptions about the data distribution, we mean that the data does not have to follow a specific distribution, such as the normal distribution. Non-parametric tests are distribution-free tests and make no assumption about the probability distribution of the variables.

Discuss it

The Pearson's Correlation Coefficient measures the ________ between two variables.

causal relationship
linear correlation
percentage similarity
rank

Pearson's Correlation Coefficient measures the linear correlation between two variables. It quantifies the degree to which two variables are related to each other.

Discuss it