The Central Limit Theorem states that the sampling distribution of the sample means approaches a ________ distribution as the sample size gets larger, regardless of the shape of the population distribution.

Poisson
binomial
normal
uniform

The Central Limit Theorem is a fundamental theorem in statistics that states that the sampling distribution of the sample means approaches a normal distribution as the sample size gets larger, no matter what the shape of the population distribution. This outcome is significant because it enables us to make statistical inferences about the population mean based on the distribution of sample means.

Discuss it

What is the purpose of hypothesis testing in statistics?

To compare the sample mean to the population mean
To make inferences about a population based on sample data
To understand the distribution of the data
To visualize the data

Hypothesis testing is a statistical method that is used in making statistical decisions using experimental data. It's an inferential statistic that allows us to infer if our observed results deviate from null hypothesis by chance or by a true statistical difference.

Discuss it

The p-value in a hypothesis test is the probability of getting a sample statistic as extreme as the test statistic, given that the _______ hypothesis is true.

Alternative
Null
Original
Random

In the context of hypothesis testing, the p-value is the probability of observing a test statistic as extreme as the one calculated, assuming that the null hypothesis is true.

Discuss it

What are the assumptions required for a distribution to be considered a Poisson distribution?

The events are dependent on each other
The events are occurring at a constant mean rate and independently of the time since the last event
The events have more than two possible outcomes
The number of trials is fixed

The key assumptions for a Poisson distribution are that the events are happening at a constant mean rate and independently of the time since the last event. This is often used for modeling the number of times an event occurs in a given interval of time or space.

Discuss it

What is the relationship between the mean and the standard deviation in a normal distribution?

The mean is always larger than the standard deviation
The mean is the midpoint of the distribution, and the standard deviation measures the spread
The standard deviation is always larger than the mean
There is no relationship between the mean and the standard deviation

In a normal distribution, the mean is the center of the distribution and represents the "average" value. The standard deviation measures the dispersion around the mean. Roughly 68% of the data falls within one standard deviation of the mean in a normal distribution.

Discuss it

The conditional probability of A given B is denoted as ________.

P(A + B)
P(A / B)
P(A B)
P(A ∩ B)

The conditional probability of A given B is denoted as P(A

Discuss it

What is the primary goal of random sampling?

To always select the same individuals
To ensure that every member of the population has an equal chance of being selected
To select individuals who are likely to give the desired results
To select the individuals who are easiest to reach

The primary goal of random sampling is to ensure that every member of the population has an equal chance of being selected. This helps to reduce bias and increase the likelihood that the sample is representative of the population, which makes the results more valid and generalizable.

Discuss it

_______ regression is a method used to handle multicollinearity by adding a degree of bias to the regression estimates.

Logistic
Polynomial
Ridge
Simple linear

Ridge regression handles multicollinearity by introducing a degree of bias to the regression estimates, reducing their variance, and making them more reliable.

Discuss it

What are the assumptions made by the Spearman’s Rank Correlation test?

The data is continuous and the relationship is monotonic
The data is normally distributed and linear
The data is ordinal and the relationship is linear
The data is ordinal or continuous and the relationship is monotonic

The Spearman’s Rank Correlation test assumes that the variables are ordinal or continuous and that the relationship between them is monotonic. It does not require the relationship to be linear or the data to be normally distributed.

Discuss it

What are the limitations of using mean as a measure of central tendency?

It can't be used with large data sets
It can't be used with small data sets
It is difficult to calculate
It is highly sensitive to outliers

The main limitation of the mean as a measure of central tendency is that it is highly sensitive to outliers or extreme values. An outlier can skew the mean and make it a less accurate representation of the data. Moreover, mean does not describe the middle value or most common value in the dataset, which are often important characteristics.

Discuss it