What is the difference between a discrete and a continuous probability distribution?

Discrete distributions are always normal; continuous distributions are always uniform
Discrete distributions are for qualitative data; continuous distributions are for quantitative data
Discrete distributions involve countable outcomes; continuous distributions involve uncountable outcomes
There is no difference

Discrete probability distributions are used when the outcomes are countable or discrete. Examples include the number of heads when flipping coins or the number of defective items in a batch. Continuous probability distributions are used when outcomes are uncountably infinite, typically involving measurements. Examples include the height of individuals or the time it takes to run a mile.

Discuss it

What are the dependent and independent variables in simple linear regression?

Both variables are dependent
Both variables are independent
The dependent variable is the outcome we are trying to predict, and the independent variable is the predictor
The dependent variable is the predictor, and the independent variable is the outcome we are trying to predict

In simple linear regression, the dependent variable is the outcome we are trying to predict, and the independent variable is the predictor. The dependent variable is also known as the response or target variable, and the independent variable is also known as the explanatory or feature variable.

Discuss it

When is it appropriate to use a binomial distribution?

When each trial in an experiment has exactly two possible outcomes
When the data is continuous
When the outcomes are not independent
When the probability of success changes with each trial

A binomial distribution is appropriate when conducting an experiment where each trial has exactly two possible outcomes (often termed success and failure), the trials are independent, and the probability of success is constant across trials.

Discuss it

The ________ is used to fit the regression line in a simple linear regression model.

least squares method
mean
median
mode

The least squares method is used to find the best-fitting line through the data points. This is done by minimizing the sum of the squares of the vertical distances of the points from the line.

Discuss it

In multiple linear regression, ________ is used to test the overall significance of the model.

the Chi-square statistic
the F-statistic
the Z-statistic
the t-statistic

In multiple linear regression, the F-statistic is used to test the overall significance of the model. This test checks the null hypothesis that all regression coefficients are zero against the alternative that at least one of them is not zero. If the F-statistic is significantly large and the corresponding p-value is small, we reject the null hypothesis, concluding that the regression model has some validity in predicting the outcome variable.

Discuss it

What is the primary purpose of conducting an ANOVA test?

To calculate the standard deviation of a dataset
To determine the mode of a set of data
To find the correlation between two variables
To test the equality of means among groups

The primary purpose of an ANOVA test is to compare the means of different groups and determine whether any of those means are significantly different from each other.

Discuss it

Why is interval estimation generally preferred over point estimation?

Because it gives more accurate results
Because it is easier to calculate
Because it is less affected by outliers
Because it provides a range of possible values rather than a single point

Interval estimation is generally preferred over point estimation because it provides a range of possible values rather than a single value. This range of values gives a better understanding of the uncertainty around the estimate, hence, it provides more information than a single point estimate.

Discuss it

Spearman's Rank Correlation is especially useful when the relationship between variables is ________, but not necessarily linear.

Bimodal
Monotonic
Negative
Positive

Spearman's Rank Correlation is especially useful when the relationship between variables is monotonic, but not necessarily linear. A monotonic relationship is one where the variables tend to change together, but not necessarily at a constant rate.

Discuss it

If the null hypothesis is true in ANOVA, the F-statistic follows a ________ distribution.

Binomial
Chi-Square
F
Normal

In ANOVA, if the null hypothesis is true, the F-statistic follows an F-distribution. The F-distribution is a probability distribution that is used most commonly in Analysis of Variance.

Discuss it

What does a Pearson Correlation Coefficient of 0 indicate?

No correlation
Perfect negative correlation
Perfect positive correlation
Weak positive correlation

A Pearson correlation coefficient of 0 indicates no correlation. This means that the variables are independent and there is no linear relationship between them.

Discuss it