A ________ plot uses rectangular bars to represent data. The length of the bar corresponds to the frequency of data.

Bar
Line
Pie
Scatter

A bar plot uses rectangular bars to represent data. The length (or height, if vertical) of each bar corresponds to the frequency or amount of data it represents. Bar plots are particularly useful for comparing categories of data.

Discuss it

The ________ is a statistic that provides an estimate of the center of a distribution.

mean
median
mode
range

The mean, often known as the average, is a measure of central tendency that provides an estimate of the center of a distribution. It's calculated by adding all the numbers in the dataset and then dividing by the number of values in the dataset. However, it's worth noting that the mean can be skewed by extremely large or small values.

Discuss it

What does a correlation coefficient of 0 indicate?

A perfect negative correlation
A perfect positive correlation
A very strong correlation
No linear correlation

A correlation coefficient of 0 indicates no linear correlation between the two variables. This means that as one variable changes, there's no predictable pattern of change in the other variable. However, this doesn't rule out the possibility of a non-linear relationship.

Discuss it

What is conditional probability?

The probability of an event given the occurrence of another event
The probability of an event regardless of the occurrence of other events
The probability that both of two events occur
The ratio of the number of outcomes in an event to the number of outcomes in a sample space

Conditional probability is the probability of an event (A) given that another event (B) has already occurred. It's a fundamental concept in probability theory and is often denoted as P(A

Discuss it

Which measure of dispersion considers all the data points in a dataset?

Interquartile range
Mode
Range
Variance

Variance is a measure of dispersion that considers all data points in the dataset. It is calculated by taking the average of the squared differences from the mean.

Discuss it

The residuals in a simple linear regression model should be randomly distributed. This is referred to as the assumption of ________.

autocorrelation
heteroscedasticity
independence
multicollinearity

The assumption of independence in simple linear regression implies that the residuals (errors) between the observed and predicted values are not correlated. That is, the error value for one observation does not depend on the error value of any other observation. This is typically checked by examining a plot of the residuals for any visible pattern.

Discuss it

What is the interpretation of a 95% confidence interval that contains zero?

The sample mean is significantly different from zero
The sample size was not large enough to determine a precise estimate of the population parameter
There is a 95% chance that the true population parameter is zero
There is no significant evidence to suggest that the true population parameter is different from zero

If a 95% confidence interval includes zero, it means that there is no significant evidence to suggest that the true population parameter is different from zero. This is often interpreted in the context of hypothesis testing, where a confidence interval that includes zero implies that we fail to reject the null hypothesis.

Discuss it

What is an interaction effect in regression analysis?

It's when one variable has a stronger effect than another
It's when the effect of one variable changes based on the level of another variable
It's when two variables have no effect on each other
It's when two variables have the same effect on the dependent variable

An interaction effect in regression analysis is when the effect of one independent variable on the dependent variable changes based on the level of another independent variable. This is captured by including an interaction term in the regression model.

Discuss it

How can you detect multicollinearity in multiple linear regression?

By checking the correlation among predictors
By checking the normality of residuals
By looking at the scatter plot of residuals
By using the F-test

Multicollinearity can be detected by examining the correlations among the predictors. High correlation among the predictors indicates the presence of multicollinearity. More formal methods such as the Variance Inflation Factor (VIF) can also be used.

Discuss it

How does kurtosis relate to the tails of a distribution?

Kurtosis does not relate to the tails of a distribution
Kurtosis is a measure of the weight in the tails
Kurtosis relates to the length of the tails
Kurtosis relates to the width of the tails

Kurtosis is a statistical measure used to describe the distribution of observed data around the mean. It is a measure of the heaviness of the tails of a distribution. A high kurtosis in a data set is a signal that data has heavy tails or outliers.

Discuss it