What type of correlation does the Spearman's Rank Correlation test measure?

  • Correlation of variances
  • Linear correlation
  • Monotonic correlation
  • Polynomial correlation
Spearman's Rank Correlation test measures monotonic correlation, which indicates whether an increase in one variable will increase or decrease the other variable. It does not require the relationship between the variables to be linear.

The _______ of a confidence interval corresponds to the total area under the curve that is excluded on both sides of the curve.

  • Confidence level
  • Margin of error
  • Population parameter
  • Standard error
The margin of error of a confidence interval corresponds to the total area under the curve that is excluded on both sides of the curve. This margin of error determines the width of the confidence interval.

What happens if the assumption of homoscedasticity is violated in simple linear regression?

  • It has no effect on the regression model
  • It makes the regression model more accurate
  • It makes the regression model perfectly fit the data
  • It makes the standard errors and confidence intervals invalid
Homoscedasticity is the assumption that the variance of the residuals is constant across all levels of the independent variable. If this assumption is violated (a condition known as heteroscedasticity), it can lead to unreliable and inefficient estimates of the standard errors. This, in turn, can make the confidence intervals and hypothesis tests invalid.

In the context of a continuous random variable, the ________ function gives the probability that the variable takes a value less than or equal to a certain value.

  • Cumulative Distribution Function
  • Probability Density Function
  • Probability Mass Function
  • Random Function
The Cumulative Distribution Function (CDF) of a random variable is defined as the probability that the variable takes a value less than or equal to a certain value. The difference between discrete and continuous random variables is the way their probabilities are assigned.

The ________ is the most frequent value in a data set.

  • Mean
  • Median
  • Mode
  • nan
The mode is the value that appears most frequently in a data set. A set of data may have one mode, more than one mode, or no mode at all.

What is a uniform distribution?

  • A distribution in which all outcomes are equally likely
  • A distribution in which all outcomes follow a linear pattern
  • A distribution where outcomes are not related
  • A distribution where outcomes have a bell-shaped pattern
A uniform distribution, sometimes also known as a rectangular distribution, is a distribution that has constant probability. This distribution is often used in the cases where all outcomes are equally likely.

How is the 'mean' calculated for a data set?

  • By arranging the values in ascending order
  • By finding the middle value
  • By finding the most frequent value
  • By summing all values and dividing by the number of values
The mean of a data set is calculated by summing all the values and then dividing by the number of values. It gives the 'average' of the data and can be used for both discrete and continuous data sets. However, it can be heavily influenced by outliers or extreme values.

How can the effects of interaction be investigated in a two-way ANOVA?

  • By calculating the F-statistic for each independent variable
  • By checking the interaction term in the ANOVA table
  • By conducting a one-way ANOVA
  • By conducting post-hoc tests
The effects of interaction in a two-way ANOVA can be investigated by examining the interaction term in the ANOVA table. If the p-value for the interaction term is significant, it suggests that the effect of one independent variable on the dependent variable depends on the level of the other independent variable. Post-hoc tests can then be conducted to investigate the interaction effects further.

How does the presence of multicollinearity affect the standard errors of the regression coefficients?

  • It does not affect the standard errors.
  • It makes them equal to zero.
  • It makes them larger.
  • It makes them smaller.
In the presence of multicollinearity, the standard errors of the affected coefficients are inflated, leading to less precise and unstable estimates of the regression coefficients.

How can Bayes' theorem be applied to hypothesis testing?

  • All of the above
  • It can't be used in hypothesis testing
  • It is used to calculate the probability of the null hypothesis given the data
  • It is used to reject or fail to reject the null hypothesis
Bayes' theorem can be applied to hypothesis testing by calculating the probability of the hypothesis given the observed data. This differs from traditional frequentist hypothesis testing, where the data is assumed given and the hypothesis is tested.