Quantitative data represents quantities and can be measured on a ________ scale.

  • Categorical
  • Nominal
  • Numerical
  • Ordinal
Quantitative data represents quantities and can be measured on a numerical scale. It includes both discrete data, which can be counted (e.g., the number of students in a class), and continuous data, which can take any value within a range (e.g., the weight of a person).

What is the purpose of Pearson's Correlation Coefficient?

  • To compute the standard deviation of a dataset
  • To determine the linear relationship between two variables
  • To find the mean of a set of values
  • To transform qualitative data into quantitative data
Pearson's correlation coefficient (denoted as r) measures the strength and direction of the linear association between two continuous variables, that is, the degree to which the paired data points lie on a straight line. Its values lie between -1 and 1, where 1 indicates a perfect positive linear relationship, -1 a perfect negative linear relationship, and 0 no linear relationship. A value of 0 does not rule out a non-linear relationship between the variables.
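As an illustration, here is a minimal sketch of the usual formula for r (the sum of cross-deviations divided by the square root of the product of the squared deviations). The function name `pearson_r` and the sample data are made up for this example.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations (numerator) and squared deviations (denominator).
    cross = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cross / math.sqrt(ss_x * ss_y)

hours = [1, 2, 3, 4, 5]                      # hypothetical data
perfect_line = [2 * h + 1 for h in hours]    # exactly linear, positive slope
parabola = [(h - 3) ** 2 for h in hours]     # related to hours, but not linearly

r_line = pearson_r(hours, perfect_line)      # close to 1
r_parabola = pearson_r(hours, parabola)      # close to 0
```

The parabola case shows why r near 0 should be read as "no linear relationship" rather than "no relationship": the second variable is fully determined by the first, yet the points do not lie near any straight line.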

If the population standard deviation is unknown, we use the sample standard deviation to estimate the ________ of the mean.

  • Confidence interval
  • Range
  • Standard error
  • Variability
If the population standard deviation is unknown, the sample standard deviation s is used to estimate the standard error of the mean, computed as s / sqrt(n) for a sample of size n. The standard error measures how much the sample mean is expected to vary from the true population mean across repeated samples.
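A short sketch of that estimate, using the sample standard deviation (n - 1 denominator) from Python's `statistics` module. The helper name `standard_error` and the data are illustrative only.

```python
import math
import statistics

def standard_error(sample):
    """Estimated standard error of the mean: s / sqrt(n)."""
    if len(sample) < 2:
        raise ValueError("need at least two observations")
    s = statistics.stdev(sample)   # sample standard deviation (n - 1 denominator)
    return s / math.sqrt(len(sample))

weights = [2, 4, 4, 4, 5, 5, 7, 9]   # hypothetical measurements
se = standard_error(weights)
```

Because n sits under a square root, quadrupling the sample size only halves the standard error.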

In ANOVA, if the F statistic is significantly high, it suggests that the null ________ should be rejected.

  • Distribution
  • Hypothesis
  • Model
  • Theory
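If the F statistic in an ANOVA is large enough to be statistically significant (its p-value falls below the chosen significance level), it suggests that the null hypothesis should be rejected. The null hypothesis in ANOVA is typically that all group means are equal, so rejecting it indicates that at least one group mean differs from the others.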
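To make the F statistic concrete, the following sketch computes it for a one-way ANOVA as the ratio of the between-group mean square to the within-group mean square. The function name `one_way_f` and the groups are invented for illustration; deciding significance would still require comparing F with the F distribution for (k - 1, N - k) degrees of freedom, which is not done here.

```python
def one_way_f(groups):
    """F statistic for a one-way ANOVA over a list of groups (lists of numbers)."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    if k < 2 or n_total <= k:
        raise ValueError("need at least two groups and more observations than groups")
    means = [sum(g) / len(g) for g in groups]
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Variation of group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Variation of observations around their own group mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between / ms_within

different_means = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]   # hypothetical groups
equal_means = [[1, 2, 3], [2, 3, 1], [3, 1, 2]]       # every group mean is 2

f_different = one_way_f(different_means)   # large: group means are far apart
f_equal = one_way_f(equal_means)           # zero: group means are identical
```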

What is a uniform distribution?

  • A bell-shaped distribution
  • A distribution with different probabilities for different outcomes
  • A distribution with the same probability for all outcomes
  • A skewed distribution
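A uniform distribution, also called a rectangular distribution, is a type of probability distribution in which all outcomes are equally likely. In the discrete case, every outcome has the same probability (e.g., each face of a fair die). In the continuous case, every interval of equal length within the distribution's support has the same probability.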
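For the continuous case, a minimal sketch: the probability of an interval is its length (clipped to the support) divided by the length of the support. The function name and the numbers are assumptions chosen for the example.

```python
def uniform_interval_prob(lo, hi, a, b):
    """P(a <= X <= b) for X uniformly distributed on [lo, hi]."""
    if hi <= lo:
        raise ValueError("support must satisfy lo < hi")
    # Clip the query interval to the support before measuring its length.
    left = max(a, lo)
    right = min(b, hi)
    return max(0.0, right - left) / (hi - lo)

# Two intervals of equal length anywhere inside [0, 10] get equal probability.
p_early = uniform_interval_prob(0, 10, 2, 4)
p_late = uniform_interval_prob(0, 10, 6, 8)
```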

The geometric mean is particularly useful when comparing different items with very different ________.

  • Mean values
  • Median values
  • Mode values
  • Ranges
The geometric mean is particularly useful when comparing different items with very different ranges. It is the n-th root of the product of n positive values, and it is commonly used to average growth rates, such as population growth or financial growth, where each year's value is relative to the previous year's value.
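The growth-rate case can be sketched as follows. The helper `geometric_mean` computes the n-th root of the product via logarithms; the growth factors are hypothetical (a 50% gain, a 20% loss, then a 20% gain).

```python
import math

def geometric_mean(values):
    """n-th root of the product of n positive values, computed via logs."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("geometric mean needs at least one value, all positive")
    return math.exp(sum(math.log(v) for v in values) / len(values))

growth_factors = [1.50, 0.80, 1.20]   # hypothetical year-over-year factors
gm = geometric_mean(growth_factors)
am = sum(growth_factors) / len(growth_factors)

start = 100.0
actual_end = start * 1.50 * 0.80 * 1.20   # value after compounding each year
end_using_gm = start * gm ** 3            # reproduces actual_end
end_using_am = start * am ** 3            # overstates the final value
```

Compounding the geometric mean over the same number of periods recovers the actual final value, whereas the arithmetic mean of the factors overstates it.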

How does the correlation coefficient change when you switch the X and Y variables?

  • It changes sign
  • It decreases
  • It increases
  • It remains the same
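The correlation coefficient remains the same when you switch the X and Y variables. The formula is symmetric in the two variables: the covariance of X and Y equals the covariance of Y and X, and the denominator is the product of the two standard deviations. Correlation therefore measures the strength and direction of a relationship between two variables, not the dependency of one on the other.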

What is meant by the term "multicollinearity" in multiple linear regression?

  • The dependent variables are correlated with each other
  • The error terms are correlated with each other
  • The independent variables are correlated with each other
  • The residuals are correlated with each other
In multiple linear regression, multicollinearity refers to a situation in which two or more independent variables are highly linearly related. This causes problems because it inflates the standard errors of the regression coefficients, which makes the individual coefficients hard to interpret and the estimates unstable: small changes in the data can produce large changes in the fitted coefficients.
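One common diagnostic is the variance inflation factor (VIF), defined as 1 / (1 - R^2), where R^2 comes from regressing one predictor on the others. The sketch below covers only the two-predictor case, where that R^2 equals the squared Pearson correlation between the two predictors. The function name and data are hypothetical, and the threshold mentioned in the comment is a rule of thumb, not a fixed standard.

```python
def vif_two_predictors(x1, x2):
    """VIF for a model with exactly two predictors: 1 / (1 - r^2)."""
    n = len(x1)
    if n != len(x2) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    m1 = sum(x1) / n
    m2 = sum(x2) / n
    cross = sum((a - m1) * (b - m2) for a, b in zip(x1, x2))
    ss1 = sum((a - m1) ** 2 for a in x1)
    ss2 = sum((b - m2) ** 2 for b in x2)
    r_squared = cross ** 2 / (ss1 * ss2)
    if r_squared >= 1.0:
        raise ValueError("predictors are perfectly collinear; VIF is undefined")
    return 1.0 / (1.0 - r_squared)

size = [1, 2, 3, 4, 5]            # hypothetical predictor
rooms = [2, 4, 6, 8, 11]          # almost a multiple of size
age = [3, 1, 4, 1, 5]             # only weakly related to size

vif_high = vif_two_predictors(size, rooms)   # far above the usual 5-10 rule of thumb
vif_low = vif_two_predictors(size, age)      # close to 1
```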

How do we define expectation of a random variable?

  • It is the most likely outcome of the variable
  • It is the range of the variable
  • It is the variance of the variable
  • It is the weighted average of all possible values the variable can take, with weights being the respective probabilities
The expected value or expectation of a random variable is a key concept in probability and statistics and represents the weighted average of all possible values that the variable can take, with weights being the respective probabilities.
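For a discrete random variable this weighted average is the sum of each value times its probability. A minimal sketch, with exact fractions to avoid rounding; the `expectation` helper and both dice are made up for the example.

```python
from fractions import Fraction

def expectation(pmf):
    """Expected value of a discrete random variable given as {value: probability}."""
    if sum(pmf.values()) != 1:
        raise ValueError("probabilities must sum to 1")
    return sum(value * prob for value, prob in pmf.items())

fair_die = {face: Fraction(1, 6) for face in range(1, 7)}

# Hypothetical loaded die: a six comes up half the time.
loaded_die = {face: Fraction(1, 10) for face in range(1, 6)}
loaded_die[6] = Fraction(1, 2)

e_fair = expectation(fair_die)       # Fraction(7, 2)
e_loaded = expectation(loaded_die)   # Fraction(9, 2)
```

The loaded die shows that the expectation is not the most likely outcome: the most likely face is 6, while the expected value is 4.5, which is not even a possible roll.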

The Chi-square test for goodness of fit is only applicable to ________ data.

  • categorical
  • continuous
  • normally distributed
  • time series
The Chi-square test for goodness of fit is applicable only to categorical data. It is used to determine whether the observed frequencies in each category differ significantly from the frequencies expected under a hypothesized distribution.
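The test statistic is the sum over categories of (observed - expected)^2 / expected. The sketch below computes it for hypothetical die-roll counts and compares it with 11.070, the standard table critical value for 5 degrees of freedom at the 5% significance level. The function name and data are assumptions for illustration.

```python
def chi_square_gof(observed, expected):
    """Chi-square goodness-of-fit statistic for category counts."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    if any(e <= 0 for e in expected):
        raise ValueError("expected counts must be positive")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

observed = [8, 12, 9, 11, 10, 10]                    # hypothetical counts per die face
expected = [sum(observed) / len(observed)] * len(observed)   # fair die: equal counts

statistic = chi_square_gof(observed, expected)
degrees_of_freedom = len(observed) - 1               # categories minus one

CRITICAL_5PCT_DF5 = 11.070   # table value for df = 5, alpha = 0.05
reject_null = statistic > CRITICAL_5PCT_DF5
```

Here the statistic is far below the critical value, so these counts give no evidence against the die being fair.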