The _________ is crucial in hypothesis testing and the construction of confidence intervals.
- Central Limit Theorem
- Law of Large Numbers
- Probability Rule
- Sampling Distribution
The Central Limit Theorem is crucial in hypothesis testing and the construction of confidence intervals. By ensuring the normality of the distribution, it allows us to make inferences about the population from our sample data and to assess the likelihood that our sample mean is a reliable estimate of the population mean.
The ________ score is a measure of how close each point in one cluster is to the points in the neighboring clusters.
- boundary
- distance
- proximity
- silhouette
The silhouette score is a measure of how close each point in one cluster is to the points in the neighboring clusters. It ranges from -1 (incorrect clustering) to +1 (highly dense clustering). 0 indicates overlapping clusters.
What types of scales of measurement are suitable for non-parametric tests?
- Nominal, ordinal, interval, and ratio
- Only interval and ratio
- Only nominal and ordinal
- Only ratio
Non-parametric tests can be used with nominal, ordinal, interval, and ratio scales of measurement. This is one of the reasons why non-parametric tests are sometimes chosen over parametric ones, as they can handle data that are not interval or ratio (which are required for many parametric tests).
In a multiple linear regression model, the assumption that the variance of the residuals is the same for all levels of the predictors is known as __________.
- Autocorrelation
- Homoscedasticity
- Linearity
- Multicollinearity
Homoscedasticity refers to the assumption in regression analysis that the variance of the residuals (or "errors") is constant across all levels of the independent variables.
Simple linear regression is a method used to predict a ________ variable using a ________ variable.
- continuous, discrete
- dependent, independent
- discrete, continuous
- independent, dependent
Simple linear regression is a statistical method that allows us to summarize and study relationships between two continuous (quantitative) variables: One variable, denoted x, is regarded as the predictor, explanatory, or independent variable. The other variable, denoted y, is regarded as the response, outcome, or dependent variable.
Can the probability of an event be a negative number?
- It depends on the event
- No
- Only if the event is impossible
- Yes
The probability of an event cannot be a negative number. By definition, the probability of an event is a number between 0 and 1, inclusive.
What is the key characteristic of a symmetric distribution?
- It has a mean of zero
- It has a mode at the peak
- It has no outliers
- It has the same shape on the left and right when split vertically at the center
The key characteristic of a symmetric distribution is that it has the same shape on the left and right when split vertically at the center (i.e., about the mean). This means that the frequencies of corresponding values on either side of the center are equal.
The measure of how much individual sample means will vary is called the __________ error.
- Absolute
- Margin of
- Sampling
- Standard
The standard error of a statistic is a measure of the statistical accuracy of an estimate, equal to the standard deviation of the theoretical distribution of a large population of such estimates. It is used to test hypotheses on the grounds of a set of data. For sample means, the standard error tells us how the mean varies from one sample to another.
How does changing the units of measurement affect the standard deviation and variance of a dataset?
- It decreases them
- It depends on the new units
- It doesn't affect them
- It increases them
Changing the units of measurement will change the scale of the data, and hence will affect the values of standard deviation and variance. If the data is scaled up, both measures will increase, and if the data is scaled down, they will decrease. However, the relative dispersion, as measured by the coefficient of variation, will remain the same.
What is the principle of equally likely outcomes?
- All outcomes are equally probable
- All outcomes are identical
- All outcomes are independent
- All outcomes are mutually exclusive
The principle of equally likely outcomes is a basic assumption in the classical definition of probability. It states that if an experiment has n outcomes, and there's no reason to believe that any one outcome is more likely than any other, then each outcome is assumed to have an equal probability of 1/n. For example, in tossing a fair coin, heads and tails are equally likely.