What does it mean when we say a non-parametric test makes fewer assumptions about the data distribution?

  • The data distribution must be known
  • The data does not have to follow a specific distribution, such as normal
  • The data must be normally distributed
  • The data must be uniformly distributed
When we say a non-parametric test makes fewer assumptions about the data distribution, we mean that the data does not have to follow a specific distribution, such as the normal distribution. Non-parametric tests are distribution-free tests and make no assumption about the probability distribution of the variables.

The Pearson's Correlation Coefficient measures the ________ between two variables.

  • causal relationship
  • linear correlation
  • percentage similarity
  • rank
Pearson's Correlation Coefficient measures the linear correlation between two variables. It quantifies the degree to which two variables are related to each other.

What is the null hypothesis in a one-way ANOVA test?

  • All group means are different
  • All group means are equal
  • The sample is not representative of the population
  • The variance is the same across all groups
The null hypothesis in a one-way ANOVA test is that all group means are equal. This hypothesis is tested against the alternative that at least one group mean is different.

How can the harmonic mean be useful in the analysis of rates?

  • It gives more weight to larger rates
  • It gives more weight to smaller rates
  • It is not useful in analyzing rates
  • It treats all rates equally
The harmonic mean is useful in the analysis of rates as it gives more weight to smaller values. This can be particularly useful when dealing with rates or ratios, for example, in calculating average speed. The harmonic mean is the reciprocal of the arithmetic mean of the reciprocals, making it a robust measure for rates.

What are the common techniques used for model selection in multiple regression?

  • Chi-square test
  • F-test
  • Forward selection, backward elimination, and stepwise regression.
  • T-test
Techniques like forward selection, backward elimination, and stepwise regression are commonly used for model selection in multiple regression.

The presence of a pattern in the residuals of a multiple linear regression model can indicate violations of the ________ assumption.

  • homoscedasticity
  • independence
  • linearity
  • normality
The presence of a pattern in the residuals of a multiple linear regression model can indicate a violation of the independence assumption. This assumption requires that the residuals, which are the differences between the observed and predicted values of the dependent variable, are independent of each other. If a pattern is observed in the residuals, it may indicate that the residuals are not independent, and the model may not provide valid results.

The Sign Test ignores the ________ of the differences between paired observations.

  • direction
  • distribution
  • magnitude
  • nan
The Sign Test ignores the magnitude of the differences between paired observations, and only considers the sign of the differences.

How does the sample size affect the width of the confidence interval?

  • Larger sample size makes the interval narrower
  • Larger sample size makes the interval wider
  • Sample size has no effect on the interval
  • nan
Larger sample sizes reduce the standard error and thus, the width of the confidence interval becomes narrower. This means that with larger samples, our estimates are more precise.

How is the Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy used in factor analysis?

  • It is used to assess the appropriateness of factor analysis
  • It is used to determine the number of factors to retain
  • It is used to test the assumption of homoscedasticity
  • It is used to test the assumption of normality
The Kaiser-Meyer-Olkin (KMO) measure is a measure of how suitable the data is for factor analysis. It determines the adequacy for each observed variable and for the complete model. KMO estimates vary between 0 and 1. A value of 0 indicates that the sum of partial correlations is large relative to the sum correlations, implying diffusion in the pattern of correlations (hence, factor analysis will be likely inappropriate).

What type of data can be further classified as discrete and continuous?

  • Categorical data
  • Nominal data
  • Qualitative data
  • Quantitative data
Quantitative data can be further classified as discrete and continuous. Discrete data is countable and has a finite number of possible values, such as the number of students in a class. Continuous data can take any value within a given range, such as the weight of a person.