The ________ is the standard deviation of the sampling distribution of a statistic.
- Median deviation
- Population deviation
- Sample deviation
- Standard error
The standard error is the standard deviation of the sampling distribution of a statistic. It measures the dispersion of the sample means around the true population mean.
What are the two branches of statistics?
- Descriptive and hypothetical
- Descriptive and inferential
- Inferential and hypothetical
- Predictive and inferential
The two main branches of statistics are descriptive and inferential. Descriptive statistics involves methods of organizing, picturing, and summarizing information from data. It provides simple summaries about the sample and measures, such as mean, median, mode, etc. Inferential statistics, on the other hand, involves methods of using information from a sample to draw conclusions (inferences) about the population. It includes various techniques like hypothesis testing, regression analysis, etc.
The Mann-Whitney U test assumes that the samples are ________ and ________.
- dependent, heterogeneous
- dependent, homogeneous
- independent, heterogeneous
- independent, homogeneous
The Mann-Whitney U test assumes that the samples are independent (not paired or related) and heterogeneous (can have different variances).
The ________ of an event A is the event that A does not occur.
- Complement
- Mirror
- Opposite
- Substitute
In probability theory, the "complement" of an event A is the event that A does not occur, often denoted as A'. If the probability of event A happening is P(A), then the probability of it not happening, or its complement, is P(A') = 1 - P(A).
In a two-way ANOVA, ________ refers to the effect of one independent variable on the dependent variable, adjusting for the effects of the other independent variables.
- Interaction effect
- Main effect
- Simple effect
- nan
In a two-way ANOVA, the main effect refers to the effect of one independent variable on the dependent variable, adjusting for the effects of the other independent variables. It provides the overall effect of one factor on the outcome, irrespective of the levels of other factors.
What assumptions must be met for a Chi-square test for independence to be valid?
- The data must be continuous
- The data must be normally distributed
- The observations must be independent and the expected frequency of each category must be at least 5
- The sample size must be larger than 30
For a Chi-square test for independence to be valid, the observations must be independent, and the expected frequency of each category must be at least 5.
The ________ in Spearman's Rank Correlation indicates the strength and direction of association between two ranked variables.
- Coefficient
- Median
- P-value
- Rank
The coefficient in Spearman's Rank Correlation indicates the strength and direction of the association between two ranked variables. This coefficient can range from -1 (perfect negative correlation) to 1 (perfect positive correlation).
What is the purpose of a scatter plot?
- To compare two numerical variables
- To display a distribution
- To show the relationship between three variables
- To visualize categorical variables
A scatter plot is a graphical representation that uses dots to represent the values obtained for two different variables - one plotted along the x-axis and the other plotted along the y-axis. It helps to identify the type of relationship (if any) between two numerical variables.
Does PCA require the features to be on the same scale?
- Depends on the algorithm used
- Depends on the data
- No
- Yes
Yes, PCA requires the features to be on the same scale. If features are on different scales, PCA might end up giving higher weightage to features with higher variance, which could lead to incorrect principal components. So, it's typically a good practice to standardize the data before applying PCA.
What are communalities in factor analysis?
- They are the shared variance between variables
- They are the unique variances of variables
- They are the variances of the factors after rotation
- They represent the total variance of the factors
In factor analysis, communalities are the proportion of variance in each variable that is accounted for, or shared among the factors. They represent the shared variance between variables.