When is it more appropriate to use the Wilcoxon Signed Rank Test rather than the Sign Test?

When data is nominal
When data is normally distributed
When data is ordinal or interval
When sample size is large

The Wilcoxon Signed Rank Test is more appropriate to use when data is ordinal or interval because it takes into account the magnitude of the differences between paired observations, unlike the Sign Test which only considers the sign of the differences.

Discuss it

What does polynomial regression allow you to model?

Correlations
Data distribution
Non-linear relationships
Relationships between variables

Polynomial regression allows modeling of non-linear relationships. Unlike linear regression that models relationships between variables as a straight line, polynomial regression models relationships as curves, better capturing relationships that change in direction at different levels of the independent variables.

Discuss it

What is the relationship between variance and the square of the standard deviation?

Standard deviation is always larger
They are the same
Variance is always larger
Variance is the square root of the standard deviation

Variance and the square of the standard deviation are the same. The variance is calculated as the mean of the squared deviations from the mean, and the standard deviation is the square root of this variance. Hence, squaring the standard deviation gives us the variance.

Discuss it

In probability, an ________ is the set of possible results of an experiment.

Event
Outcome
Probability Space
Sample Space

In probability theory, an "outcome" is a possible result of an experiment or trial. For example, if you toss a coin, the possible outcomes are heads or tails. Each outcome of an experiment corresponds to a unique event.

Discuss it

When adding polynomial terms or interaction effects, what key assumption of regression might be violated?

Homoscedasticity
Independence of observations
Linearity
Normality of errors

When adding polynomial terms or interaction effects to a regression model, the assumption of linearity might be violated. The linearity assumption in regression analysis states that the relationship between the independent and dependent variables is linear, i.e., a change in the independent variable will result in a constant change in the dependent variable. When adding polynomial terms or interaction effects, we are essentially modeling a non-linear relationship.

Discuss it

How can you identify the presence of bimodal distribution in data?

By looking at the mean and median
By looking at the skewness
By looking at the standard deviation
By looking for two peaks in a histogram

A bimodal distribution is one that has two different modes, or peaks. This can often be identified in a histogram, where two separate areas of the data have higher frequencies. This might indicate that the data is drawn from two different populations.

Discuss it

The ________ of a distribution is the point of maximum frequency.

Mean
Median
Mode
Standard deviation

The mode of a distribution is the point of maximum frequency. It represents the value that appears most frequently in a data set. A distribution can be unimodal (one mode), bimodal (two modes), or multimodal (more than two modes).

Discuss it

When two or more independent variables in a regression model are highly correlated, it's known as ________.

Collinearity
Interaction
Multicollinearity
Overfitting

This is known as multicollinearity. In regression analysis, multicollinearity refers to a situation where two or more independent variables are highly correlated. This can make it difficult to determine the effect of each individual variable on the dependent variable and can lead to unstable and unreliable estimates.

Discuss it

A statistical technique that uses several explanatory variables to predict the outcome of a response variable is called ________.

ANOVA
correlation
multiple linear regression
simple linear regression

Multiple linear regression is a statistical technique used to predict the outcome of a response variable based on the value of two or more explanatory variables.

Discuss it

What are the key properties of a Bernoulli distribution?

It can only take positive integer values
It has a bell-shaped curve
It has a single trial with two possible outcomes
It models a series of independent trials

A Bernoulli distribution is a discrete probability distribution of a random variable which takes the value 1 with probability p and the value 0 with probability q=1-p. It models a single trial with two possible outcomes, often labelled 'success' and 'failure'.

Discuss it