SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. A numerical summary of how tightly the values are clustered around the 'center'
spread
response variable
median
simpson's paradox
2. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small
leverage
blinding
median
linear model
3. A variable whose levels are controlled by the experimenter
units
standard deviation
population parameter
factor
4. Control - randomize - replicate - block
independence
principles of experimental design
simple random sample
standardizing
5. An observational study in which subjects are followed to observe future outcomes
single-blind
data table
prospective study
conditional distribution
6. The square root of the variance
normal percentile
standard deviation
center
variance
7. When the levels of one factor are associated with the levels of another factor so their effects cannot be separated
parameter
confounded
slope
population parameter
8. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____
randomized block
bias
marginal distribution
subset
9. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category
statistically significant
strength
pie chart
case
10. Gives the possible values of the variable and the frequency or relative frequency of each value
distribution
statistic
stem-and-leaf display
histogram
11. A variable that names categories (whether with words or numerals)
categorical variable
5-number summary
census
observational study
12. Sampling schemes that combine several sampling methods
units
multistage sample
completely randomized design
experiment
13. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value
direction
center
sampling frame
systematic sample
14. A numerically valued attribute of a model for a population
sample
sampling variability
population parameter
standard deviation
15. This - b0 - gives a starting value in y-units; it's the y-hat-value when x is 0
outcome
intercept
ladder of powers
model
16. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data
sampling frame
mode
subset
observational study
17. A sampling design in which entire groups are chosen at random
response bias
spread
unimodal
cluster sample
18. The difference between the first and third quartiles
spread
interquartile range
simulation component
parameter
19. An event is this if we know what outcomes could happen - but not which particular values will happen
confounded
changing center and spread
random
sample
20. Displays data that change over time
standardizing
subset
intercept
timeplot
21. A variable whose values are compared across different treatments
population
response
normal percentile
statistically significant
22. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values
independence
outlier
boxplot
model
23. A sample drawn by selecting individuals systematically from a sampling frame
center
systematic sample
convenience sample
timeplot
24. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other
uniform
parameter
census
symmetric
25. Found by substituting the x-value in the regression equation; they're the values on the fitted line
direction
control group
predicted value
population
26. The process - intervention - or other controlled circumstance applied to randomly assigned experimental units
treatment
outlier
regression to the mean
response variable
27. A distribution that's roughly flat
census
regression line
uniform
marginal distribution
28. Bias introduced to a sample when a large fraction of those sampled fails to respond
conditional distribution
representative
nonresponse bias
dotplot
29. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum
control group
stratified random sample
outlier
center
30. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment
control group
outcome
random
categorical variable
31. Places in order the effects that many re-expressions have on the data
sample
ladder of powers
multistage sample
single-blind
32. A distribution is this if it's not symmetric and one tail stretches out farther than the other
extrapolation
variance
skewed
leverage
33. Consists of the individuals who are conveniently available
convenience sample
standardizing
dotplot
re-express data
34. The best defense against bias - in which each individual is given a fair - random chance of selection
percentile
center
randomization
spread
35. The number of individuals in a sample
form
contingency table
re-express data
sample size
36. These are hard to generate - but several websites offer an unlimited supply of equally likely random values
random numbers
ladder of powers
predicted value
sample
37. The most basic situation in a simulation in which something happens at random
direction
comparing distributions
simulation component
pie chart
38. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals
least squares
simulation
convenience sample
response variable
39. The distribution of a variable restricting the who to consider only a smaller group of individuals
re-express data
r2
observational study
conditional distribution
40. Shows a bar representing the count of each category in a categorical variable
marginal distribution
rescaling
bar chart
form
41. The sum of squared deviations from the mean - divided by the count minus one
leverage
strength
double-blind
variance
42. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set
re-express data
shifting
cluster sample
changing center and spread
43. The entire group of individuals or instances about whom we hope to learn
random numbers
direction
population
quantitative variable
44. When either those who could influence or evaluate the results is blinded
single-blind
slope
center
marginal distribution
45. When groups of experimental units are similar - it is a good idea to gather them together into these
randomized block
block
residuals
mean
46. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant
form
census
sample survey
rescaling
47. The specific values that the experimenter chooses for a factor
extrapolation
area principle
level
normal percentile
48. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean
independence
response
spread
regression to the mean
49. An arrangement of data in which each row represents a case and each column represents a variable
matched
pie chart
data table
confounded
50. Holds information about the same characteristic for many cases
variable
spread
sample
least squares