Test your basic knowledge | AP Statistics Vocab
Subjects: statistics, ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can study here.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So you may at times find the answers obvious, but you will see that it reinforces your understanding each time you take the test.
1. Any systematic failure of a sampling method to represent its population; common errors are voluntary response, undercoverage, nonresponse ____, and response ____
data table
marginal distribution
bias
68-95-99.7 rule
2. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
residuals
outliers
symmetric
mode
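The subtraction in question 2 can be sketched in a few lines of Python. This is a minimal illustration, not part of the original quiz; the function name `residuals` and the sample numbers are made up for the example.

```python
def residuals(observed, predicted):
    """Return observed - predicted for each pair of values."""
    return [y - y_hat for y, y_hat in zip(observed, predicted)]

# Hypothetical observed values and model predictions.
observed = [3.0, 5.0, 7.5]
predicted = [2.5, 5.0, 8.0]

print(residuals(observed, predicted))  # [0.5, 0.0, -0.5]
```

A positive result means the model underpredicted that point; a negative result means it overpredicted.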
3. Found by summing all the data values and dividing by the count
mean
confounded
matching
bimodal
4. A variable whose values are compared across different treatments
experiment
uniform
response
regression to the mean
5. Control, randomize, replicate, block
dotplot
timeplot
principles of experimental design
quartile
6. A numerical measure of the direction and strength of a linear association
correlation
random assignment
least squares
block
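As a rough sketch of the measure described in question 6, the following Python computes r from sums of deviations. The function name `correlation_r` and the data are assumptions chosen for illustration only.

```python
from math import sqrt

def correlation_r(xs, ys):
    """Pearson r: shared variation divided by the product of the spreads."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Hypothetical data lying exactly on a rising line, then on a falling line.
xs = [1, 2, 3, 4]
r_up = correlation_r(xs, [3, 5, 7, 9])    # perfectly linear, positive
r_down = correlation_r(xs, [9, 7, 5, 3])  # perfectly linear, negative
print(r_up, r_down)
```

The sign of the result gives the direction of the association, and its distance from zero gives the strength.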
7. A variable other than x and y that simultaneously affects both variables, accounting for the correlation between the two
shifting
direction
lurking variable
outlier
8. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one
range
z-score
rescaling
tails
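To see why the values in question 8 end up with mean zero and standard deviation one, here is a small Python sketch. It assumes the population standard deviation (`pstdev`) to keep the numbers tidy; a course may use the sample standard deviation instead. The function name and data are hypothetical.

```python
from statistics import mean, pstdev

def z_scores(values):
    """Subtract the mean, then divide by the standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)  # assumption: population SD, not sample SD
    return [(v - mu) / sigma for v in values]

# Hypothetical data with mean 5 and population SD 2.
data = [2, 4, 4, 4, 5, 5, 7, 9]
z = z_scores(data)

print(z)                    # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
print(mean(z), pstdev(z))   # the standardized values have mean 0 and SD 1
```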
9. Gives the possible values of the variable and the relative frequency of each value
distribution
stem-and-leaf display
control group
model
10. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____, residuals can appear to be deceptively small
center
standardizing
leverage
single-blind
11. This criterion specifies the unique line that minimizes the variance of the residuals or, equivalently, the sum of the squared residuals
sample survey
R²
linear model
least squares
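The criterion in question 11 has a closed-form solution for a single predictor. The sketch below uses the standard textbook formulas for slope and intercept; the function name `fit_line` and the data are assumptions made for the example.

```python
def fit_line(xs, ys):
    """Return (b0, b1) minimizing the sum of squared residuals."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx            # slope
    b0 = y_bar - b1 * x_bar   # intercept: the line passes through (x_bar, y_bar)
    return b0, b1

# Hypothetical data lying exactly on y = 1 + 2x.
b0, b1 = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(b0, b1)  # 1.0 2.0
```

With real, noisy data the residuals will not all be zero, but no other line gives a smaller sum of squared residuals.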
12. In a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean
subset
scatterplots
68-95-99.7 rule
placebo
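The percentages in question 12 can be checked against the standard normal model with Python's built-in `statistics.NormalDist`. The helper name `share_within` is an assumption for this sketch.

```python
from statistics import NormalDist

std_normal = NormalDist()  # mean 0, standard deviation 1

def share_within(k):
    """Proportion of a normal model within k standard deviations of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)

for k in (1, 2, 3):
    print(k, round(share_within(k), 4))
# 1 0.6827
# 2 0.9545
# 3 0.9973
```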
13. A numerical summary of how tightly the values are clustered around the 'center'
statistically significant
normal probability plot
categorical variable
spread
14. A design in which randomization occurs within blocks
randomized block
simulation
principles of experimental design
sample survey
15. Displays counts and, sometimes, percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once, to reveal possible patterns in one variable that may be contingent on the cate
double-blind
random assignment
contingency table
center
16. The middle value with half of the data above and half below it
median
histogram
R²
spread
17. A list of individuals from whom the sample is drawn
simpson's paradox
simulation component
sampling frame
shape
18. The distribution of a variable restricting the who to consider only a smaller group of individuals
conditional distribution
normal model
level
mean
19. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups
response bias
blinding
distribution
normal probability plot
20. Lists the categories in a categorical variable and gives the count or percentage of observations for each category
multimodal
spread
frequency table
timeplot
21. Bias introduced to a sample when a large fraction of those sampled fails to respond
nonresponse bias
observational study
spread
regression line
22. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage
systematic sample
cluster sample
trial
outlier
23. An individual result of a component of a simulation
blinding
sample survey
scatterplots
outcome
24. A positive ____ or association means that, in general, as one variable increases, so does the other; when increases in one variable generally correspond to decreases in the other, the association is negative
completely randomized design
direction
context
5-number summary
25. An individual about whom or which we have data
case
outcome
observational study
area principle
26. Found by substituting the x-value in the regression equation; they're the values on the fitted line
spread
trial
categorical variable
predicted value
27. Holds information about the same characteristic for many cases
control group
form
variable
convenience sample
28. When either those who could influence the results or those who could evaluate them are blinded
center
changing center and spread
single-blind
multistage sample
29. If data consist of two or more groups that have been thrown together, it is usually better to fit a different linear model to each group than to try to fit a single model to all of the data
subset
voluntary response bias
median
independence
30. To describe this aspect of a distribution, look for single vs. multiple modes, and symmetry vs. skewness
shape
statistically significant
pie chart
systematic sample
31. An observational study in which subjects are followed to observe future outcomes
context
5-number summary
prospective study
matching
32. Summarized with the mean or the median
center
leverage
factor
independence
33. The tendency of many human subjects (often 20% or more of experiment subjects) to show a response even when administered a placebo
response
placebo effect
matching
shape
34. A point that does not fit the overall pattern seen in the scatterplot
sample
changing center and spread
frequency table
outlier
35. Bias introduced to a sample when individuals can choose on their own whether to participate in the sample
simpson's paradox
random
subset
voluntary response bias
36. The natural tendency of randomly drawn samples to differ
lurking variable
random
bimodal
sampling variability
37. Value found by subtracting the mean and dividing by the standard deviation
lurking variable
marginal distribution
lurking variable
standardized value
38. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table
percentile
placebo effect
regression line
marginal distribution
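The "totals found in the margins" from question 38 can be illustrated with a small Python sketch. The table layout, the category names, and the counts below are all invented for the example.

```python
# Hypothetical two-way table: counts[row_category][column_category].
counts = {
    "A": {"yes": 30, "no": 10},
    "B": {"yes": 20, "no": 40},
}

def marginals(table):
    """Return (row totals, column totals) of a two-way table of counts."""
    row_totals = {row: sum(cells.values()) for row, cells in table.items()}
    col_totals = {}
    for cells in table.values():
        for col, n in cells.items():
            col_totals[col] = col_totals.get(col, 0) + n
    return row_totals, col_totals

row_totals, col_totals = marginals(counts)
print(row_totals)  # {'A': 40, 'B': 60}
print(col_totals)  # {'yes': 50, 'no': 50}
```

Each set of totals describes one variable alone, ignoring the other.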
39. A sample that consists of the entire population
predicted value
distribution
outcome
census
40. An event is this if we know what outcomes could happen, but not which particular values will happen
comparing distributions
regression to the mean
lurking variable
random
41. The ____ we care about most is straight
sample
comparing distributions
conditional distribution
form
42. Numerically valued attribute of a model
strength
interquartile range
parameter
standardized value
43. An equation of the form y-hat = b0 + b1x
linear model
cluster sample
tails
treatment
44. When groups of experimental units are similar, it is a good idea to gather them together into these
confounded
block
rescaling
statistically significant
45. We do this by taking the logarithm, the square root, the reciprocal, or some other mathematical operation on all values in the data set
normal model
lurking variable
re-express data
random numbers
46. The best defense against bias, in which each individual is given a fair, random chance of selection
sample size
median
randomization
bimodal
47. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion
distribution
treatment
voluntary response bias
regression line
48. Manipulates factor levels to create treatments, randomly assigns subjects to these treatment levels, and then compares the responses of the subject groups across treatment levels
range
simulation
experiment
retrospective study
49. The lower one is the value with a quarter of the data below it; the upper one has a quarter of the data above it
census
double-blind
simple random sample
quartile
50. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean
prospective study
unimodal
standardizing
regression to the mean
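The statement in question 50 follows from the fact that, in standardized units, the least-squares line is z_y-hat = r * z_x, and r is never larger than 1 in absolute value. A minimal Python sketch, with a hypothetical function name and made-up values of r:

```python
def predicted_z(r, z_x):
    """Predicted y, in standard deviations from its mean, for a given z_x."""
    return r * z_x

# A point 2 SDs above the mean in x, under different (hypothetical) correlations.
for r in (1.0, 0.5, -0.25):
    print(r, predicted_z(r, 2.0))
# 1.0 2.0
# 0.5 1.0
# -0.25 -0.5
```

Unless the correlation is perfect, the prediction is always closer to the mean (in standard deviations) than the x-value was.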