SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted
histogram
r2
extrapolation
placebo
2. An observational study in which subjects are followed to observe future outcomes
prospective study
rescaling
z-score
symmetric
3. Values of this record the results of each trial with respect to what we were interested in
standardizing
random assignment
strength
response variable
4. Sampling schemes that combine several sampling methods
retrospective study
placebo
multistage sample
5-number summary
5. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two
standardizing
r2
independence
lurking variable
6. A variable that names categories (whether with words or numerals)
randomization
nonresponse bias
correlation
categorical variable
7. Found by summing all the data values and dividing by the count
mean
experimental units
lurking variable
completely randomized design
8. Shows quantitative data values in a way that sketches the distribution of the data
stem-and-leaf display
simulation component
control group
tails
9. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment
quartile
model
sampling frame
control group
10. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median
strength
principles of experimental design
retrospective study
5-number summary
11. When groups of experimental units are similar - it is a good idea to gather them together into these
spread
census
block
regression line
12. Having one mode; this is a useful term for describing the shape of a histogram when it's generally mound-shaped
bar chart
unimodal
parameter
stem-and-leaf display
13. Systematically recorded information - whether numbers or labels - together with its context
data
voluntary response bias
retrospective study
variance
14. Shows the relationship between two quantitative variables measured on the same cases
simple random sample
statistic
scatterplots
cluster sample
15. A list of individuals from whom the sample is drawn
z-score
bias
model
sampling frame
16. Value calculated from data to summarize aspects of the data
confounded
median
response bias
statistic
17. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals
5-number summary
case
range
least squares
18. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population
units
nonresponse bias
undercoverage
shape
19. Consists of the individuals who are conveniently available
stratified random sample
convenience sample
sampling variability
response variable
20. The sequence of several components representing events that we are pretending will take place
sample survey
trial
lurking variable
uniform
21. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative
direction
comparing distributions
double-blind
distribution
22. In a statistical display - each data value should be represented by the same amount of area
model
randomization
units
area principle
23. Useful family of models for unimodal - symmetric distributions
level
pie chart
normal model
boxplot
24. Value found by subtracting the mean and dividing by the standard deviation
sample survey
variance
sampling variability
standardized value
25. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters
lurking variable
strength
representative
simpson's paradox
26. Bias introduced to a sample when a large fraction of those sampled fails to respond
double-blind
pie chart
bimodal
nonresponse bias
27. An arrangement of data in which each row represents a case and each column represents a variable
normal percentile
data table
bimodal
subset
28. An equation of the form y-hat = b0 + b1x
independence
standard deviation
statistically significant
linear model
29. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
lurking variable
r2
representative
conditional distribution
30. Individuals on whom an experiment is performed
predicted value
experimental units
simple random sample
trial
31. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____
interquartile range
sample size
random
bias
32. The square root of the variance
predicted value
sample size
standard deviation
quantitative variable
33. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values
block
boxplot
rescaling
scatterplots
34. An observational study in which subjects are selected and then their previous conditions or behaviors are determined
retrospective study
uniform
experiment
voluntary response bias
35. A sampling design in which entire groups are chosen at random
5-number summary
cluster sample
standard normal model
treatment
36. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table
marginal distribution
mean
systematic sample
prospective study
37. Holds information about the same characteristic for many cases
variable
outlier
form
stem-and-leaf display
38. A numerical measure of the direction and strength of a linear association
representative
standardized value
bias
correlation
39. Summarized with the mean or the median
sampling frame
center
stratified random sample
stem-and-leaf display
40. When either those who could influence or evaluate the results is blinded
single-blind
histogram
standardizing
independence
41. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups
distribution
blinding
stem-and-leaf display
outliers
42. The middle value with half of the data above and half below it
median
simpson's paradox
r2
outcome
43. These are hard to generate - but several websites offer an unlimited supply of equally likely random values
random numbers
linear model
cluster sample
intercept
44. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean
68-95-99.7 rule
bimodal
voluntary response bias
spread
45. Numerically valued attribute of a model
parameter
timeplot
simple random sample
placebo effect
46. When an observed difference is too large for us to believe that is is likely to have occurred naturally
outcome
quantitative variable
data
statistically significant
47. The process - intervention - or other controlled circumstance applied to randomly assigned experimental units
pie chart
range
randomization
treatment
48. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model
68-95-99.7 rule
simulation
undercoverage
observational study
49. The difference between the lowest and highest values in a data set
random assignment
range
undercoverage
mean
50. Gives the possible values of the variable and the frequency or relative frequency of each value
spread
randomized block
distribution
quartile