SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. An observational study in which subjects are followed to observe future outcomes
prospective study
variance
stem-and-leaf display
least squares
2. When doing this - consider their shape - center - and spread
comparing distributions
trial
spread
lurking variable
3. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other
contingency table
unimodal
r2
independence
4. When both those who could influence and evaluate the results are blinded
double-blind
single-blind
slope
blinding
5. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related
dotplot
re-express data
lurking variable
outlier
6. An equation of the form y-hat = b0 + b1x
linear model
center
spread
double-blind
7. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed
quartile
census
context
uniform
8. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
frequency table
outliers
trial
residuals
9. Numerically valued attribute of a model
data table
parameter
outcome
marginal distribution
10. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
r2
single-blind
shape
simulation
11. This corresponding to a z-score gives the percentage of values in a standard normal distribution found at that z-score or below
sample survey
center
normal percentile
standardized value
12. The ith ___ is the number that falls above i% of the data
normal probability plot
conditional distribution
dotplot
percentile
13. The sequence of several components representing events that we are pretending will take place
matched
trial
spread
sampling variability
14. Shows quantitative data values in a way that sketches the distribution of the data
experiment
stem-and-leaf display
convenience sample
unimodal
15. Anything in a survey design that influences response
population parameter
control group
variance
response bias
16. A variable whose values are compared across different treatments
lurking variable
categorical variable
response
subset
17. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment
parameter
control group
predicted value
distribution
18. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups
shifting
systematic sample
blinding
median
19. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage
outlier
experimental units
bias
factor
20. A variable in which the numbers act as numerical values; always has units
quantitative variable
pie chart
normal model
subset
21. Doing this is equivalent to changing its units
simple random sample
data table
single-blind
changing center and spread
22. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one
stratified random sample
z-score
outliers
regression line
23. To be valid - an experiment must assign experimental units to treatment groups at random
multistage sample
factor
subset
random assignment
24. When groups of experimental units are similar - it is a good idea to gather them together into these
matching
simulation
block
factor
25. Holds information about the same characteristic for many cases
single-blind
variable
multistage sample
experiment
26. Value found by subtracting the mean and dividing by the standard deviation
standardized value
convenience sample
trial
independence
27. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative
center
mean
normal probability plot
direction
28. In a retrospective or prospective study Subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest
randomization
distribution
pie chart
matched
29. Found by substituting the x-value in the regression equation; they're the values on the fitted line
dotplot
random
predicted value
normal probability plot
30. These are hard to generate - but several websites offer an unlimited supply of equally likely random values
center
matching
random numbers
retrospective study
31. A study based on data in which no manipulation of factors has been employed
sampling frame
census
observational study
quantitative variable
32. A numerically valued attribute of a model for a population
percentile
shape
shifting
population parameter
33. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other
trial
variable
simpson's paradox
symmetric
34. Places in order the effects that many re-expressions have on the data
residuals
outliers
statistic
ladder of powers
35. The difference between the first and third quartiles
interquartile range
matched
outlier
simulation component
36. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean
simulation component
68-95-99.7 rule
marginal distribution
intercept
37. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population
spread
experimental units
68-95-99.7 rule
undercoverage
38. A numerical measure of the direction and strength of a linear association
units
residuals
subset
correlation
39. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set
case
correlation
re-express data
regression line
40. A distribution that's roughly flat
uniform
confounded
symmetric
matched
41. When either those who could influence or evaluate the results is blinded
bimodal
cluster sample
simpson's paradox
single-blind
42. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____
bias
least squares
shifting
outliers
43. A variable whose levels are controlled by the experimenter
factor
normal percentile
frequency table
normal model
44. The best defense against bias - in which each individual is given a fair - random chance of selection
distribution
dotplot
randomization
random numbers
45. A scatterplot shows an association that is this if there is little scatter around the underlying relationship
interquartile range
strength
variable
standardizing
46. The parts of a distribution that typically trail off on either side; they can be characterized as long or short
strength
tails
bimodal
range
47. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes
units
standard deviation
standardizing
confounded
48. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value
frequency table
range
mode
center
49. When omitting a point from the data results in a very different regression model - the point is an ____
quartile
influential point
r2
simpson's paradox
50. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two
lurking variable
timeplot
parameter
simple random sample