SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
sample survey
spread
leverage
residuals
2. Control - randomize - replicate - block
extrapolation
normal model
principles of experimental design
distribution
3. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other
lurking variable
symmetric
undercoverage
response
4. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data
data table
contingency table
subset
blinding
5. Values of this record the results of each trial with respect to what we were interested in
pie chart
response variable
treatment
direction
6. In a statistical display - each data value should be represented by the same amount of area
multistage sample
form
area principle
residuals
7. Having one mode; this is a useful term for describing the shape of a histogram when it's generally mound-shaped
unimodal
units
simple random sample
parameter
8. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean
regression to the mean
bimodal
median
normal percentile
9. The ____ we care about most is straight
form
voluntary response bias
marginal distribution
single-blind
10. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage
undercoverage
range
outlier
skewed
11. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed
retrospective study
mode
z-score
re-express data
12. When doing this - consider their shape - center - and spread
retrospective study
histogram
5-number summary
comparing distributions
13. Distributions with two modes
range
bimodal
convenience sample
comparing distributions
14. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
response bias
68-95-99.7 rule
completely randomized design
r2
15. An individual result of a component of a simulation
nonresponse bias
stratified random sample
outcome
sample survey
16. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR
convenience sample
tails
shifting
symmetric
17. Anything in a survey design that influences response
response bias
uniform
shifting
simulation component
18. The square root of the variance
contingency table
standard deviation
mean
sample survey
19. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set
re-express data
randomized block
convenience sample
mean
20. The number of individuals in a sample
sample size
factor
random
standard normal model
21. Extreme values that don't appear to belong with the rest of the data
standard normal model
sample survey
outliers
5-number summary
22. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals
units
least squares
center
histogram
23. A sample drawn by selecting individuals systematically from a sampling frame
multistage sample
data
systematic sample
re-express data
24. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed
context
form
standard deviation
shape
25. Individuals on whom an experiment is performed
case
context
experimental units
shifting
26. Displays data that change over time
cluster sample
timeplot
completely randomized design
frequency table
27. The process - intervention - or other controlled circumstance applied to randomly assigned experimental units
range
trial
population
treatment
28. The sum of squared deviations from the mean - divided by the count minus one
stem-and-leaf display
correlation
case
variance
29. A sample that consists of the entire population
contingency table
undercoverage
outliers
census
30. Useful family of models for unimodal - symmetric distributions
outliers
normal model
form
5-number summary
31. A distribution is this if it's not symmetric and one tail stretches out farther than the other
data table
slope
skewed
68-95-99.7 rule
32. Places in order the effects that many re-expressions have on the data
prospective study
completely randomized design
ladder of powers
randomization
33. A quantity or amount adopted as a standard of measurement - such as dollars - hours - or grams
units
block
strength
distribution
34. A numerical measure of the direction and strength of a linear association
correlation
form
population
standardized value
35. All experimental units have an equal chance of receiving any treatment
completely randomized design
standard normal model
distribution
bar chart
36. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant
residuals
rescaling
quartile
boxplot
37. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table
unimodal
representative
marginal distribution
undercoverage
38. Distributions with more than two modes
population
multimodal
range
dotplot
39. Numerically valued attribute of a model
normal probability plot
parameter
subset
random numbers
40. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate
census
random numbers
contingency table
standardizing
41. The parts of a distribution that typically trail off on either side; they can be characterized as long or short
range
outlier
variance
tails
42. An observational study in which subjects are followed to observe future outcomes
marginal distribution
rescaling
prospective study
standardizing
43. A sampling design in which entire groups are chosen at random
extrapolation
cluster sample
matching
median
44. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population
case
response variable
census
sample survey
45. An observational study in which subjects are selected and then their previous conditions or behaviors are determined
prospective study
retrospective study
control group
lurking variable
46. These are hard to generate - but several websites offer an unlimited supply of equally likely random values
shifting
data table
random numbers
voluntary response bias
47. To describe this aspect of a distribution - look for single vs. multiple modes - and symmetry vs. skewness
stem-and-leaf display
outliers
shape
level
48. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two
correlation
lurking variable
matched
randomized block
49. An arrangement of data in which each row represents a case and each column represents a variable
data table
subset
bias
pie chart
50. When omitting a point from the data results in a very different regression model - the point is an ____
case
simple random sample
block
influential point