SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category
pie chart
multistage sample
census
timeplot
2. A normal model with a mean of 0 and a standard deviation of 1
normal probability plot
standard normal model
outliers
interquartile range
3. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted
double-blind
model
extrapolation
control group
4. A variable in which the numbers act as numerical values; always has units
cluster sample
mode
quantitative variable
contingency table
5. Consists of the individuals who are conveniently available
sample size
model
stem-and-leaf display
convenience sample
6. The difference between the lowest and highest values in a data set
random assignment
context
factor
range
7. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR
distribution
shifting
uniform
outlier
8. A scatterplot shows an association that is this if there is little scatter around the underlying relationship
scatterplots
strength
residuals
observational study
9. Places in order the effects that many re-expressions have on the data
blinding
undercoverage
single-blind
ladder of powers
10. An arrangement of data in which each row represents a case and each column represents a variable
response
data table
randomized block
influential point
11. In a retrospective or prospective study Subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest
stratified random sample
distribution
residuals
matched
12. When both those who could influence and evaluate the results are blinded
ladder of powers
bimodal
double-blind
convenience sample
13. Shows the relationship between two quantitative variables measured on the same cases
spread
scatterplots
bias
placebo
14. Value found by subtracting the mean and dividing by the standard deviation
subset
matching
response bias
standardized value
15. The specific values that the experimenter chooses for a factor
statistically significant
control group
level
percentile
16. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values
strength
level
changing center and spread
histogram
17. An equation or formula that simplifies and represents reality
categorical variable
model
median
systematic sample
18. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
r2
influential point
skewed
lurking variable
19. An equation of the form y-hat = b0 + b1x
simulation component
shifting
variance
linear model
20. Numerically valued attribute of a model
sample size
residuals
statistic
parameter
21. Shows quantitative data values in a way that sketches the distribution of the data
regression to the mean
stem-and-leaf display
confounded
simulation component
22. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean
response
mode
68-95-99.7 rule
correlation
23. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set
skewed
re-express data
conditional distribution
double-blind
24. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____
interquartile range
correlation
bias
variance
25. Anything in a survey design that influences response
randomization
response bias
normal percentile
area principle
26. Individuals on whom an experiment is performed
direction
experimental units
shape
prospective study
27. Values of this record the results of each trial with respect to what we were interested in
response variable
comparing distributions
nonresponse bias
model
28. The best defense against bias - in which each individual is given a fair - random chance of selection
placebo
outlier
randomization
center
29. A variable whose levels are controlled by the experimenter
correlation
factor
dotplot
case
30. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed
shifting
quartile
mode
distribution
31. A numerical summary of how tightly the values are clustered around the 'center'
bimodal
case
spread
units
32. When the levels of one factor are associated with the levels of another factor so their effects cannot be separated
confounded
randomized block
bimodal
placebo
33. The tendency of many human subjects (often 20% or more of experiment subjects) to show a response even when administered a placebo
placebo effect
matched
boxplot
rescaling
34. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model
mean
lurking variable
census
simulation
35. These are hard to generate - but several websites offer an unlimited supply of equally likely random values
statistically significant
lurking variable
response
random numbers
36. All experimental units have an equal chance of receiving any treatment
control group
completely randomized design
shape
random
37. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage
regression to the mean
predicted value
shape
outlier
38. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small
stem-and-leaf display
range
cluster sample
leverage
39. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean
residuals
regression to the mean
multistage sample
variance
40. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data
median
pie chart
random numbers
subset
41. Doing this is equivalent to changing its units
response
randomized block
block
changing center and spread
42. Value calculated from data to summarize aspects of the data
changing center and spread
histogram
statistic
z-score
43. Distributions with more than two modes
least squares
multimodal
case
trial
44. The ith ___ is the number that falls above i% of the data
changing center and spread
extrapolation
sampling frame
percentile
45. The square root of the variance
mode
standard deviation
response
area principle
46. Extreme values that don't appear to belong with the rest of the data
outliers
distribution
tails
center
47. The distribution of a variable restricting the who to consider only a smaller group of individuals
correlation
stratified random sample
conditional distribution
simulation
48. A numerically valued attribute of a model for a population
stem-and-leaf display
factor
population parameter
z-score
49. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
comparing distributions
randomized block
residuals
parameter
50. To be valid - an experiment must assign experimental units to treatment groups at random
retrospective study
random assignment
convenience sample
statistically significant