Test your basic knowledge | AP Statistics Vocab
Subjects: statistics, ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can study here.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from the answers to other questions, so the correct answer may sometimes be obvious. Even so, retaking the test reinforces your understanding each time.
1. A variable that names categories (whether with words or numerals)
context
r2
representative
categorical variable
2. Extreme values that don't appear to belong with the rest of the data
response
outlier
single-blind
outliers
3. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups
tails
simpson's paradox
median
blinding
4. Lists the categories in a categorical variable and gives the count or percentage of observations for each category
frequency table
random
normal model
conditional distribution
5. A representative subset of a population - examined in hope of learning about the population
distribution
sample
spread
outlier
6. The distribution of a variable restricting the Who to consider only a smaller group of individuals
conditional distribution
symmetric
sample
parameter
7. These are hard to generate - but several websites offer an unlimited supply of equally likely random values
5-number summary
random numbers
strength
simple random sample
8. The process - intervention - or other controlled circumstance applied to randomly assigned experimental units
simulation
timeplot
distribution
treatment
9. A sample drawn by selecting individuals systematically from a sampling frame
systematic sample
form
least squares
bimodal
10. The most basic situation in a simulation in which something happens at random
simulation component
random numbers
independence
placebo
11. Shows the relationship between two quantitative variables measured on the same cases
principles of experimental design
correlation
scatterplots
multistage sample
12. A sample that consists of the entire population
variable
census
completely randomized design
scatterplots
13. For a given z-score - this gives the percentage of values in a standard normal distribution found at that z-score or below
bimodal
outlier
5-number summary
normal percentile
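As a study aid for question 13, here is a minimal sketch of turning a z-score into a percentage at or below it. It uses Python's standard-library `statistics.NormalDist`; the helper name `normal_percentile` and the sample z-score are assumptions made for illustration.

```python
from statistics import NormalDist

def normal_percentile(z):
    """Percentage of a standard normal distribution found at or below z."""
    # NormalDist() with no arguments is the standard normal (mean 0, SD 1).
    return 100 * NormalDist().cdf(z)

# Hypothetical z-score, chosen only for illustration.
print(round(normal_percentile(1.0), 2))  # about 84.13
```

A z-score of 0 sits at the mean, so the same helper returns 50 for it.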
14. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
residuals
factor
tails
uniform
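The formula in question 14 (observed value - predicted value) can be sketched in a few lines. The data points and the line y-hat = 1 + 2x below are made up for illustration and do not come from the quiz.

```python
# Hypothetical data and model, for illustration only.
xs = [1, 2, 3, 4]
observed = [3.5, 4.5, 7.5, 8.5]

def predict(x, b0=1.0, b1=2.0):
    """Predicted value y-hat = b0 + b1 * x for an assumed line."""
    return b0 + b1 * x

# Each difference is: observed value - predicted value.
residuals = [y - predict(x) for x, y in zip(xs, observed)]
print(residuals)  # [0.5, -0.5, 0.5, -0.5]
```

A positive value means the point lies above the line, and a negative value means it lies below.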
15. The entire group of individuals or instances about whom we hope to learn
population
multimodal
correlation
statistically significant
16. Gives the possible values of the variable and the relative frequency of each value
timeplot
distribution
placebo
pie chart
17. Summarized with the standard deviation - interquartile range - and range
treatment
stem-and-leaf display
intercept
spread
18. An equation or formula that simplifies and represents reality
mean
contingency table
model
random numbers
19. A variable in which the numbers act as numerical values; always has units
quantitative variable
center
independence
representative
20. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment
correlation
statistically significant
nonresponse bias
control group
21. Useful family of models for unimodal - symmetric distributions
normal model
center
random
distribution
22. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small
simulation component
leverage
boxplot
experimental units
23. Distributions with more than two modes
outlier
variance
multimodal
changing center and spread
24. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion
regression line
multimodal
representative
confounded
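For question 24, one common way to find b0 and b1 under the least squares criterion is the textbook formula b1 = Sxy / Sxx and b0 = y-bar - b1 * x-bar. The sketch below assumes that formula; the function name `least_squares` and the data are hypothetical.

```python
from statistics import mean

def least_squares(xs, ys):
    """Return (b0, b1) for the line y-hat = b0 + b1 * x."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Sum of cross-deviations and sum of squared x-deviations.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx            # slope
    b0 = y_bar - b1 * x_bar   # intercept: the line passes through (x-bar, y-bar)
    return b0, b1

# Made-up data, for illustration only.
b0, b1 = least_squares([1, 2, 3, 4, 5], [2, 4, 5, 4, 5])
print(round(b0, 2), round(b1, 2))  # 2.2 0.6
```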
25. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant
rescaling
influential point
categorical variable
tails
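The claim in question 25 is easy to check numerically. This sketch multiplies some made-up heights in inches by 2.54 to get centimeters (the data are an assumption for illustration) and compares the summaries before and after.

```python
from statistics import mean, median, stdev

heights_in = [60, 62, 65, 70, 73]             # hypothetical heights in inches
heights_cm = [2.54 * h for h in heights_in]   # multiply every value by a constant

# Measures of position (mean, median) and of spread (SD) all scale by 2.54.
for name, stat in (("mean", mean), ("median", median), ("SD", stdev)):
    ratio = stat(heights_cm) / stat(heights_in)
    print(name, round(ratio, 2))
```

Each printed ratio should come out at about 2.54, the constant used to multiply the data.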
26. Any attempt to force a sample to resemble specified attributes of the population
response variable
sampling frame
matching
normal model
27. To be valid - an experiment must assign experimental units to treatment groups at random
unimodal
random assignment
normal model
z-score
28. All experimental units have an equal chance of receiving any treatment
experiment
completely randomized design
residuals
model
29. This - b0 - gives a starting value in y-units; it's the y-hat-value when x is 0
sampling frame
intercept
68-95-99.7 rule
regression line
30. Shows quantitative data values in a way that sketches the distribution of the data
stem-and-leaf display
center
intercept
units
31. The ith ___ is the number that falls above i% of the data
placebo effect
stratified random sample
distribution
percentile
32. When an observed difference is too large for us to believe that it is likely to have occurred naturally
observational study
statistically significant
direction
mode
33. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set
scatterplots
influential point
treatment
re-express data
34. If data consist of two or more groups that have been thrown together - it is usually better to fit a different linear model to each group than to try to fit a single model to all of the data
subset
5-number summary
response
variance
35. An observational study in which subjects are followed to observe future outcomes
random numbers
census
multimodal
prospective study
36. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median
contingency table
population
5-number summary
blinding
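The summary in question 36 can be computed with Python's standard library. Note that textbooks and software use slightly different quartile conventions; the `method="inclusive"` choice below is an assumption, and the helper name and data are hypothetical.

```python
from statistics import quantiles

def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum)."""
    # quantiles(..., n=4) returns the three cut points Q1, median, Q3.
    q1, med, q3 = quantiles(data, n=4, method="inclusive")
    return min(data), q1, med, q3, max(data)

# Made-up data, for illustration only.
print(five_number_summary([1, 2, 3, 4, 5, 6, 7, 8, 9]))  # (1, 3.0, 5.0, 7.0, 9)
```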
37. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum
68-95-99.7 rule
pie chart
stratified random sample
regression to the mean
38. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean
direction
68-95-99.7 rule
random assignment
multimodal
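The percentages in question 38 can be checked against the standard normal model. This is a minimal sketch using `statistics.NormalDist` from the Python standard library; the helper name `share_within` is made up for illustration.

```python
from statistics import NormalDist

def share_within(k):
    """Fraction of a normal model within k standard deviations of the mean."""
    z = NormalDist()  # standard normal: mean 0, SD 1
    return z.cdf(k) - z.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} SD: {share_within(k):.4f}")
# within 1 SD: 0.6827
# within 2 SD: 0.9545
# within 3 SD: 0.9973
```

The rule's 68, 95, and 99.7 are rounded versions of these values.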
39. The difference between the first and third quartiles
shifting
form
interquartile range
simpson's paradox
40. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related
predicted value
standardized value
lurking variable
normal probability plot
41. Places in order the effects that many re-expressions have on the data
bias
response variable
outliers
ladder of powers
42. When groups of experimental units are similar - it is a good idea to gather them together into these
data table
block
outliers
range
43. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate
contingency table
direction
frequency table
retrospective study
44. In a statistical display - each data value should be represented by the same amount of area
sampling variability
sample survey
area principle
observational study
45. Doing this to a variable is equivalent to changing its units
center
changing center and spread
convenience sample
normal percentile
46. Shows a bar representing the count of each category in a categorical variable
bar chart
normal percentile
statistically significant
pie chart
47. A variable whose levels are controlled by the experimenter
parameter
residuals
multistage sample
factor
48. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative
slope
direction
simple random sample
simulation
49. In a retrospective or prospective study - subjects who are similar in ways not under study may be ____ and then compared with each other on the variables of interest
standard normal model
histogram
matched
mean
50. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other
median
retrospective study
symmetric
matched