SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. When groups of experimental units are similar - it is a good idea to gather them together into these
bar chart
factor
block
multistage sample
2. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups
blinding
predicted value
center
context
3. A distribution that's roughly flat
double-blind
percentile
uniform
shape
4. When either those who could influence or evaluate the results is blinded
control group
voluntary response bias
symmetric
single-blind
5. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small
z-score
leverage
form
shape
6. Anything in a survey design that influences response
response bias
single-blind
slope
normal probability plot
7. Summarized with the standard deviation - interquartile range - and range
spread
outliers
stratified random sample
bimodal
8. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table
outliers
marginal distribution
pie chart
experiment
9. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other
least squares
bar chart
completely randomized design
independence
10. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data
subset
level
simpson's paradox
double-blind
11. Places in order the effects that many re-expressions have on the data
units
population parameter
factor
ladder of powers
12. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage
interquartile range
influential point
outlier
factor
13. Displays data that change over time
nonresponse bias
sample survey
form
timeplot
14. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model
simulation
5-number summary
mode
bias
15. The entire group of individuals or instances about whom we hope to learn
population
factor
standardized value
outliers
16. Distributions with more than two modes
statistically significant
normal percentile
response bias
multimodal
17. The middle value with half of the data above and half below it
rescaling
median
block
principles of experimental design
18. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population
matching
percentile
re-express data
undercoverage
19. When both those who could influence and evaluate the results are blinded
independence
data
randomization
double-blind
20. Consists of the individuals who are conveniently available
unimodal
standardizing
convenience sample
center
21. The number of individuals in a sample
sample size
distribution
quartile
confounded
22. Gives the possible values of the variable and the relative frequency of each value
shape
68-95-99.7 rule
pie chart
distribution
23. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____
placebo effect
statistic
bias
interquartile range
24. A variable that names categories (whether with words or numerals)
categorical variable
undercoverage
census
placebo effect
25. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
r2
marginal distribution
intercept
variance
26. A numerical summary of how tightly the values are clustered around the 'center'
interquartile range
representative
spread
statistically significant
27. A variable that is not explicitly part of a model but affects the way the variables in the model appear to be related
convenience sample
r2
lurking variable
marginal distribution
28. Doing this is equivalent to changing its units
changing center and spread
distribution
area principle
shape
29. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one
simulation
model
z-score
independence
30. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels
experiment
principles of experimental design
sample size
center
31. The distribution of a variable restricting the who to consider only a smaller group of individuals
skewed
median
conditional distribution
model
32. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value
residuals
data
outlier
scatterplots
33. A scatterplot shows an association that is this if there is little scatter around the underlying relationship
shifting
strength
model
population parameter
34. Extreme values that don't appear to belong with the rest of the data
outliers
principles of experimental design
categorical variable
matched
35. The parts of a distribution that typically trail off on either side; they can be characterized as long or short
timeplot
spread
tails
68-95-99.7 rule
36. The natural tendency of randomly drawn samples to differ
placebo
sampling frame
sampling variability
units
37. Gives the possible values of the variable and the frequency or relative frequency of each value
matched
mean
68-95-99.7 rule
distribution
38. A numerical measure of the direction and strength of a linear association
census
systematic sample
placebo
correlation
39. Shows the relationship between two quantitative variables measured on the same cases
outcome
data
scatterplots
sampling frame
40. This corresponding to a z-score gives the percentage of values in a standard normal distribution found at that z-score or below
unimodal
linear model
normal percentile
trial
41. Bias introduced to a sample when a large fraction of those sampled fails to respond
contingency table
center
nonresponse bias
data
42. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values
representative
histogram
cluster sample
symmetric
43. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment
predicted value
control group
nonresponse bias
median
44. The sum of squared deviations from the mean - divided by the count minus one
form
case
variance
multimodal
45. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals
least squares
categorical variable
center
random assignment
46. The ____ we care about most is straight
linear model
bar chart
form
lurking variable
47. The sequence of several components representing events that we are pretending will take place
trial
predicted value
stratified random sample
level
48. An equation or formula that simplifies and represents reality
range
treatment
random assignment
model
49. A study based on data in which no manipulation of factors has been employed
observational study
residuals
normal probability plot
simple random sample
50. These are hard to generate - but several websites offer an unlimited supply of equally likely random values
prospective study
simulation component
random numbers
timeplot