SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small
shifting
leverage
regression line
retrospective study
2. Gives the possible values of the variable and the frequency or relative frequency of each value
distribution
level
z-score
lurking variable
3. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed
percentile
skewed
context
control group
4. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population
sample survey
matching
lurking variable
systematic sample
5. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR
categorical variable
control group
outcome
shifting
6. To be valid - an experiment must assign experimental units to treatment groups at random
observational study
random assignment
center
trial
7. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values
bias
histogram
standard normal model
z-score
8. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data
distribution
subset
residuals
double-blind
9. The ith ___ is the number that falls above i% of the data
percentile
nonresponse bias
extrapolation
rescaling
10. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
center
randomized block
response bias
r2
11. Bias introduced to a sample when individuals can choose on their own whether to participate in the sample
voluntary response bias
statistic
dotplot
frequency table
12. Any attempt to force a sample to resemble specified attributes of the population
matching
distribution
randomized block
response
13. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals
data
randomization
least squares
statistic
14. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted
distribution
categorical variable
extrapolation
timeplot
15. The sequence of several components representing events that we are pretending will take place
standard deviation
trial
bimodal
lurking variable
16. Values of this record the results of each trial with respect to what we were interested in
leverage
response variable
influential point
randomized block
17. Holds information about the same characteristic for many cases
percentile
outcome
bias
variable
18. Consists of the individuals who are conveniently available
convenience sample
nonresponse bias
z-score
normal model
19. An individual about whom or which we have data
trial
census
case
pie chart
20. In a statistical display - each data value should be represented by the same amount of area
census
blinding
area principle
simple random sample
21. Shows quantitative data values in a way that sketches the distribution of the data
comparing distributions
principles of experimental design
stem-and-leaf display
shape
22. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment
distribution
control group
response variable
simulation
23. Distributions with more than two modes
5-number summary
variable
multimodal
form
24. Shows a bar representing the count of each category in a categorical variable
bar chart
placebo
simulation component
symmetric
25. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate
randomized block
contingency table
blinding
experiment
26. Gives the possible values of the variable and the relative frequency of each value
distribution
spread
ladder of powers
parameter
27. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it
quartile
normal model
variable
principles of experimental design
28. A scatterplot shows an association that is this if there is little scatter around the underlying relationship
predicted value
sampling variability
strength
regression line
29. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model
slope
simulation
frequency table
placebo
30. A variable that names categories (whether with words or numerals)
spread
stratified random sample
retrospective study
categorical variable
31. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters
shifting
representative
single-blind
level
32. Lists the categories in a categorical variable and gives the count or percentage of observations for each category
shifting
experiment
frequency table
statistically significant
33. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean
simple random sample
parameter
symmetric
68-95-99.7 rule
34. This of sample size n is one in which each set of n elements in the population has an equal chance of selection
simple random sample
level
simpson's paradox
matching
35. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes
variable
standardizing
interquartile range
lurking variable
36. The entire group of individuals or instances about whom we hope to learn
outlier
r2
population
double-blind
37. A distribution that's roughly flat
subset
uniform
experiment
randomized block
38. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum
uniform
prospective study
stratified random sample
dotplot
39. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value
68-95-99.7 rule
center
median
cluster sample
40. Bias introduced to a sample when a large fraction of those sampled fails to respond
nonresponse bias
quartile
trial
completely randomized design
41. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other
normal probability plot
factor
random
independence
42. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage
statistic
leverage
outlier
slope
43. A sampling design in which entire groups are chosen at random
timeplot
retrospective study
cluster sample
population
44. When both those who could influence and evaluate the results are blinded
level
variable
confounded
double-blind
45. Having one mode; this is a useful term for describing the shape of a histogram when it's generally mound-shaped
unimodal
regression to the mean
standard normal model
multistage sample
46. The tendency of many human subjects (often 20% or more of experiment subjects) to show a response even when administered a placebo
skewed
placebo effect
spread
experiment
47. The distribution of a variable restricting the who to consider only a smaller group of individuals
conditional distribution
population parameter
population
prospective study
48. Anything in a survey design that influences response
matching
response bias
shape
ladder of powers
49. When omitting a point from the data results in a very different regression model - the point is an ____
random
population parameter
retrospective study
influential point
50. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category
undercoverage
z-score
experimental units
pie chart