SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
AP Statistics Vocab
Start Test
Study First
Subjects
:
statistics
,
ap
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Ideally tells who was measured - what was measured - how the data were collected - where the data were collected - and when and why the study was performed
histogram
representative
context
principles of experimental design
2. The number of individuals in a sample
percentile
sample size
treatment
statistic
3. A sampling scheme that biases the sample in a way that gives a part of the population less representation than it has in the population
principles of experimental design
linear model
undercoverage
voluntary response bias
4. When groups of experimental units are similar - it is a good idea to gather them together into these
frequency table
block
outcome
treatment
5. Value found by subtracting the mean and dividing by the standard deviation
standardized value
simple random sample
boxplot
influential point
6. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean
nonresponse bias
68-95-99.7 rule
normal probability plot
strength
7. The square of the correlation between y and x; gives the fraction of the variability of y accounted for by the least squares linear regression on x; an overall measure of how successful the regression is in linearly relating y to x
r2
factor
block
population parameter
8. Places in order the effects that many re-expressions have on the data
center
confounded
population parameter
ladder of powers
9. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate
contingency table
conditional distribution
normal model
control group
10. The entire group of individuals or instances about whom we hope to learn
spread
population
leverage
z-score
11. A quantity or amount adopted as a standard of measurement - such as dollars - hours - or grams
representative
convenience sample
units
factor
12. To describe this aspect of a distribution - look for single vs. multiple modes - and symmetry vs. skewness
standardizing
shape
lurking variable
sampling frame
13. A variable that names categories (whether with words or numerals)
response bias
categorical variable
5-number summary
direction
14. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one
simple random sample
residuals
center
z-score
15. Shows quantitative data values in a way that sketches the distribution of the data
stem-and-leaf display
center
contingency table
outlier
16. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data
treatment
systematic sample
normal model
subset
17. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category
variable
pie chart
center
matching
18. Design Randomization occurring within blocks
slope
lurking variable
standard deviation
randomized block
19. Gives the possible values of the variable and the frequency or relative frequency of each value
placebo
distribution
quantitative variable
standard deviation
20. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum
subset
center
stratified random sample
voluntary response bias
21. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model
simulation
sample survey
comparing distributions
changing center and spread
22. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant
convenience sample
rescaling
bias
lurking variable
23. Gives a value in 'y-units per x-unit'; changes of one unit in x are associated with changes of b1 units in predicted values of y
mean
slope
range
regression to the mean
24. Distributions with two modes
skewed
normal model
bimodal
simulation
25. Adding a constant to each data value adds the same constant to the mean - the median - and the quartiles - but does not change the standard deviation or IQR
shifting
re-express data
single-blind
sampling frame
26. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population
median
spread
data table
sample survey
27. Any attempt to force a sample to resemble specified attributes of the population
extrapolation
simulation component
area principle
matching
28. In a statistical display - each data value should be represented by the same amount of area
spread
trial
influential point
area principle
29. This of sample size n is one in which each set of n elements in the population has an equal chance of selection
sampling frame
strength
simple random sample
outlier
30. The sum of squared deviations from the mean - divided by the count minus one
variance
5-number summary
convenience sample
shape
31. An equation of the form y-hat = b0 + b1x
sample size
linear model
experimental units
extrapolation
32. A sample drawn by selecting individuals systematically from a sampling frame
distribution
lurking variable
systematic sample
statistic
33. A numerically valued attribute of a model for a population
percentile
standard deviation
standardized value
population parameter
34. An observational study in which subjects are selected and then their previous conditions or behaviors are determined
normal probability plot
experiment
retrospective study
systematic sample
35. When an observed difference is too large for us to believe that is is likely to have occurred naturally
statistically significant
dotplot
standard normal model
convenience sample
36. Numerically valued attribute of a model
convenience sample
parameter
predicted value
population
37. The distribution of a variable restricting the who to consider only a smaller group of individuals
conditional distribution
rescaling
trial
spread
38. The square root of the variance
direction
standard deviation
quartile
systematic sample
39. A sampling design in which entire groups are chosen at random
data table
level
normal model
cluster sample
40. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage
bias
model
outlier
frequency table
41. Found by summing all the data values and dividing by the count
simulation component
observational study
single-blind
mean
42. The ____ we care about most is straight
lurking variable
simulation
spread
form
43. Useful family of models for unimodal - symmetric distributions
spread
normal model
response bias
random assignment
44. A distribution is this if the two halves on either side of the center look approximately like mirror images of each other
symmetric
multistage sample
intercept
factor
45. The middle value with half of the data above and half below it
median
population
population parameter
rescaling
46. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two
dotplot
lurking variable
normal percentile
influential point
47. The ith ___ is the number that falls above i% of the data
variable
statistic
spread
percentile
48. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes
parameter
marginal distribution
treatment
standardizing
49. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values
boxplot
pie chart
outlier
unimodal
50. Value calculated from data to summarize aspects of the data
undercoverage
statistic
linear model
median