## Test your basic knowledge |

# AP Statistics Vocab

**Instructions:**

- Answer 50 questions in 15 minutes.
- If you are not ready to take this test, you can study here.
- Match each statement with the correct term.
- Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.

**1. Holds information about the same characteristic for many cases**

**2. Extreme values that don't appear to belong with the rest of the data**

**3. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion**

**4. The difference between the first and third quartiles**

**5. The ith ___ is the number that falls above i% of the data**

**6. A sample that consists of the entire population**

**7. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment**

**8. The square root of the variance**

**9. The lower of this is the value with a quarter of the data below it; the upper of this has a quarter of the data above it**

**10. A representative subset of a population - examined in hope of learning about the population**

**11. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals**

**12. The difference between the lowest and highest values in a data set**

**13. When omitting a point from the data results in a very different regression model - the point is an ____**

**14. Summarized with the mean or the median**

**15. The sequence of several components representing events that we are pretending will take place**

**16. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median**

**17. Summarized with the standard deviation - interquartile range - and range**

**18. A sampling design in which the population is divided into several subpopulations - and random samples are then drawn from each stratum**

**19. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant**

**20. The differences between data values and the corresponding values predicted by the regression model; ____ = observed value - predicted value**

**21. When either those who could influence or evaluate the results is blinded**

**22. When an observed difference is too large for us to believe that is is likely to have occurred naturally**

**23. When doing this - consider their shape - center - and spread**

**24. Tells how many standard deviations a value is from the mean; have a mean of zero and a standard deviation of one**

**25. Any systematic failure of a sampling method to represent its population; common errors are voluntary response - undercoverage - nonresponse ____ - and response ____**

**26. Control - randomize - replicate - block**

**27. Displays counts and - sometimes - percentages of individuals falling into named categories on two or more variables; categorizes the individuals on all variables at once - to reveal possible patterns in one variable that may be contingent on the cate**

**28. A sample drawn by selecting individuals systematically from a sampling frame**

**29. This of sample size n is one in which each set of n elements in the population has an equal chance of selection**

**30. An arrangement of data in which each row represents a case and each column represents a variable**

**31. An event is this if we know what outcomes could happen - but not which particular values will happen**

**32. A numerical measure of the direction and strength of a linear association**

**33. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative**

**34. A list of individuals from whom the sample is drawn**

**35. Numerically valued attribute of a model**

**36. This - b0 - gives a starting value in y-units; it's the y-hat-value when x is 0**

**37. The distribution of a variable restricting the who to consider only a smaller group of individuals**

**38. A quantity or amount adopted as a standard of measurement - such as dollars - hours - or grams**

**39. Shows the relationship between two quantitative variables measured on the same cases**

**40. An individual result of a component of a simulation**

**41. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters**

**42. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values**

**43. Found by summing all the data values and dividing by the count**

**44. A point that does not fit the overall pattern seen in the scatterplot**

**45. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population**

**46. A scatterplot shows an association that is this if there is little scatter around the underlying relationship**

**47. An equation of the form y-hat = b0 + b1x**

**48. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values**

**49. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean**

**50. Graphs a dot for each case against a single axis**