Test your basic knowledge

AP Statistics Vocab

Subjects: statistics, AP
Instructions:
  • Answer 50 questions in 15 minutes.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions, so the correct answer may sometimes seem obvious, but taking the test repeatedly reinforces your understanding.
1. Uses adjacent bars to show the distribution of values in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values






2. A sample that consists of the entire population






3. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other






4. The ____ we care about most is straight






5. The ith ___ is the number that falls above i% of the data






6. The sum of squared deviations from the mean, divided by the count minus one
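The computation described in the statement above can be sketched in a few lines of Python. This is only an illustration: the function name and the data values are made up for the example and are not part of the quiz.

```python
def sum_sq_dev_over_count_minus_one(values):
    """Sum of squared deviations from the mean, divided by (count - 1)."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values")
    mean = sum(values) / n
    return sum((v - mean) ** 2 for v in values) / (n - 1)

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up illustrative values, mean is 5
result = sum_sq_dev_over_count_minus_one(data)
print(result)  # squared deviations sum to 32, so this is 32 / 7
```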






7. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters






8. Summarized with the mean or the median






9. The difference between the lowest and highest values in a data set






10. Shows how a 'whole' divides into categories by showing a wedge of a circle whose area corresponds to the proportion in each category






11. In a statistical display, each data value should be represented by the same amount of area






12. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed






13. Any attempt to force a sample to resemble specified attributes of the population






14. A treatment known to have no effect, administered so that all groups experience the same conditions






15. Although linear models provide an easy way to predict values of y for a given value of x, it is unsafe to predict for values of x far from the ones used to find the linear model equation; such predictions should not be trusted






16. Summarized with the standard deviation, interquartile range, and range






17. Adding a constant to each data value adds the same constant to the mean, the median, and the quartiles, but does not change the standard deviation or IQR
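A quick numerical check of the statement above, as a sketch using Python's standard `statistics` module. The data values, the constant 10, and the `iqr` helper are assumptions chosen for illustration; the quartiles follow the library's default rule, which may differ slightly from a textbook's.

```python
import statistics

def iqr(values):
    """Interquartile range using the standard library's default quartile rule."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    return q3 - q1

data = [3, 7, 8, 12, 15]          # made-up illustrative values
moved = [v + 10 for v in data]    # add the same constant to every value

mean_change = statistics.mean(moved) - statistics.mean(data)
median_change = statistics.median(moved) - statistics.median(data)
sd_change = statistics.stdev(moved) - statistics.stdev(data)
iqr_change = iqr(moved) - iqr(data)

print(mean_change, median_change)  # both measures of center move by the constant
print(sd_change, iqr_change)       # both measures of spread are unchanged (up to rounding)
```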






18. Manipulates factor levels to create treatments, randomly assigns subjects to these treatment levels, and then compares the responses of the subject groups across treatment levels






19. A sampling design in which entire groups are chosen at random






20. A positive ____ or association means that, in general, as one variable increases, so does the other; when increases in one variable generally correspond to decreases in the other, the association is negative






21. A numerically valued attribute of a model for a population






22. Sampling schemes that combine several sampling methods






23. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






24. A variable whose values are compared across different treatments






25. Tell how many standard deviations a value is from the mean; they have a mean of zero and a standard deviation of one






26. The middle value with half of the data above and half below it






27. To be valid, an experiment must assign experimental units to treatment groups at random






28. In a normal model, about 68% of values fall within 1 standard deviation of the mean, about 95% fall within 2 standard deviations of the mean, and about 99.7% fall within 3 standard deviations of the mean
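The three percentages in the statement above can be recovered from the standard normal cumulative distribution. The sketch below uses `statistics.NormalDist` from the Python standard library; the helper name `share_within` is an assumption made for this example.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Proportion of a normal model lying within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    # Prints roughly 68%, 95%, and 99.7% for k = 1, 2, 3.
    print(k, f"{100 * share_within(k):.1f}%")
```

Because any normal model can be rescaled to the standard one, the same proportions apply whatever the mean and standard deviation are.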






29. The process, intervention, or other controlled circumstance applied to randomly assigned experimental units






30. The experimental units assigned to a baseline treatment level, typically either the default treatment, which is well understood, or a null, placebo treatment






31. Done to eliminate units; values can be compared and combined even if the original variables had different units and magnitudes






32. The number of individuals in a sample






33. When the levels of one factor are associated with the levels of another factor so their effects cannot be separated






34. A distribution is this if it's not symmetric and one tail stretches out farther than the other






35. If data consist of two or more groups that have been thrown together, it is usually better to fit a different linear model to each group than to try to fit a single model to all of the data






36. A variable that names categories (whether with words or numerals)






37. Displays data that change over time






38. This, b0, gives a starting value in y-units; it is the y-hat value when x is 0






39. A study based on data in which no manipulation of factors has been employed






40. Value found by subtracting the mean and dividing by the standard deviation
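As a minimal sketch of the operation described above, the following Python applies it to every value in a small data set and then checks the resulting mean and standard deviation. The function name and the data are assumptions for illustration; the sample standard deviation (divisor n - 1) is used, and it must be nonzero.

```python
import statistics

def subtract_mean_divide_by_sd(values):
    """Subtract the mean from each value, then divide by the standard deviation."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation; must not be zero
    return [(v - mean) / sd for v in values]

data = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up illustrative values
rescaled = subtract_mean_divide_by_sd(data)

print(statistics.mean(rescaled))   # approximately 0
print(statistics.stdev(rescaled))  # approximately 1
```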






41. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model
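To make the idea above concrete, here is a small sketch that models rolls of a fair six-sided die with random numbers. The die, the number of rolls, and the fixed seed are all assumptions chosen for the example.

```python
import random

def simulate_die_rolls(n_rolls, seed=0):
    """Use random numbers to model n_rolls of a fair die; return relative frequencies."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    counts = {face: 0 for face in range(1, 7)}
    for _ in range(n_rolls):
        counts[rng.randint(1, 6)] += 1  # each face is equally likely
    return {face: count / n_rolls for face, count in counts.items()}

frequencies = simulate_die_rolls(60_000)
for face, freq in frequencies.items():
    print(face, round(freq, 3))  # each should be close to 1/6
```

With many rolls, the observed relative frequencies settle near the real-world value of 1/6 per face.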






42. Consists of the individuals who are conveniently available






43. The linear equation y-hat = b0 + b1x that satisfies the least squares criterion
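The coefficients b0 and b1 in the equation above can be computed directly from the data. The sketch below uses the usual closed-form least squares formulas; the function name and the sample points are assumptions for illustration.

```python
def least_squares_coefficients(xs, ys):
    """Return (b0, b1) minimizing the sum of squared residuals of y-hat = b0 + b1*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)  # must be nonzero (xs not all equal)
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar  # the fitted line passes through (x_bar, y_bar)
    return b0, b1

xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]  # made-up points lying exactly on y = 1 + 2x
b0, b1 = least_squares_coefficients(xs, ys)
print(b0, b1)  # intercept 1 and slope 2 for these points
```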






44. Any individual associated with an experiment who is not aware of how subjects have been allocated to treatment groups






45. The most basic situation in a simulation in which something happens at random






46. We do this by applying the logarithm, the square root, the reciprocal, or some other mathematical operation to all values in the data set






47. This corresponding to a z-score gives the percentage of values in a standard normal distribution found at that z-score or below
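The quantity described above is the standard normal cumulative distribution expressed as a percentage. A minimal sketch with the standard library's `statistics.NormalDist` follows; the helper name `percent_at_or_below` is an assumption made for this example.

```python
from statistics import NormalDist

def percent_at_or_below(z):
    """Percentage of a standard normal distribution found at z or below."""
    return 100 * NormalDist().cdf(z)

for z in (-1.0, 0.0, 1.0):
    # Roughly 15.9%, 50%, and 84.1% respectively.
    print(z, round(percent_at_or_below(z), 1))
```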






48. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant
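The statement above can be checked numerically by comparing summaries before and after multiplying. This is a sketch only: the data and the constant 2.5 are assumptions, and a positive constant is used so the ratios are easy to read.

```python
import statistics

data = [3, 7, 8, 12, 15]              # made-up illustrative values
factor = 2.5                          # hypothetical positive constant
scaled = [factor * v for v in data]   # multiply every value by the constant

mean_ratio = statistics.mean(scaled) / statistics.mean(data)
median_ratio = statistics.median(scaled) / statistics.median(data)
sd_ratio = statistics.stdev(scaled) / statistics.stdev(data)

# Position and spread are all multiplied by the same constant.
print(mean_ratio, median_ratio, sd_ratio)
```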






49. When both those who could influence the results and those who evaluate them are blinded






50. Gives the possible values of the variable and the relative frequency of each value