Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Summarized with the standard deviation - interquartile range - and range






2. A display to help assess whether a distribution of data is approximately normal; if it is nearly straight - the data satisfy the nearly normal condition






3. An observational study in which subjects are followed to observe future outcomes






4. An individual about whom or which we have data






5. The middle value with half of the data above and half below it






6. Uses adjacent bars to show the distribution of vales in a quantitative variable; each bar represents the frequency (or relative frequency) of values falling in an interval of values






7. A scatterplot shows an association that is this if there is little scatter around the underlying relationship






8. This of sample size n is one in which each set of n elements in the population has an equal chance of selection






9. We do this by taking the logarithm - the square root - the reciprocal - or some other mathematical operation on all values in the data set






10. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






11. An equation or formula that simplifies and represents reality






12. When omitting a point from the data results in a very different regression model - the point is an ____






13. Numerically valued attribute of a model






14. A sample that consists of the entire population






15. Gives the possible values of the variable and the relative frequency of each value






16. A numerically valued attribute of a model for a population






17. Multiplying each data value by a constant multiplies both the measures of position and the measures of spread by that constant






18. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






19. An equation of the form y-hat = b0 + b1x






20. Consists of the individuals who are conveniently available






21. The number of individuals in a sample






22. A positive ____ or association means that - in general - as one variable increases - so does the other; when increases in one variable generally correspond to decreases in the other - the association is negative






23. A variable in which the numbers act as numerical values; always has units






24. Value found by subtracting the mean and dividing by the standard deviation






25. The sum of squared deviations from the mean - divided by the count minus one






26. Manipulates factor levels to create treatments - randomly assigns subjects to these treatment levels - and then compares the responses of the subject groups across treatment levels






27. Sampling schemes that combine several sampling methods






28. The most basic situation in a simulation in which something happens at random






29. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small






30. An individual result of a component of a simulation






31. Distributions with two modes






32. An event is this if we know what outcomes could happen - but not which particular values will happen






33. A variable whose levels are controlled by the experimenter






34. When an observed difference is too large for us to believe that is is likely to have occurred naturally






35. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






36. An observational study in which subjects are selected and then their previous conditions or behaviors are determined






37. These are hard to generate - but several websites offer an unlimited supply of equally likely random values






38. Displays the 5-number summary as a central box with whiskers that extend to the non-outlying data values






39. A sample is this if the statistics computed from it accurately reflect the corresponding population parameters






40. A numerical summary of how tightly the values are clustered around the 'center'






41. Bias introduced to a sample when individuals can choose on their own whether to participate in the sample






42. Useful family of models for unimodal - symmetric distributions






43. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






44. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage






45. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed






46. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






47. Anything in a survey design that influences response






48. A variable that names categories (whether with words or numerals)






49. A value that attempts the impossible by summarizing the entire distribution with a single number - a 'typical' value






50. A distribution is this if it's not symmetric and one tail stretches out farther than the other