Test your basic knowledge |

AP Statistics Vocab

Subjects : statistics, ap
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. When averages are taken across different groups - they can appear to contradict the overall averages

Warning: Invalid argument supplied for foreach() in /var/www/html/basicversity.com/show_quiz.php on line 183


2. Each predicted y-hat tends to be fewer standard deviations from its mean than its corresponding x was from its mean






3. The ____ we care about most is straight






4. In a normal model - about 68% of values fall within 1 standard deviation of the mean - about 95% fall within 2 standard deviations of the mean - and about 99.7% fall within 3 standard deviations of the mean






5. A study based on data in which no manipulation of factors has been employed






6. The square root of the variance






7. Control - randomize - replicate - block






8. Found by substituting the x-value in the regression equation; they're the values on the fitted line






9. Places in order the effects that many re-expressions have on the data






10. An observational study in which subjects are followed to observe future outcomes






11. The number of individuals in a sample






12. A variable that names categories (whether with words or numerals)






13. To be valid - an experiment must assign experimental units to treatment groups at random






14. Variables are said to be this if the conditional distribution of one variable is the same for each category of the other






15. Design Randomization occurring within blocks






16. Models random events by using random numbers to specify event outcomes with relative frequencies that correspond to the true real-world relative frequencies we are trying to model






17. Displays data that change over time






18. Shows a bar representing the count of each category in a categorical variable






19. A hump or local high point in the shape of the distribution of a variable; the apparent locations of these can change as the scale of a histogram is changed






20. The distribution of a variable restricting the who to consider only a smaller group of individuals






21. Shows quantitative data values in a way that sketches the distribution of the data






22. Any data point that stands away from the others; can be extraordinary by having a large residual or by having high leverage






23. The ith ___ is the number that falls above i% of the data






24. A variable other than x and y that simultaneously affects both variables - accounting for the correlation between the two






25. Although linear models provide an easy way to predict values of y for a given value of x - it is unsafe to predict for values of x far from the ones used to find the linear model equation; predictions should not be trusted






26. The experimental units assigned to a baseline treatment level - typically either the default treatment - which is well understood - or a null - placebo treatment






27. A numerical measure of the direction and strength of a linear association






28. Found by summing all the data values and dividing by the count






29. To describe this aspect of a distribution - look for single vs. multiple modes - and symmetry vs. skewness






30. Data points whose x-values are far from the mean of x are said to exert ____ on a linear model; with high enough ____ - residuals can appear to be deceptively small






31. An equation or formula that simplifies and represents reality






32. Lists the categories in a categorical variable and gives the count or percentage of observations for each category






33. The distribution of either variable alone in a contingency table; the counts or percentages are the totals found in the margins (last row or column) of the table






34. An arrangement of data in which each row represents a case and each column represents a variable






35. Consists of the minimum and maximum - the quartiles Q1 and Q3 - and the median






36. A study that asks questions of a sample drawn from some population in the hope of learning something about the entire population






37. The middle value with half of the data above and half below it






38. If data consist of two or more groups that have been thrown together - it is usually best to fit different linear models to each group than to try to fit a single model to all of the data






39. This criterion specifies the unique line that minimizes the variance of the residuals or - equivalently - the sum of the squared residuals






40. A numerically valued attribute of a model for a population






41. A variable whose values are compared across different treatments






42. Summarized with the mean or the median






43. Values of this record the results of each trial with respect to what we were interested in






44. A variable in which the numbers act as numerical values; always has units






45. When groups of experimental units are similar - it is a good idea to gather them together into these






46. The parts of a distribution that typically trail off on either side; they can be characterized as long or short






47. Shows the relationship between two quantitative variables measured on the same cases






48. An event is this if we know what outcomes could happen - but not which particular values will happen






49. Useful family of models for unimodal - symmetric distributions






50. The difference between the lowest and highest values in a data set