Test your basic knowledge |

Data Mining

Subject : it-skills
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Which statement removes the table Salesrep from a DBMS?






2. ___________ is not a characteristic of a data warehouse.






3. An analytical-oriented organizational structure is a data warehouse _____________.






4. R- squared(and adjusted r-squared) - A measure of how much of the variability around the target mean is explained by your predictive variables. Doesn't mean you have a good predictive model—only validation will tell you that






5. Which function calculates the number of entries in a table?






6. This is not considered one of the four major categories of processing algorithms and rule approaches.






7. The SQL command for deleting the Warehouse field from the Part table is _____.






8. Are a data mining technology.






9. A powerful trend in IT is known as - which maintains that Computer transmission speed doubles every 18 months.






10. Semantic object link (SOL) attributes establish a relationship between one _______ and another.






11. Which data mining technique utilizes linkage analysis to search operational transactions for patterns with a high probability of repetition?






12. The _____ operation of two tables results in a single table with the same columns as the first table - and containing all rows that are in the first table merged with all the rows in the second table - minus any duplicate rows.






13. When an entity has a minimum cardinality of one it means the entity is required in _______.






14. You can save the results of a query as a table by including the _____ clause in the query.






15. _________ seeks to ensure that each application under development is fully integrated within its own boundaries and to eliminate any inconsistencies in the final software product.






16. A common example of the use of association methods where a retailer can mine the data generated by a point-of-sale system - such as the price scanner you are familiar with at the grocery store is referred to as:






17. A Star diagram has two types of tables (objects). They are called the___________________ tables and ; fact tables.






18. The ACCESS feature that tests to see if your tables are normalized properly is the ____.






19. The term _____ has been generally agreed to represent the broadest category of software technology that enables decision makers to conduct many dimensional analysis of consolidated enterprise data.






20. The deletion of a record that also deletes related records is referred to as a(n) _____.






21. Twice as likely to identify the important class (compared to avg. prevalence)






22. A compound semantic object is an object that contains at least one ____.






23. An alternative to the data warehouse concept is a lower-cost - scaled-down version referred to as the _____________.






24. These are considered an alternate storage techniques for data warehousing include.






25. An ___________ relates two other objects.






26. ____________ would not normally be associated with ROUTINE data warehouse maintenance.






27. Gives an idea of systematic over- or under-prediction. Magnitude of average absolute error.






28. Generally Semantic Object Modeling (SOM) is consideredmore bottom-up oriented than _____________.






29. Not the same as goodness-of-fit; We want to know how well the model predicts new data - not how well it fits the data it was trained with; Key component of most measures is difference between actual y and predicted y (error)






30. On an ER Diagram the number (mark) on relationship line that is farthest away from each entity (rectangle) represents the _______ cardinality.






31. Increased affordability of ____________ is a reason for the growth in popularity of data mining.






32. Useful for assessing performance in terms of identifying the most important class. Helps such choices as: How many tax records to examine; How many loans to grant; How many customers to mail an offer






33. The product of two tables is also called the ________ product.






34. The set of activities used to find new - hidden - or unexpected patterns in data is referred to as _____.






35. A synonym for data mining






36. To create the primary key clause for the Customer table on the CustomerNum field - which of the following is the correct statement?






37. In general - ______________ are transformed to relations/tables by defining one relation for the object itself and another relation for each multivalued attribute.






38. Which of the following is at the center of a star schema?






39. Which clause would be used to create groups of records?






40. The SQL built-in functions - which may appear on the same line as the SELECT statement (before the FROM clause) are called _____ functions.






41. The term "ETL" in data warehousing stands for: Extraction - ________________________ - & Loading.






42. Why are Star Schemas so useful in Financial Planning and Accounting Information Systems?






43. Organizes and analyzes data as an n-dimensional cube. The cube can be thought of as a common spreadsheet with two extensions: (1) support for multiple dimensions and (2) support for multiple concurrent users.






44. Which of the following database design and data warehouse design approaches is viewed to take a more strategic rather than operational perspective?






45. The minimum cardinality and m is the maximum cardinality Cardinalities in Semantic Objects are shown as subscripts in the format n-m where _____






46. A _____________ is a system-generated primary key.






47. 'Signatures' are used for intrusion detection by _______?






48. ___________ determines exactly what level of detail constitutes a fact record.






49. A ___________ combines result sets from more than one fact table.






50. Which rule would you be violating - if you tried to delete a sales rep record - who currently has customers on file?