Test your basic knowledge |

Data Mining

Subject : it-skills
Instructions:
  • Answer 50 questions in 15 minutes.
  • If you are not ready to take this test, you can study here.
  • Match each statement with the correct term.
  • Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.

This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. Which statement will take away user privileges to the database?






2. A Star diagram has two types of tables (objects). They are called the___________________ tables and ; fact tables.






3. The _____ operation of two tables results in a single table with the same columns as the first table - and containing all rows that are in the first table merged with all the rows in the second table - minus any duplicate rows.






4. Twice as likely to identify the important class (compared to avg. prevalence)






5. Not the same as goodness-of-fit; We want to know how well the model predicts new data - not how well it fits the data it was trained with; Key component of most measures is difference between actual y and predicted y (error)






6. __________ occurs when the initial scope of a project continues to expand as new features are incorporated into the project.






7. This is not considered one of the four major categories of processing algorithms and rule approaches.






8. A compound semantic object is an object that contains at least one ____.






9. The ACCESS feature that tests to see if your tables are normalized properly is the ____.






10. ___________ is not a characteristic of a data warehouse.






11. The SQL command for deleting the Warehouse field from the Part table is _____.






12. ___________________ is used to relate one set of outcomes (dependent variable) to a set of predictor (independent) variables (e.g. - in time series analysis). Through this analysis we attempt to predictive future events - as the dependent variables b






13. Increased affordability of ____________ is a reason for the growth in popularity of data mining.






14. A ___________ combines result sets from more than one fact table.






15. Gives an idea of systematic over- or under-prediction. Magnitude of average absolute error.






16. Which rule would you be violating - if you tried to delete a sales rep record - who currently has customers on file?






17. Which data mining technique utilizes linkage analysis to search operational transactions for patterns with a high probability of repetition?






18. Information about tables in the database is kept in the _____.






19. Are a data mining technology.






20. The process by which numerical data is converted into graphical images is referred to as:






21. ____________ would not normally be associated with ROUTINE data warehouse maintenance.






22. Which of the following is at the center of a star schema?






23. You can save the results of a query as a table by including the _____ clause in the query.






24. Generally Semantic Object Modeling (SOM) is consideredmore bottom-up oriented than _____________.






25. Models that do ___________: MLR; KNN; Regression and Classification Trees; ANN; SVM






26. Useful for assessing performance in terms of identifying the most important class. Helps such choices as: How many tax records to examine; How many loans to grant; How many customers to mail an offer






27. The SQL built-in functions - which may appear on the same line as the SELECT statement (before the FROM clause) are called _____ functions.






28. The term "ETL" in data warehousing stands for: Extraction - ________________________ - & Loading.






29. In general - ______________ are transformed to relations/tables by defining one relation for the object itself and another relation for each multivalued attribute.






30. A powerful trend in IT is known as - which maintains that Computer transmission speed doubles every 18 months.






31. A single column that you create for an entity to serve as the primary key - because you otherwise would need many concatenated columns to do so - is called a(n) ____________.






32. 'Signatures' are used for intrusion detection by _______?






33. _________ seeks to ensure that each application under development is fully integrated within its own boundaries and to eliminate any inconsistencies in the final software product.






34. An analytical-oriented organizational structure is a data warehouse _____________.






35. The _______________________ represents the source data for the DW. This layer is comprised - primarily - of operational transaction processing systems and external secondary databases.






36. A common example of the use of association methods where a retailer can mine the data generated by a point-of-sale system - such as the price scanner you are familiar with at the grocery store is referred to as:






37. The term _____ has been generally agreed to represent the broadest category of software technology that enables decision makers to conduct many dimensional analysis of consolidated enterprise data.






38. Within most organizations - the person known as the _____ determines the type of access various users can have to the corporate or enterprise database.






39. The product of two tables is also called the ________ product.






40. These are considered an alternate storage techniques for data warehousing include.






41. ___________ determines exactly what level of detail constitutes a fact record.






42. The deletion of a record that also deletes related records is referred to as a(n) _____.






43. A _____________ is a system-generated primary key.






44. Which function should be used to calculate the total of all entries in a given column?






45. To create the primary key clause for the Customer table on the CustomerNum field - which of the following is the correct statement?






46. To add a new row to a table - use the _____ command.






47. Semantic object link (SOL) attributes establish a relationship between one _______ and another.






48. The process that records how data from operational data stores and external sources are transformed on the way into the warehouse is referred to as ________________.






49. Which clause would be used to create groups of records?






50. An economic feasibility measure. So is Internal rate of return.