SUBJECTS
|
BROWSE
|
CAREER CENTER
|
POPULAR
|
JOIN
|
LOGIN
Business Skills
|
Soft Skills
|
Basic Literacy
|
Certifications
About
|
Help
|
Privacy
|
Terms
|
Email
Search
Test your basic knowledge |
Data Mining
Start Test
Study First
Subject
:
it-skills
Instructions:
Answer 50 questions in 15 minutes.
If you are not ready to take this test, you can
study here
.
Match each statement with the correct term.
Don't refresh. All questions and answers are randomly picked and ordered every time you load a test.
This is a study tool. The 3 wrong answers for each question are randomly chosen from answers to other questions. So, you might find at times the answers obvious, but you will see it re-enforces your understanding as you take the test each time.
1. ___________ is not a characteristic of a data warehouse.
degrees of summarization
decile chart
volatile data
recognizing known patterns
2. Which rule would you be violating - if you tried to delete a sales rep record - who currently has customers on file?
transformation mapping
Sum
Referential integrity
near-line secondary storage devices
3. Information about tables in the database is kept in the _____.
system catalog
project readiness assessment factor
Top-down approach
semantic object (SOL) attribute
4. This is not considered one of the four major categories of processing algorithms and rule approaches.
drill-across report
principle component analysis
aggregate
machine learning
5. The product of two tables is also called the ________ product.
Cartesian
association semantic object
Group By
dimension
6. An analytical-oriented organizational structure is a data warehouse _____________.
Top-down approach
Regression analysis
machine learning
project readiness assessment factor
7. To add a new row to a table - use the _____ command.
Transformation
market basket analysis
changing/UPDATE-ing
Insert
8. A ___________ combines result sets from more than one fact table.
near-line secondary storage devices
average error
drill-across report
Sum
9. When an entity has a minimum cardinality of one it means the entity is required in _______.
Horizontal integration
the relationship
degrees of summarization
Transformation
10. Models that do ___________: MLR; KNN; Regression and Classification Trees; ANN; SVM
transformation mapping
the relationship
database administrator
numeric prediction
11. ____________ would not normally be associated with ROUTINE data warehouse maintenance.
changing/UPDATE-ing
OLAP
near-line secondary storage devices
Association
12. Which of the following database design and data warehouse design approaches is viewed to take a more strategic rather than operational perspective?
composite semantic objects
Group By
Revoke
Top-down approach
13. The set of activities used to find new - hidden - or unexpected patterns in data is referred to as _____.
Regression analysis
aggregate
data mining
Top-down approach
14. __________ occurs when the initial scope of a project continues to expand as new features are incorporated into the project.
Scope creep
knowledge data discovery
association semantic object
groves law
15. The SQL built-in functions - which may appear on the same line as the SELECT statement (before the FROM clause) are called _____ functions.
system catalog
Horizontal integration
aggregate
decile chart
16. ___________ determines exactly what level of detail constitutes a fact record.
The degree of granularity
artificial Key
project readiness assessment factor
data visualization
17. Organizes and analyzes data as an n-dimensional cube. The cube can be thought of as a common spreadsheet with two extensions: (1) support for multiple dimensions and (2) support for multiple concurrent users.
cascading delete
artificial Key
market basket analysis
MOLAP
18. A compound semantic object is an object that contains at least one ____.
system catalog
DROP TABLE Salesrep;
Top-down approach
semantic object (SOL) attribute
19. In general - ______________ are transformed to relations/tables by defining one relation for the object itself and another relation for each multivalued attribute.
composite semantic objects
data visualization
average error
Transformation
20. The SQL command for deleting the Warehouse field from the Part table is _____.
UNION
ALTER TABLE Part DELETE Warehouse;
Revoke
drill-across report
21. 'Signatures' are used for intrusion detection by _______?
project readiness assessment factor
MAE (Mean Absolute Error) deviation
recognizing known patterns
Document Analyzer
22. Which function calculates the number of entries in a table?
surrogate key
Count
ALTER TABLE Part DELETE Warehouse;
Insert
23. Gives us an idea of the magnitude of errors. Actual value - estimated value.
MAE (Mean Absolute Error) deviation
average error
Sum
Count
24. The term _____ has been generally agreed to represent the broadest category of software technology that enables decision makers to conduct many dimensional analysis of consolidated enterprise data.
Scope creep
degrees of summarization
OLAP
data mining
25. Which statement removes the table Salesrep from a DBMS?
measuring predictive error
Breakeven analysis
Sum
DROP TABLE Salesrep;
26. Are a data mining technology.
performance metrics - Numeric Prediction
Breakeven analysis
Regression analysis
neural networks & Decision Trees
27. A single column that you create for an entity to serve as the primary key - because you otherwise would need many concatenated columns to do so - is called a(n) ____________.
knowledge data discovery
The degree of granularity
machine learning
artificial Key
28. Which data mining technique utilizes linkage analysis to search operational transactions for patterns with a high probability of repetition?
Transformation
Sum
Association
market basket analysis
29. Which function should be used to calculate the total of all entries in a given column?
Sum
Breakeven analysis
the relationship
performance metrics - Numeric Prediction
30. Twice as likely to identify the important class (compared to avg. prevalence)
system catalog
Sum
data visualization
decile chart
31. Increased affordability of ____________ is a reason for the growth in popularity of data mining.
MOLAP
transformation mapping
database administrator
machine learning
32. Which of the following is at the center of a star schema?
Fact or Measurement table
Sum
n
recognizing known patterns
33. Which clause would be used to create groups of records?
data mining
Group By
Cartesian
ERD Modeling
34. A Star diagram has two types of tables (objects). They are called the___________________ tables and ; fact tables.
MAE (Mean Absolute Error) deviation
dimension
surrogate key
MOLAP
35. The _____ operation of two tables results in a single table with the same columns as the first table - and containing all rows that are in the first table merged with all the rows in the second table - minus any duplicate rows.
UNION
average error
operational and external layer
Regression analysis
36. The term "ETL" in data warehousing stands for: Extraction - ________________________ - & Loading.
Horizontal integration
volatile data
Transformation
Group By
37. An ___________ relates two other objects.
knowledge data discovery
numeric prediction
data visualization
association semantic object
38. Why are Star Schemas so useful in Financial Planning and Accounting Information Systems?
principle component analysis
Horizontal integration
Revoke
degrees of summarization
39. The process that records how data from operational data stores and external sources are transformed on the way into the warehouse is referred to as ________________.
transformation mapping
data mart
Into
Fact or Measurement table
40. The ACCESS feature that tests to see if your tables are normalized properly is the ____.
aggregate
dimension
Document Analyzer
Count
41. The _______________________ represents the source data for the DW. This layer is comprised - primarily - of operational transaction processing systems and external secondary databases.
aggregate
The degree of granularity
operational and external layer
near-line secondary storage devices
42. A synonym for data mining
system catalog
OLAP
knowledge data discovery
volatile data
43. R- squared(and adjusted r-squared) - A measure of how much of the variability around the target mean is explained by your predictive variables. Doesn't mean you have a good predictive model—only validation will tell you that
OLAP
recognizing known patterns
performance metrics - Numeric Prediction
machine learning
44. Not the same as goodness-of-fit; We want to know how well the model predicts new data - not how well it fits the data it was trained with; Key component of most measures is difference between actual y and predicted y (error)
principle component analysis
Document Analyzer
performance metrics - Numeric Prediction
measuring predictive error
45. These are considered an alternate storage techniques for data warehousing include.
near-line secondary storage devices
operational and external layer
Horizontal integration
lift charts
46. A powerful trend in IT is known as - which maintains that Computer transmission speed doubles every 18 months.
database administrator
drill-across report
groves law
The degree of granularity
47. An economic feasibility measure. So is Internal rate of return.
Regression analysis
Breakeven analysis
volatile data
operational and external layer
48. Gives an idea of systematic over- or under-prediction. Magnitude of average absolute error.
system catalog
average error
composite semantic objects
Cartesian
49. You can save the results of a query as a table by including the _____ clause in the query.
Sum
measuring predictive error
Into
operational and external layer
50. Semantic object link (SOL) attributes establish a relationship between one _______ and another.
semantic object
near-line secondary storage devices
surrogate key
artificial Key