Saturday, December 12, 2009

Data Warehouse

Data Warehouse UNIT I, II, III

1. A Data Warehouse is collection of ___________.
a) Corporate Companies b) Corporate information c) Product d) None

2. The purpose of Data Warehouse is ____________
a) to Support business decisions b) to Support business operations c) Both a &b d) None

3. The answers to business “What-if?” ensures business is _____
a) active b)reactive c) proactive d) None

4. _____ means that you are information deficient.
a) overloaded b) underloaded c) void d) None

5. _____ means you are overwhelmed with enormous glut of information.
a) overloaded b) underloaded c) void d) None

6. The originator of data warehousing concept is ____________
a) Bill Inmon b) Bill gates c)Bill clinton d) None

7. Data warehouses are interested in ____
a) Query processing b) transaction processing c) Both a & b d) None

8. Access to, and the understanding of, information is __________.
a) Decision making b) Power c) Competition d) None

9. DSS stands for ________
a) Decision Support systems b) Decision Source systems
c) Demand support systems d) Decision Secure systems

10. EIS stands for
a) Executive Information support b) Expert Information systems
c) Executive Information systems d) Expert Information support

11. ______ means to preserve security and integrity of mission-critical OLTP.
a) Data cleansing b) insulate c) integrity d) None

12. Only ________ systems offer sufficient bandwidth.
a) Parallel processing b) single processing c) concurrent processing d) None

13. ______ has to remove duplication and reconcile differences between various styles of data collection.
a) Data cleansing b) insulate c) integrity d) None

14. ___ are small warehouses to provide subsets of main store and summarized information depending on the requirements of a specific department.
a) mini data warehouses b) data marts c) Both a & b d) None

15. Data are organized according to ________
a) Subject b) isolate c) absolute d) none

16. Expand SMP
a)Simple multi processing b) Symmetric multi processing c) Ssystematic muti processing d) none

17. Expand MPP
a)Massively parallel processing b) Massively parallel project c) Master parallel program d) none

18. _______ is the final component of the data warehouse.
a) Metadata b) actual c) data d)none

19. Data warehouses rarely exist in _______
a)transparent b) isolation c) group d) none

20. OLAP systems cannot be repositories of _________data
a)Facts and historical data b) Facts c) History d) none

(U-2)
1. We can ensure data quality with a foundation of _________
a) Data warehouse b) Operational systems c) data modeling d) None

2. The data warehouses databases are often called as _________.
a) Informational data b) Enterprise data c) Both a & b d) None

3.Michael Goldberg and Jaikumar Vijayan was published in the __________
a) April 8,1996 b) April 1, 1998 c) May 1, 1999 d) none

4. Data analysis need help from end users and executives to enable to understand their _________
a) Business b) service c) donate d) none

5. Expand ER
a) Entity Redundancy b) Engine relation c) Entity Relationship d) none

(u-3)
1. A typical operational system deals with ______
a) Multiple orders b) One order c) Both a & b d) None

2. A typical data warehouse deals with _____
a) data that is changing b) data that is aggregate in nature c) Both a &b d) None

Each carries 2 marks:-
(u-1)
1. A data warehouse contains information extracted from
i) Operational systems ii) internal datasources iii) External datasources
a) I only b) iii only c) I,iii d) All


2. What are the 2 phases in process of Data warehouse:
i) Extract ii) Insulate iii) Data cleansing iv) summarize
a) ii, iii b) I, iii c) I, iv d) ii, i

3. The “metadata” contains
i) The structure of the data
ii) The algorithm used for summarization
iii) the mapping from the operational env., to data warehouse.
a) I, ii only b) ii, iii only c) I, iii only d) all

4. Which of the following are true with respect to informational data?
i) Updated often and through online transactions
ii) Possibly “read-only”
iii) Summarized operational data
iv) Non- historical data
a) All b) ii, iii only c) I, iii d) None

5. Multi-dimensional solutions provide the ability to:
i) Analyze potentially large amounts of data very quickly
ii) “Slice and dice” through the data
iii) Quickly identify trends or problem areas
a) I, iii only b) I, ii only c) I only d) All

6. Data mining is used for:
i) To analyze the data
ii) Extract the data from different operational systems
iii) Research the data and determine the patterns, classifications and associations.
iv) Multi-dimensional analysis
v) two- dimensional analysis
a) ii, iii, v b) I, iii,iv c) ii, iii, iv d) All

7. “Metadata” has 2 types of data:
i) Technical data ii) business data iii) functional data iv) operation data
a) I, ii b) ii, iii c) iii, iv d) None

8. Which of the following are requirements of data warehouse servers?
i) performance ii) capacity iii) scalability iv) open interfaces v) Multiple data structures
a) I, ii, iv,v b) I, ii, iii, iv c) All d) None

(u-2)
1. The new techniques in data modeling are:
i) star schema ii) ER modeling iii) snowflake iv) snowblake
a) I, iii only b) I, ii c) I, iv d) ALL

2. The 3 different domains that data professional has to manage are:
i) the development environment ii) the execution iii) data warehouse iv) database
a) I, ii, iv b) I, ii, iii c) All d) None

Each carries 4 mark
(u-1)

1. Which of the following is true about Data warehouse?
i) It is loosely defined as any centralized data repository which can be queried for
business benefit.
ii) Extract archived operational data and overcome inconsistencies between different
legacy data formats.
iii) It provides day to day activities data.
iv) It provides data already transformed and summarized for DSS & EIS
a) I, ii only b) I, ii, iv c) I, ii, iii d) All

2. Which of the following are characteristics of data warehouse?
i) subject-variant ii) Load performance iii) Integrated iv) non - volatile
v) Time-variant iv) mass user scalability
a) I, iii, iv, v b) I, ii, iv, v c) I, iii, iv, vi d) All

3. Which of the following are not the criteria of data warehouse?
i) Load performance ii) capacity iii) Data quality management iv) open interfaces
a) I, iii b) ii, iv c) iii, iv d) ii, iii

(u-2)
1. Advantages of data warehouse are:
i) Movement toward the de-massification of business practices.
ii) Consolidation of database records tending toward a single customer view
iii) General recognition of the untapped value in large databases
iv) Reduction in cost of storage and processing.
a) I, ii, iii b) ii, iii, iv c) iv. Ii only d) All

(u-3 & u-1)

1. Which of the following are valid differences between operational systems and data warehouse:
Operational system Data warehouse
i) Deals with only current data i) deals with data stored for longer period
ii) RDBMS ii) other than RDBMS
iii) Data that runs the business iii) Data that analyses business
iv) Changing, incomplete iv) Historical, descriptive
a) I, iii, iv b) I, ii, iii c) iii, iv only d) All
Answers
Each carries 1 mark
1. b 2.a 3.c 4.b 5.a 6.a 7.a 8.b 9.a 10.c 11.b 12.a
13.a 14.b 15.a 16.b 17.a 18.a 19.b 20.a (u-2)1.c 2.b 3.a
4.a 5.c (u-3)1.b 2.b
Each carries 2marks
1.c 2.a 3.d 4.b 5.d 6.b 7.a 8.c (u-2)1.a 2.b
Each carries 4 marks
1.b 2.a 3.b (u-2)1.d (u-3&u-1)1.a

UNIT 4
Each one mark
1. Data warehouse projects require a __________ in approach.
a) bypass b) shift c) change d) none

2. The term back end describes the ________
a) Data repository b) data gate c) data element d) none

3. Front end describes the________ used
a) software b) tools c) hardware d) none

4.__________ allows users to control their own destiny.
a) Empowerment b) Employment c) Enhancement d) none

5. Expand OLAP__________
a)Online Analytical Processing. b) Onlime analysis process c) Onlevel analysis processing d) none

6. Tools that implement OLAP allow users to do _____________
a) business b) Forecasting c) planning d) none

7. A __________ process exists through which systems have been developed.
a) basic b) classic c) populate d) none

8. The team responsibilities are ___________
a) Modular b) classic c) linear d) none

9. The team identifies _______________
a) Responsibilities b) Requirements c) Refinement d) none

10. The design effort is undertaken based on the modeling_________
a) Exercises and findings b) objects c) both d) none

11. Sample data is assembled with the help of the __________
a) Data expert b) Backend expert c) Base expert d) none

12. The programmers play a _________
a) Key role b) best role c) least role d) none

13. The __________ of the data is decided.
a) Granularity b) Gratuity c) Gist d) none

14. Expand KDD_________
a) Knowledge discovery in databases b) Known database c) Kiosk database d) none

15. Data mining helps avoid a ____________
a) Common throw b) common pit fall c) auto pitfall d) none

16. A steering committee consists of a manageable number of ______
a)Senior business executives b) Junior Executive c) laymen executive d) none

17. A user group that contains a select group of ___________
a) Senior business executives b) Junior Executive c) Working management d) none

Two or 4 marks:

1.The key is scope for_________,___________ and_________
i) Success ii) educate iii) planning iv)communicate
a) I, ii only b) I, ii, iv c) I, iv only d) all

2. The fourth key component of our approach is the __________and_____________
a) Methodology and business tools b) methods & tests c) tools& data d) none

3. Business Performance measures derived from the _ and________
a) Strategies and goals b) Strategies & gist’s c) goals & plan d) none

4. Which of the following are true for requirements gathering
i) Cultural requirements ii) technical requirements iii) Enterprise requirements iv) performance based requirement
a) I, ii only b) I, ii, iv c) I, iv only d) all

5. Which of the following are key components for data warehouse:
i) Organization structure ii) User requirements iii) change facilitation
iv) expectation management v) proven methodology
a) I, iii,iv b) ii, iii, v, iv c) I, iii, iv, v d) All
Answers
Each carries one mark
1.b 2.a 3.b 4.a 5.a 6.b 7.b 8.c 9.b 10.a 11.b 12.a 13.a
14.a 15.b 16.a 17.c
Each 2 or 4 marks
1.b 2.a 3.a 4.d 5.c






UNIT 5
Each carries one mark:
1. We should review our situations from two different perspectives: function and _____.
a) responsibility b) person c) Both d) None

2.The most important attribute of a great project manager is an ability to keep an eye on ____ constantly.
a) milestones b) goals c) Both d) None

3. _____ is responsible for creating and reviewing all test plans.
a) Project manager b) Quality assurance specialist c) both d) None

4. ___ must have intimate knowledge of the current legacy system.
a) Power user b) DBA c) system administrator d) None

Each carries 2 or 4 marks:
1. Match the following
Set 1 :A)Project Director B) Project Manager C) Provision Specialist
Set 2 :i) Should establish vision ii) Responsible to senior management
iii) Corporate politician iii) Plan and allocate resources v) Excellent people skills
a) A-I,B-III,C-V b) A-V,B-IV,C-I c) A-V,B-II,C-I d) NONE


2. State which of the following are roles of DBA:
i) Must possess a thorough knowledge of database design ii) Must have political skills
iii) must work closely with technical team iv) must ensure appropriate backup and recovery procedures
v) must have good business analysis skills
a) ii, iii, iv b) I, iii, iv c) All d) None

3. State which of the following are roles of datawarehouse architect:
i) Must possess a thorough knowledge of database design ii) Must have political skills
iii) must work closely with technical team iv) must ensure appropriate backup and recovery procedures
v) must have good business analysis skills
a) ii, iii, iv b) I, v c) All d) None
Answers:
Each 1 marks
1.b 2.a 3.b 4.a
Each 2 or 4 marks
1.a 2.a 3.b
UNIT- 6
1. Project management is the _________
a) Application of knowledge b) skill c) Software d) none

2. Clear business objectives are __________
a) immeasurable b) Measurable c) both d) none

3. A data warehouse is NOT a project, it is a ________
a) Process b) project c) module d) none

4. Warehouses are best built in an_________
a) Interative fashion b) iterative fashion c) evolution fashion d) none

5. ___ is necessary to convey the concepts of data warehouse.
a) training b) initial education c) Both d) None
(2 or 4 marks)
.1. state the order of providing education:
i) data acquisition will probably need to be trained on a transformation tool
ii) Check the web component used to access the datawarehouse and metadata
iii) initial education must be provided on what is a data warehouse
iv) data administration developers will need training on a tool that will integrate company’s data
a) I, ii, iii, iv b) iii, I, iv, ii c) ii, I, iv, iii d) iv, iii, I, ii
Answers
Each carries 1 mark
1.a 2.b 3.a 4.a 5.b
2 or 4 marks
1.b


UNIT 7
1. The project management institute is a __________ organization
a) Nonprofit b) profit c) service d) none

2. A _________ is a written document by which you begin to define the job at hand and all key deliverables.
a) Scope document b) Scope statement c) scale statement d) none

3. Work break down structure we use this technique to help
a) Fill in any gaps or any missing items b) to fill wrongs c) to fill rights d) none

4. The ______ life cycle is specific to what you are building
a) Product development b) process development c) project development d) none

5. Project life cycle follows processes:_____________
a) Initiate,Plan,Executive b) Initiate,Plan,Executive,Control c) Initiate,Plan,Execute, control & close d) none

6. _________ plan is imperative for a data warehouse project
a) An integrated b) nonintegrated c) isolated d) none

7. ________ is used for monitoring and controlling the project
a) plan b) an integrated plan c) design d) none
(2 or 4 marks)
1. Effective project management for a data warehouse includes a focus on __________, ___________, ___________ and __________________.
a) Risk Management, Communications, Planning and Expectations management.
b) Risk Management, Communications, Planning
c) Risk Management, Communications
d) none

2. Which of the following are major elements in the breakdown of a scope statement:
i) project justification ii) project completion iii) project title & description iv) project objective
a) All b) I, iii, iv c) None d) ii only
Answers
Each carries 1 mark
1. a 2.b 3.a 4.a 5.c 6.a 7.b
(2 or 4 marks)
1.a 2.b
Unit 8
1. Activity estimates are low level educated guesses at ____
a) funding decisions b) time taken decisions c) both d) none

2. Project estimates are high level guesses to help you make __________
a) funding decisions b) time taken decisions c) both d) none

3. Activity estimating is not a ________
a) planning b) negotiation c) both d) none

4. A ______should be limited to just one business process
a) Pilot b) project c) process d) none

5. ____________ is critical to the success of the pilot
a) Documentation b) plan c) process d) none

6. A _________ should facilitate documentation, communication
a) design tool b) case tool c) case study d) none
(2 or 4 marks)
1. which of the following require additional procedures.
i) design ii) security iii) audit iv) estimating
a) ii, iii b) I, ii c) All d) None
Answers
Each carries 1 marks
1. b 2.a 3.b 4.a 5.a 6.b
Each carries 2 or 4 marks
1.a




UNIT 9
1. The bell curve is ___ at center.
a) Bulge b) skew c) Both d) None

2. __________- means you get caught by restricting factors.
a) Constraints b) critical paths c) constraints d) none

3. ___________- items you could never predict
a) Assumptions b) unknowns c)constraints d) none

4. _______ are yet another form of risk within a project
a) Critical paths b) unknown c) constraint d) none
(2 or 4 marks)
1. Which of the following are internal risks
i) market shift ii) schedule bumps iii) cost hiccups iv) change in law
a) I, iii, iv b) ii, iv c) ii,iii d) all
Answers
Each carries 1 mark
1.b 2.a 3.b 4.a
Each 2 or 4 marks
1.a
UNIT 10
1. _________ indexing mechanism consists of B-tree indexes
a) Oracle b) Sybase c) Sql server d) none

2. The new ______ index feature, query processing and index access can improve.
a) B-tree b) bitmap c) Both d) None

3. ____ are Query intensive applications.
a) DSS/EIS OLAP b) OLTP server b) OLFF d) none

4. Oracle goes through ________ process for building an index.
a) Two- step process b) three step process c) many step process d) none

5. __________is a measurement of the number of unique values
a) degree b) cardinality c) index d) none

6. If the degree of cardinality of a column is ___________ percent, it is an ideal candidate for a bit mapped
index.
a) <= .1 b) >=.1 c) >=100 d) none

7. __________ is neither a primary reason for using bitmap indexes.
a) Saving space b) more space c) more time d) none
Answers
1. a 2.b 3.a 4.a 5.b 6.a 7.a
UNIT 11

1. The _____________ is a common method used to store data in the warehouse.
a) Star schema b) Tabular c) B Tree d) none

2. The ____________ processes large amounts of data.
a) OLTP query b) DSS query c) UII query d) none

3. The star schema has _________ parts.
a) Three. B) two c) four d) none


4. The dimensional tables contains ____ about data stored in fact tables.
a) Attributes b) Entity c) Tuples d) none

5. The warehouse user begins with _______________
a) Fact b) dimensions c) both d) none


6. The fact table is normally limited to ________
a) generic data b) Char data c) numeric data d) none

7. Oracle’s _________ optimized.
a) Cost-based b) Performance-based c) Process based d) none

8. The star schema is best suited for ______
a) transaction processing b) analytical processing c) both d) none

9. Expand DISC__________
a) Dynamic Information Systems Corporation b) Dynamic Interaction Systems corp.,
c) Det-Information Systems Corporation d) none

10. The more complex schema is referred to as ____.
a) Star schema b) snowflake c) B Tree d) none
Answers
1.a 2.b 3.b 4.a 5.a 6.c 7.a 8.b 9.a 10.b
UNIT 12
1. Read only table spaces are useful for improving the _______ time.
a) Response b) request c) process d) none

2. A tablespace may contain one or many __________
a) control files b) data files c) redo log files d) none

3. Compatible must be set to __________
a) Compatible =7.n.0.0 b) Compatible =8.0 c) Compatible =90 d) none

4. Which command is valid for making Table space read – only.
a) alter system tablespace read only b) alter tablespace read only c) Both d) None

(2 or 4 marks)
1. Which of the following are reasons for not taking backups of read-only tablespaces:
i) Their contents never change ii) Oracle manages their status iii) Oracle takes the backup of these auto
a) I only b) I, ii c) all d) None

2. What are the requirements for read-only tablespces:
i) The tablespace must be offline ii) The tablespace must not have any active transactions.
iii) The SYSTEM tablespace can be read only iv) the tablespace cannot be currently in backup mode
a) I, ii only b) I, iii only c) ii, iv d) All
Answers
Each 1 marks
1.a 2.b 3.a 4.b
2 or 4 marks
1.b 2.c
UNIT 13
1 To take advantage of parallel processing, a computer must have _____
a) >1 CPU b) 0 CPU C) 100 cpu d) none

2. The data warehouse is an ideal candidate for _________
a) single operations b) parallel operations c) simple operations d) none

3. _________ are the building block for parallel processing
a) Query servers b) Query items c) Query list d) none

4. SMP architecture entails multiple processors sharing ________
a) Common memory b) Share memory c) static memory d) none

5. ____ ensures that the consistent status of the database remains intact after a system crash.
a) backup b) recovery c) Both d) None

6. If any of the extra processes are idle for a number of minutes in ____ they are terminated.
a) PARALLEL_SERVER_IDLE_TIME b) SERVER_IDLE_TIME c) Both d) None
Answers
1.a 2..b 3.a 4.a 5.b 6.a




UNIT 14
1. In operational systems, database objects can grow at _____.
a) slow rate b) exponential rates c) faster rates d) None

2. The optimizer can follow different ___________
a) Query execution plans b) Query plans c) Query elements d) none

3. The query execution plan is selected by the optimizer based on information it finds in Oracle’s ____
a) memory b) data dictionary c) Both d) None
(2 or 4)
1. Which of the following are true for optimizer:
i) It is a set of internal routines ii) it makes decisions on the efficient, cost-effective plan for queries
iii) it uses cost-based approach iv) it an follow different query execution plans.
a) ii, iii only b) I, iii, iv c) I, ii, iv d) all
Answers
1.b 2.a 3.b
2 or 4 marks
1.d
UNIT 15
1. Oracle records every transaction in a file it calls the_________
a) Redo log b) Control log c) Dead log d) none

2. Each Oracle database needs at least ___ redo log files.
a) 2 b) 1 c) 0 d) None

3. What is the meaning of “unrecoverable”
a) no way exists to destroy b) no way exists to reconstruct c) Both d) none
Answers
1.a 2.a 3.b
UNIT 16
1. ____ is a discovery process.
a) Data warehousing b) data mining c) Both d) None

2. ____ goes out with no pre determined idea of what the search will find.
a) Invention b) event c) discovery d) None

3. Discovering ________ is key to successful marketing.
a) Relationship b) Pattern c) Element d) none

4.A ____________ exists in pattern discovery .
a) Temporal component b) Temporal c)Component d) none

5. Patterns are closely related to_____________
a) Habit b) No habit c) hanger d) none

6 Data mining is the process of finding __________
a) facts & figures b) Correlations or patterns c) data flow d) none

7. ___ is an example of Operational Or transactional data
a) Sales b) market c) product d) none

8. _____ is an example of Non Operational data
a) Forecast data b) sales data c) cost data d) none

9. Information can be converted into _______
a)decision making b) Information c) unknown things d) none

10. Walmart computers processed over __________ complex data queries.
a) One million b) hundred c) two hundred d) none

11) Expand NBA________
a) Nested basketball Association b) National basketball Association
c) Nation basketball Association d) none

12) __________ is stored data to locate data in predetermined groups.
a)Classes b) Object c) String d) none

13) _____are non-linear predictive models that learn through training.
a) genetic algorithms b) Artificial neural networks c) decision trees d) none

14) ____ are optimization techniques that uses processes such as mutation, combination, etc.,
a) genetic algorithms b) Artificial neural networks c) decision trees d) none

15) _____ of data is needed to maximize user access and analysis.
a) distribution b) centralized c) Both d) None

16) Selection is also said to be __________
a)Segmenting b) Clustering c) Association d) none

17) __________ is the inference of information .
a)deduction b) Induction c) segmentation d) none

18)Induction is therefore the ____________
a)Extraction of patterns b) analyzing patterns c) Both d) none

19) ____ is the automation of learning process.
a)Machine learning b) Machine reading c) Machine teaching d) none

20) In _____ model the system automatically discovers important information hidden in the data.
a) verification b)Discovery c) Both d) None
(2 or 4 marks)
1) Which of the following are the valid differences between KDD and ML:
i) KDD is concerned with very large real-world databases, while ML deals with small data sets.
ii) KDD is concerned with finding understandable knowledge, while ML is concerned with improving performance.
iii) ML is a broader field which includes not only learning from examples but also reinforcement learning.
a) I only b) ii, iii c) I, iii d)all

2) state the correct order of the stages:
i) preprocessing ii) transformation iii)selection iv) interpretation v) data mining
a) iii, I, ii, v, iv b) ii, iii, i, v, iv c) iii, I, ii, iv, v d) iii, I, ii, v, iv
Answers
Each 1 marks
1.b 2.c 3.a 4.a 5.a 6.b 7.a 8.a 9.b 10.a 11.b 12.a 13.b 14.a 15.b 16.a 17.b 18.a 19.a 20.b
Each 2 or 4 marks
1.d 2.a
UNIT 17
1) The primary benefit of data mining is the ability to turn feelings into _________
a)Facts b) Transaction c) Manage d) none

2) Data mining can discover unexpected patterns in_________
a)Information b) Behavior c) Association d) none

3)The data mining started with an idea of what they are looking for, is called _____.
a) source data mining b) targeted data mining c) Both d) None

4) Fraud detection is seen primarily as ________ data mining.
a)Out-of-the-blue. b) in-the-blue c) a or b d) none

5) Error in either the values of attributes or class information are known as __________
a)incomplete b) noise c) both d) none

6) ROI stands for
a) return on input b) revenue on investment c) Return on investment d) None

7) ____ refers to the severity of the error and the degree of noise in the data.
a) Uncertainity b) noise c) both d) None
Answers
1.a 2.b 3.b 4.a 5.b 6.c 7.a
UNIT 18
1) Data mining can fast track the ___________ process.
a)decision making b) Discovery c) Association d) none

2) Decision making should be driven by knowledge of _______
a) past performance b) sales forecast c) maintaining accounts d) None


3) Data mining assists the __________
a)decision making b) Discovery c) Association d) none

4) Data mining helps transform vast amounts of data into _________
a) Discovery b) data c) information d) none

5) Data mine tools have to ____________ a model from the database.
a)Infer b) Intra c) inbound d) none

6) The few attributes that denote the class of a tuple are known as____
a) predicting attributes b) predicted attributes c) both d) none

7) The combination of values for predicted attributes defines a ____.
a) object b) class c) both d) none

8) ___ allows some exceptions, but the exceptions have a given limit.
a) exact rule b) strong rule c) probabilistic d) none

9) ___ allows no exceptions.
a) exact rule b) strong rule c) probabilistic d) none

10) _________ functions analyze a collection of records over period of time.
a) Sequential/ Temporal. b) Sequential c) Temporal d) none

11) Clustering and segmentation are the processes of creating a_________
a)subsets b) Partition c) cells d) none

12) A ____ is a set of objects grouped together because of their similarity or proximity.
a) classes b) objects c) clusters d) None
(2 or 4 marks)
1. Which of the following are the functions of data mining:
i) classification ii) associations iii) generalizations iv) clusters\segmentation v) sequential\temporal
a) I, ii only b) I, ii, iv, v c) All d) None
Answers
Each 1 marks
1.b 2.a 3.a 4.c 5.a 6.b 7.b 8.b 9.a 10.a 11.b 12.c
(2 or 4 marks)
1.b
UNIT 19
1) Neural network based mining is especially suited to identify ________
a) patterns b) associations c) both d) none

2) A ___ identifies a movement in habit based on past behavior.
a) data-flow b) trend c) pattern d) None

3) The ___ layer is the hidden layer which performs the work.
a) Input b) middle c) both d) None

4) Each node in every layer is _________ to each node in the adjacent layer.
a) Interconnected b) connected c) disconnected d) none

5) _________ is a technique to infer information that is generalized from the database.
a)deduction b) induction c) both d) none

6) _________ is a technique to infer information that is logical consequence of the information.
a)deduction b) induction c) both d) none

7) ___ involves grouping data together based on asset of similarities predefined by analysts.
a) association discovery b) classification c) clustering d) None

8) _____ attemptes to find patterns between events that occur in a progression over a period of time.
a) sequential discovery b) association discovery c) clustering d) None

9) Decision trees are simple knowledge __________
a)presentation b) Diagram c) representation d) none

10) ____ rules are widely used to represent information in expert systems.
a) Deduction b) production c) deduction d) None

11) __ refers to the ability to look at the database from different view points.
a) slicing & dicing b) consolidation c) drill-down d) none

12) FASMI stands for ____
a) Fast Analysis of Shared Multidimensional Information
b) Fast Addition Shared Multidimensional Information
c) Frequent Analysis of Shared Multidimensional Information
d) none

13) ____ makes it possible for the analyst to gain deeper, more intuitive understanding of the data.
a) Data aggregation b) Data visualization c) Both d) None
(2 or 4 marks)
1) Which of the following are the components of OLAP:
i) A multidimensional database representation must be simpler ii) Intuitative navigation iii) Instant response
a) I only b) ii only c) iii only d) All

2) Which of the following are true with respect to OLAP queries.
i) Access very large amounts of data
ii) Involve aggregated data
iii) There is no requirement for a join
iv) involve complex calculations
a) I, ii, iii only b) I, ii, iv c) All d) None
Answers
Each 1 mark
1.a 2.b 3.b 4.a 5.b 6.a 7.b 8.a 9.c 10.b 11.a 12.a 13.b
(2 or 4 mark)
1.d 2.b
UNIT 20
1) Data marts are ___________
a) subject –oriented. B) process oriented c) bypass oriented d) none

2) Data marts must provide a _________ rapid solution.
a)performance-effective b) cost-effective c) Cord effective d) none

3) A ___ is simply executed over and over again.
a) iterative process b) interactive process c) both d) None

4) _________ are a subset of a large data warehouses.
a) Data marts b) work groups c) work elements d) none

5) In a largely decentralized company, segments of business community have funded, developed and deployed the data mart solution virtually without the involvement of the personnel tasked with the management of computer system solutions. Such data marts are called as ___
a) Distributed data marts b) stand-alone data marts c) C/S data marts d) None
Answers
1.a 2.b 3.b 4.a 5.b
UNIT 21
1) Data marts help provide solutions for __________
a)Corporate businesses b) smaller businesses c) both d) none

2) Expand COTS________
a)Commercial-Off-the-shelf b) Commercial Off-shelf C) Component-off-the-shelf. D) none

3) Data mart is built around a _________ structure
a)inner-join b) star-join c) Outer-join d) none

4) The data mart is typically housed in __________
a) Multidimensional technology b) Dimensional c) rows d) none

5)There are _________ kinds of data marts
a) Two b) Three c) many d) none

6) The data warehouse contains the most__________ data
a)summary data b) granular data c) summation d) none

7) The data warehouse data is integrated from the many _______
a) Legacy sources b) ERP source c) RDBMS d) none

8) The _________ is designed to suit the needs of a department
a)Data warehouse b) Data mart c) both d) none

9) The data mart contains _________ data
a)Aggregated b) summary c) both d) none

10) The data warehouse data structure is essentially _____.
a) star-schema b) normalized c) denormalized d) None
(2 or 4 marks)
1. Which of the following are not true with respect to data marts?
i) They contain detailed information
ii) The structure of the data in data mart is faintly historical
iii) They are specific to a department
iv) They do not contain subject-wise information
a) I only b) I, iii only c) I, iv d) all

2. Which of the following are true with respect to data warehouse?
i) They contain robust historical information
ii) They are lightly indexed
iii) The data structure in this is normalized structure
iv)The data is integrated from many legacy sources.
a) I, ii b) ii, iv c) all d) None
Answers
1 marks
1.b 2.a 3.b 4.a 5.a 6.b 7.a 8.b 9.c 10.b
2 or 4 marks
1.c 2.c
UNIT 22
1) _________ want a cost – effective tool.
a) Consumers b) Customer c) professional d) none

2) Extraction is closely related to _________
a) Translation b) Transformation c) Transport d) none

3) data mart providers must play a role in facilitating _______ process.
a) Extraction b) loading c) both d) none

4) ________________ ensures that as data is moved into the data mart according to standards.
a)Transformation b) Translation c) Transport d) none

5) During loading, The data can be put directly into the target tables or moved into _________
a) fact tables b) Middle table c) intermediary table d) none

6) Machine with ___ processors are ideal candidates for housing the data mart and warehouse.
a) One b) more than one c) more than two d) None
(2 or 4 marks)
1) Which of the following are core modules of data marts:
i) Extraction ii) transformation iii) load
a) I only b) ii only c) iii only d) All
Answers
Each carries 1 mark
1.a 2.b 3.a 4.a 5.c 6.b
2 or 4 marks
1.d
UNIT 23
1) ________ is a category of technology that enables users to gain insight into their data.
a) OLAP b) OLDD c) OLTT d) none

2) A flat file is a collection of _________data
a)Graphic data b) text data c) both d) none

3) A dimensions is some way to locate the value of a ____________
a)Performance measure b) attribute c) column d) none

4) The ability to organize the data in the way users think about it is known as ___.
a) multidimensionality b) RDBMS c) Both d) None


5) The key to the OLAP database is its ________
a)Tables b)Dimensions c) Tuples d) none

6) A __________is a single data point
a) Cell b) Call c) Process d) none

7) Means that the system is targeted to deliver most responses to users within about _______
a) 15 seconds b) 5 seconds c) 500 seconds d) none

8) __________ means that the system can cope with any business logic and statistical analysis.
a) Fast b) Information c) Analysis d) none

9) __________ means that the system implements all the security requirements
a) Fast b) shared c) proper d) none

10) The ____________must provide a multidimensional conceptual view
a) System b) process c) hardware d) none

11) ___________ as a mediator
a)OLAP b) OLTT c) OLDD d) none

12) ___ refers to integration between an OLAP engine and denormalized source data.
a) Transparency b) treatment on non-normalized data c) Both d) None
(2 or 4 mks)
1) Which of the following are features of OLAP?
i) Multi dimensional conceptual view
ii) Intuitive data manipulation
iii) Accessibility
iv) Transparency
a) I only b) ii, I, iii c) ii, iv, I d) all
Answers
Each 1 mark
1.a 2.b 3.a 4.a 5.b 6.a 7.b 8.c 9.b 10.a 11.a 12.b
2 or 4 marks
1.d
UNIT 24
1) ____________ key architectures exists for OLAP.
a) Two.. b) Three c) Ten d) none

2) MOLAP stands for
a) Mono- OLAP b) Multidimensional OLAP c) Multi – OLAP d) None

3) ROLAP stands for
a) relational- OLAP b) robust OLAP c) realistic OLAP d) None

4) ___ deals with large data sets.
a) MOLAP b) ROLAP c) Both d) None

5) In ____ the data can be recompiled quickly.
a) MOLAP b) ROLAP c) Both d) None
UNIT 25
1) The primary operation of OLAP is _____.
a) reporting b) analyze c) Both d) None

2) the type of data dealt by DSS is ____
a) details b) summary c) Both d) None
(2 or 4 mks)
1) the timeliness of data must be current & historical in which cases:
i) OLTP ii) DSS iii) OAP
a) I only b) I, iii c) ii, iii d) all
Answers(UNIT 24 & 25)
Each 1 mark
(u24) 1.a 2.b 3.a 4.b 5.a
(u25) 1.b 2.c
Each 2 or 4 marks
(25) 1.c
*********************