
What is the difference between a data warehouse and a data mart?
This is a heavily debated issue. There are inherent similarities between the basic constructs used
to design a data warehouse and a data mart. In general, a data warehouse is used at the
enterprise level, while data marts are used at the business division or department level. A data
mart contains only the subject-specific data required for local analysis.

What is the difference between a W/H and an OLTP application?
Typical relational databases are designed for on-line transactional processing (OLTP) and do not
meet the requirements for effective on-line analytical processing (OLAP). As a result, data
warehouses are designed differently than traditional relational databases.

Warehouses are Time Referenced, Subject-Oriented, Non-volatile (read only) and Integrated.

OLTP databases are designed to maintain atomicity, consistency, isolation and durability (the
"ACID" properties). Since a data warehouse is largely read-only, these constraints can be relaxed.


What is the difference between OLAP, ROLAP, MOLAP and HOLAP?
ROLAP, MOLAP and HOLAP are specialized OLAP (Online Analytical Processing) applications.

ROLAP stands for Relational OLAP. Users see their data organized in cubes with dimensions,
but the data is really stored in a relational database (RDBMS) like Oracle. Because the RDBMS
stores data at a fine-grained level, response times are usually slow.

MOLAP stands for Multidimensional OLAP. Users see their data organized in cubes with
dimensions, but the data is stored in a multidimensional database (MDBMS) like Oracle Express
Server. In a MOLAP system, a lot of queries have a finite answer, and performance is critical
and usually fast.

HOLAP stands for Hybrid OLAP. It is a combination of both worlds. Seagate Software's Holos is
an example of a HOLAP environment. In a HOLAP system one will find queries on aggregated data
as well as on detailed data.


What is the difference between an ODS and a W/H?


An ODS (Operational Data Store) is an integrated database of operational data. Its sources
include legacy systems and it contains current or near term data. An ODS may contain 30 to 90
days of information.

A warehouse typically contains years of data (Time Referenced). Data warehouses group data by
subject rather than by activity (subject-oriented). Other properties are: Non-volatile (read only)
and Integrated.


What is the difference between Oracle Express and Oracle Discoverer?
Express is a multidimensional (MD) database and development environment. Discoverer is an
ad-hoc end-user query tool.

Q. Why should the OLTP database be different from the data warehouse database?

 OLTP and data warehousing require two very differently configured systems
 Isolation of Production System from Business Intelligence System
 Significant and highly variable resource demands of the data warehouse
 Cost of disk space no longer a concern
 Production systems not designed for query processing

A data warehouse usually contains historical data that is derived from transaction data, but it can
include data from other sources. Having separate databases separates the analysis workload from
the transaction workload and enables an organization to consolidate data from several sources.

Q. What is the main difference between Data Warehousing and Business Intelligence?

The differences are:

DW - is a way of storing data and creating information through leveraging data marts. Data
marts (DMs) are segments or categories of information and/or data that are grouped together to
provide 'information' into that segment or category. A DW does not require BI to work. Reporting
tools can generate reports from the DW.

BI - is the leveraging of the DW to help make business decisions and recommendations.
Information and data rules engines are leveraged here to help make these decisions, along with
statistical analysis tools and data mining tools.

Q. What is the difference between OLTP and OLAP?

OLAP - Online Analytical Processing. Mainly required for decision support systems (DSS). Data
is denormalized, mostly non-volatile, and highly indexed to improve query response time.
OLTP - Online Transaction Processing. The workload is mostly DML. Data is highly normalized
to reduce deadlocks and increase concurrency.

Q. What is the difference between OLTP and data warehouse?

Operational System                          Data Warehouse

Transaction processing                      Query processing
Time sensitive                              History oriented
Operator view                               Managerial view
Organized by transactions                   Organized by subject
  (Order, Input, Inventory)                   (Customer, Product)
Relatively smaller database                 Large database size
Many concurrent users                       Relatively few concurrent users
Volatile data                               Non-volatile data
Stores all data                             Stores relevant data
Not flexible                                Flexible

What is the difference between a sequential file and a dataset? When to use
the Copy stage?

The Sequential File stage stores a small amount of data in a file with any extension,
whereas a Data Set is used to store a huge amount of data and can only be opened
with the .ds extension. The Copy stage copies a single input data set to a number of
output data sets. Each record of the input data set is copied to every output data
set. Records can be copied without modification, or you can drop columns or change
their order.

What is the exact difference between the Join, Merge and Lookup stages?

The three stages differ mainly in the memory they use.

DataStage doesn't know how large your data is, so it cannot make an informed choice
whether to combine data using a Join stage or a Lookup stage. Here's how to decide
which to use:

If the reference datasets are big enough to cause trouble, use a join. A join does a
high-speed sort on the driving and reference datasets. This can involve I/O if the
data is big enough, but the I/O is all highly optimized and sequential. Once the sort
is over, the join processing is very fast and never involves paging or other I/O.

Unlike the Join and Lookup stages, the Merge stage allows you to specify several
reject links, as many as there are input links.
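
The memory trade-off between a lookup and a join can be sketched in plain Python. This is only an illustration, not DataStage code: the function names and the sample rows are hypothetical, and the sketch assumes an inner join with unique keys on the reference side. The lookup-style function keeps the whole reference set in memory, while the join-style function sorts both inputs and then merges them in one sequential pass.

```python
def lookup_combine(driving, reference):
    """Lookup-style: hold the whole reference set in memory, probe once per driving row."""
    ref_by_key = dict(reference)  # the entire reference set must fit in memory
    return [(k, v, ref_by_key[k]) for k, v in driving if k in ref_by_key]


def sort_merge_join(driving, reference):
    """Join-style: sort both inputs by key, then merge them in one sequential pass."""
    left = sorted(driving)
    right = sorted(reference)
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i][0] < right[j][0]:
            i += 1          # driving key has no match yet, move on
        elif left[i][0] > right[j][0]:
            j += 1          # reference key is behind, catch up
        else:
            out.append((left[i][0], left[i][1], right[j][1]))
            i += 1          # keep j: the next driving row may share this key
    return out


# Hypothetical sample data: (customer_id, order) and (customer_id, name).
orders = [(3, "c"), (1, "a"), (2, "b"), (1, "a2")]
customers = [(1, "Ann"), (2, "Bob"), (4, "Dee")]

joined = sort_merge_join(orders, customers)
looked_up = lookup_combine(orders, customers)
```

Both functions return the same matched rows. The difference is where the cost goes: the lookup pays in memory for the reference set, and the join pays up front for the sort.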

What is the difference between DataStage and DataStage TX?

It's a critical question to answer, but one thing I can tell you is that DataStage TX is
not an ETL tool, and it is not a new version of DataStage 7.5.
As far as I know, TX is used for ODS sources.

Difference between a hashed file and a sequential file?

A hashed file stores data based on a hash algorithm and a key value. A sequential
file is just a file with no key column. A hashed file can be used as a reference for a
lookup; a sequential file cannot.

Searching for a record is very fast in a hashed file: based on the hash key, we can get
the address of the record directly. In a sequential file the search has to proceed
record by record. We can also remove duplicate records based on the hash key in a
hashed file, which we cannot do in a sequential file.
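
The difference in access pattern can be illustrated with a short Python sketch. It is an analogy rather than DataStage code: a Python `dict` stands in for the hashed file, a list of tuples stands in for the sequential file, and the record values are made up. The sketch also assumes that a later record with the same key replaces the earlier one, which is how the key-based de-duplication is modelled here.

```python
def sequential_find(records, key):
    """Scan record by record; return the first matching value and the records read."""
    for steps, (k, value) in enumerate(records, start=1):
        if k == key:
            return value, steps
    return None, len(records)


def build_hashed(records):
    """Index records by key; a later record with the same key overwrites the earlier one."""
    return {k: v for k, v in records}


# Hypothetical input with a duplicate key "A1".
records = [("A1", "x"), ("B2", "y"), ("A1", "z"), ("C3", "w")]

value, records_read = sequential_find(records, "C3")  # reads every record up to C3
hashed = build_hashed(records)
direct_value = hashed["C3"]                           # one keyed access, no scan
```

The sequential search reads four records to reach `C3`, while the keyed structure reaches it directly and ends up holding one entry per key.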

What are Static Hash files and Dynamic Hash files?

Hashed files have a default size established by their modulus and separation
when you create them, and this can be static or dynamic.

Overflow space is only used when data grows beyond the reserved size for one of
the groups (sectors) within the file. There are as many groups as specified by the
modulus.

There are two types of hashed files:

1. Static
2. Dynamic

Use a dynamic hashed file when you don't know how much data will come from the
source side; it allows the file to grow automatically as data is loaded.

Use a static hashed file only when you know the fixed amount of data you are
loading into the target DB.
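
The relationship between modulus, groups and overflow in a static hashed file can be sketched as follows. This is a deliberately simplified model and not the real file format: CRC32 stands in for the actual hash algorithm, `group_capacity` counts records rather than bytes, and all names are hypothetical.

```python
import zlib


def group_for(key, modulus):
    """Map a key to one of `modulus` groups (CRC32 is a stand-in hash)."""
    return zlib.crc32(key.encode("utf-8")) % modulus


def load_static(keys, modulus, group_capacity):
    """Place keys into a fixed number of groups; extras spill into overflow."""
    groups = [[] for _ in range(modulus)]
    overflow = []
    for key in keys:
        group = groups[group_for(key, modulus)]
        if len(group) < group_capacity:
            group.append(key)
        else:
            overflow.append(key)  # the reserved space for this group is full
    return groups, overflow


# 20 keys into 4 groups of 3: at most 12 fit, so some must overflow.
keys = [f"K{i}" for i in range(20)]
groups, overflow = load_static(keys, modulus=4, group_capacity=3)
```

With a fixed modulus the primary space never grows, so loading more data than planned pushes records into overflow. A dynamic file avoids this by resizing as data arrives.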

Differences between star and snowflake schemas?


The star schema is created when all the dimension tables directly link to the fact table. Since
the graphical representation resembles a star it is called a star schema. It must be noted that
the foreign keys in the fact table link to the primary key of the dimension table. This sample
provides the star schema for a sales_fact for the year 1998. The dimensions created are
Store, Customer, Product_class and time_by_day. The Product table links to the product_class
table through the primary key and indirectly to the fact table. The fact table contains foreign
keys that link to the dimension tables.
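
A minimal sketch of such a schema, using Python's built-in `sqlite3` module, is shown here. Only two of the dimensions are included to keep it short, and the column names and sample rows are assumptions made for illustration, not taken from the original sample. The fact table holds foreign keys to `store` and `product`, and `product` in turn links to `product_class`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE store (store_id INTEGER PRIMARY KEY, store_name TEXT);
CREATE TABLE product_class (product_class_id INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE product (
    product_id INTEGER PRIMARY KEY,
    product_name TEXT,
    product_class_id INTEGER REFERENCES product_class(product_class_id)
);
-- Fact table: one row per sale, foreign keys point at the dimension tables.
CREATE TABLE sales_fact (
    store_id INTEGER REFERENCES store(store_id),
    product_id INTEGER REFERENCES product(product_id),
    store_sales REAL
);
INSERT INTO store VALUES (1, 'Downtown'), (2, 'Airport');
INSERT INTO product_class VALUES (10, 'Snacks'), (20, 'Drinks');
INSERT INTO product VALUES (100, 'Chips', 10), (200, 'Cola', 20);
INSERT INTO sales_fact VALUES (1, 100, 5.0), (1, 200, 3.0), (2, 200, 4.0);
""")

# product_class is reached indirectly: fact -> product -> product_class.
sales_by_category = conn.execute("""
    SELECT pc.category, SUM(f.store_sales)
    FROM sales_fact AS f
    JOIN product AS p ON p.product_id = f.product_id
    JOIN product_class AS pc ON pc.product_class_id = p.product_class_id
    GROUP BY pc.category
    ORDER BY pc.category
""").fetchall()

# store is reached directly from the fact table with a single join.
sales_by_store = conn.execute("""
    SELECT s.store_name, SUM(f.store_sales)
    FROM sales_fact AS f
    JOIN store AS s ON s.store_id = f.store_id
    GROUP BY s.store_name
    ORDER BY s.store_name
""").fetchall()
```

The store query needs one join because the dimension links directly to the fact table, while the category query needs two because `product_class` is one step further away.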

What is the difference between view and materialized view?


View - stores the SQL statement in the database and lets you use it as a table. Every time you
access the view, the SQL statement executes.
Materialized view - stores the results of the SQL in table form in the database. The SQL statement
executes only when the materialized view is created or refreshed; after that, every time you run
the query, the stored result set is used. Pros include quick query results.
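
The behaviour can be demonstrated with Python's built-in `sqlite3` module. SQLite has no materialized views, so this sketch stands one in with a table created from the query result. The table and column names and the `refresh_snapshot` helper are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE sales (region TEXT, amount REAL);
INSERT INTO sales VALUES ('east', 10.0), ('west', 20.0);

-- A view stores only the query text; the query runs on every access.
CREATE VIEW v_totals AS
    SELECT region, SUM(amount) AS total FROM sales GROUP BY region;

-- Stand-in for a materialized view: the result set itself is stored.
CREATE TABLE mv_totals AS
    SELECT region, SUM(amount) AS total FROM sales GROUP BY region;
""")

# New data arrives after both objects were created.
conn.execute("INSERT INTO sales VALUES ('east', 5.0)")

view_east = conn.execute(
    "SELECT total FROM v_totals WHERE region = 'east'").fetchone()[0]
snapshot_east = conn.execute(
    "SELECT total FROM mv_totals WHERE region = 'east'").fetchone()[0]


def refresh_snapshot(connection):
    """Recompute the stored result set, as a materialized view refresh would."""
    connection.executescript("""
    DELETE FROM mv_totals;
    INSERT INTO mv_totals
        SELECT region, SUM(amount) FROM sales GROUP BY region;
    """)
```

The view reflects the new row immediately, while the stored result stays at its old value until `refresh_snapshot(conn)` is called. That staleness is the price paid for the faster reads.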

Differences between star and snowflake schemas?


Star schema - all dimensions are linked directly to a fact table.
Snowflake schema - dimensions may be interlinked or may have one-to-many relationships with
other tables.
