Académique Documents
Professionnel Documents
Culture Documents
Data Warehouse:
Holds multiple subject areas
Holds very detailed information
Works to integrate all data sources
Does not necessarily use a dimensional model but feeds dimensional models.
Data Mart
Often holds only one subject area- for example, Finance, or Sales
May hold more summarised data (although many hold full detail)
Concentrates on integrating information from a given subject area or set of source
systems
Is built focused on a dimensional model using a star schema.
The interesting thing about these approaches, is that the harder you
work on really conforming your dimensions, the more your data marts
look like the data warehouse that Inmon advocates. (Data modellers in
the know will be jumping up and down right now shouting NO they don't-
but this is a high level conversation...) But the reality is, even in a data
warehouse, issues will arise that require compromise- things that just
don't map or conform, and budget, schedule and business reality will
mean that nothing is ever perfect, and in the end the world is full of data
warehouses that are less conformed than some data mart clusters. Its
just not simple.
Generally, it is probably true that data warehouses provide a solution
that is closer to the "single version of the truth", but they do take a HUGE
amount of effort, and an ability to coordinate across the entire
organisation. If you have not already built at least half a dozen data
marts, don't think you can estimate how much effort a data warehouse
will take. You can't. And bring your cheque book.
Whereas data marts might deliver some value early, if built without
sufficient effort on cross functional mapping and data cleanup they are
just more silo systems and have their own set of costs and issues. Don't
measure payback on datamarts in years- nothing is the same in a few
years, you'll be back to the drawing board shortly.
It's a real dilemma. So which one? Data warehouse? Data mart? In my
view, the right answer is "it depends" and "yes". However, never launch a
data warehouse project as your first shot. A successful strategy will
balance the fast, pain point addressing solutions, with a medium and
long term plan, and investment in infrastructure and competencies to
build the technology, processes and culture that a company needs to
manage information. Depending on your industry and how sucessful you
are, a massive data warehouse might be in your future. But sorry, no
magic bullet.
Don't do these things in order- this isn't step 1, 2, 3- actively work on all
three levels at once and ensure the plans at each level are coordinated.
Data to the People
People are building spreadsheets, and spending money on data base
development now- you know they are. Give them better tools, help them
better use the spreadsheets, and formalize the way they do. The do-it-
yourself exists- but it doesn't have to be completely informal.
The Datamartist tool is adding another capability that can accelerate the
process- letting you move more quickly, proto-typing and analysing to
determine which areas are ready for additional analysis capability.
In some cases Datamartist itself might simply be the best choice for
certain types of analysis, cutting costs dramatically. In addition, if your
end users are building their own data marts, when it comes time to build
server based data marts they know the concept, they understand the
structure, and can even provide concrete examples of the dimensions
they need.
But whatever you do- don't make the mistake of thinking this is all you
need. Work on all levels at once.