Jul
31
2010
Data Warehousing
User Rating: / 0
PoorBest 
Data warehousing generally gets utilized by the companies for having analyzed the trends over the passage of time. To put it simple, companies might make use of data warehousing for having viewed day-to-day operations. Common data warehousing accessing systems are inclusive of reporting, analysis, and queries. As data warehousing goes on to create single database at last, numbers of the sources can just be anything desired by you, provided that system is capable of handling the volume, by all means. Ultimately, however, a homogeneous data that can get manipulated more simply.

Primary function of data warehousing is that of having facilitated strategic planning which results from the long-term overviews of data. Through such overviews, forecasts, models, business, projections and other reports can get made. If thought from the routine point of view, as data that is stored in the data warehouses has been intended towards having provided more ‘overview-like’ reporting, data turns out to be ‘read-only’. If you wish of having updated data that is stored through data warehousing, a new-fangled query would be needed of being built when you are done.

The above-mentioned principle isn’t to say that the data warehousing goes on to involve the data which never gets updated. Instead, data that is stored in the data warehouses is constantly updated. It is, in fact, the analysis and reporting which consume more of long-term view. Remember that data warehousing isn’t the ‘be-all and end-all’ in case of storing every data of the company. Rather, the thing ‘data warehousing’ is usually utilized for housing the essential data with regards to specified analysis. Data storage, if meant to be more comprehensive needs diverse capacities which happen of being more static, less simply manipulated in comparison with those made use of for ‘data warehousing’.

Larger companies typically make use of data warehousing for having analyzed bigger sets of the data, especially for ‘enterprise purposes’. Small-scale companies wanting to have analyzed just a single subject, for instance, generally access the data marts that are much more specified and also targeted in reporting and storage of theirs. Data warehousing is usually inclusive of smaller quantities of the data which get grouped in to the data marts. This way, a bigger company would have data marts as well as data warehousing at disposal of it, thereby having permitted users of having chosen the functionality and source based on the latest needs.

Architecture


In context of efforts relating to ‘data warehousing’ being put on by an organization, architecture can be referred to as conceptualization of the way in which data warehouse is constructed. Nothing like wrong or right architecture exists. Architecture’s worthiness can get judged with regards to the way in which conceptualization helps in building, usage, and maintenance of data warehouse. A possible easy conceptualization of ‘data warehouse’ architecture goes on to consist of interconnected layers a follows:

Operational database layer


Source data with respect to data for data warehouse- ERP systems of an organization fall in to this layer.

Informational access layer


Data accessed to report, and analyze, along with tools to report and analyze the data- Tools relating to business intelligence fall in to this layer. Inmon-Kimball differences regarding design methodology are also related to this layer.

Data access layer


Interface between informational and operational access layer- the tools used for extracting, transforming, and loading data in to warehouse go on to fall in to this layer.

Metadata layer


Data directory- It is typically more detailed in comparison with data directory of operational system. There’re dictionaries with respect to the whole warehouse; at times, dictionaries for data which can be very well accessed by a specific reporting as well as analysis tool.

‘Normalized v/s dimensional’ approach for data storage


There’re 2 leading approaches with regards to storage of data in the data warehouse- normalized approach and dimensional approach.

In normalized approach, data contained in data warehouse get stored as per ‘Codd normalization rule’. Tables get grouped altogether by the subject areas which go on to reflect usual data categories. For instance: data on finance, products, customers, etc. major advantage of such an approach is that it’s straightforward in terms of addition of information in to database. Its disadvantage is that due to loads of tables involved in it, difficulty can arise in terms of joining data from diverse sources in to meaningful information, and accessing information without exact understanding of data sources and of data warehouse’s data structure.

In dimensional approach, the transaction data get partitioned in to either ‘facts’ that are usually ‘numeric transaction data’, along with ‘dimensions’. These dimensions are nothing but ‘reference information’ giving context to facts. For instance, sales transaction could be broken up in to the facts like price paid regarding products and numbers of the products ordered, that too, in to dimensions like customer name, order rate, bill-to locations, and product number.

Add as favourites (42) | Quote this article on your site

Be first to comment this article

Write Comment
  • Please keep the topic of messages relevant to the subject of the article.
  • Personal verbal attacks will be deleted.
  • Please don't use comments to plug your web site. Such material will be removed.
  • Just ensure to *Refresh* your browser for a new security code to be displayed prior to clicking on the 'Send' button.
  • Keep in mind that the above process only applies if you simply entered the wrong security code.
Name:
Title:
BBCode:Web AddressEmail AddressBold TextItalic TextUnderlined TextQuoteCodeOpen ListList ItemClose List
Comment:

Code:* Code

 
< Prev   Next >