Logo

Navigation
  • Home
  • Services
    • ERP Solutions
    • Implementation Solutions
    • Support and Maintenance Solutions
    • Custom Solutions
    • Upgrade Solutions
    • Training and Mentoring
    • Web Solutions
    • Production Support
    • Architecture Designing
    • Independent Validation and Testing Services
    • Infrastructure Management
  • Expertise
    • Microsoft Development Expertise
    • Mobile Development
    • SQL Server Database and BI
    • SAP BI, SAP Hana, SAP BO
    • Oracle and BI
    • Oracle RAC
  • Technical Training
    • Learn Data Management
      • Business Intelligence
      • Data Mining
      • Data Modeling
      • Data Warehousing
      • Disaster Recovery
    • Learn Concepts
      • Application Development
      • Client Server
      • Cloud Computing Tutorials
      • Cluster Computing
      • CRM Tutorial
      • EDI Tutorials
      • ERP Tutorials
      • NLP
      • OOPS
      • Concepts
      • SOA Tutorial
      • Supply Chain
      • Technology Trends
      • UML
      • Virtualization
      • Web 2.0
    • Learn Java
      • JavaScript Tutorial
      • JSP Tutorials
      • J2EE
    • Learn Microsoft
      • MSAS
      • ASP.NET
      • ASP.NET 2.0
      • C Sharp
      • MS Project Training
      • Silverlight
      • SQL Server 2005
      • VB.NET 2005
    • Learn Networking
      • Networking
      • Wireless
    • Learn Oracle
      • Oracle 10g
      • PL/SQL
      • Oracle 11g Tutorials
      • Oracle 9i
      • Oracle Apps
    • Learn Programming
      • Ajax Tutorial
      • C Language
      • C++ Tutorials
      • CSS Tutorial
      • CSS3 Tutorial
      • JavaScript Tutorial
      • jQuery Tutorial
      • MainFrame
      • PHP Tutorial
      • VBScript Tutorial
      • XML Tutorial
    • Learn Software Testing
      • Software Testing Types
      • SQA
      • Testing
  • Career Training
    • Career Improvement
      • Career Articles
      • Certification Articles
      • Conflict Management
      • Core Skills
      • Decision Making
      • Entrepreneurship
      • Goal Setting
      • Life Skills
      • Performance Development
      • Personal Excellence
      • Personality Development
      • Problem Solving
      • Relationship Management
      • Self Confidence
      • Self Supervision
      • Social Networking
      • Strategic Planning
      • Time Management
    • Education Help
      • Career Tracks
      • Essay Writing
      • Internship Tips
      • Online Education
      • Scholarships
      • Student Loans
    • Managerial Skills
      • Business Communication
      • Business Networking
      • Facilitator Skills
      • Managing Change
      • Marketing Management
      • Meeting Management
      • Process Management
      • Project Management
      • Project Management Life Cycle
      • Project Management Process
      • Project Risk Management
      • Relationship Management
      • Task Management
      • Team Building
      • Virtual Team Management
    • Essential Life Skills
      • Anger Management
      • Anxiety Management
      • Attitude Development
      • Coaching and Mentoring
      • Emotional Intelligence
      • Stress Management
      • Positive Thinking
    • Communication Skills
      • Conversation Skills
      • Cross Culture Competence
      • English Vocabulary
      • Listening Skills
      • Public Speaking Skills
      • Questioning Skills
    • Soft Skills
      • Assertive Skills
      • Influence Skills
      • Leadership Skills
      • Memory Skills
      • People Skills
      • Presentation Skills
    • Finding a Job
      • Etiquette Tips
      • Group Discussions
      • HR Interviews
      • Interview Notes
      • Job Search Tips
      • Resume Tips
      • Sample Resumes
 

Data Warehouse Overview

By Exforsys | on August 14, 2006 |
Data Warehousing

Data Warehouse Overview

The word data warehouse was first developed by Bill Inmon in the early 1990s. He referred to it as being a integrated collection of information that could help companies and organizations make better decisions.

To be effective, a data warehouse had to be integrated, subject oriented, non-volatile, and time variant. In this article, I will go over all these factors in detail. If you are building a data warehouse, it is important for you to understand why they are important.

Being subject oriented means that the data will provide information about a specific subject rather than the information about the functions of a company. Because a data warehouse is subject oriented, it will allow you to analyze information that is connected to a specific subject. Being integrated means that the data that is collected within the data warehouse can come from different sources, but can be combined into one unit that is relevant and logical. Having a time-variant means that all the information within the data warehouse can be found with a given period of time.

It is important that the information contained within a data warehouse is stable. While data can be added, it should never be deleted. This property is referred to as being non-volatile. When a company uses a data warehouse that is stable, this will allow them to get a better understanding of the operations within their company. Despite the fact that these terms were first coined in the the 1990s, they are still highly accurate today. However, it should be noted that some data warehouses are volatile. The reason for this is because many modern data warehouses deal with terabytes of data.

Because they must store terabytes of data, many companies are forced to delete some of their information after a certain period of time. For instance, some companies will systematically delete data that has reached three years of age. Before a data warehouse can be built, the correct data must be located. Generally, the information that will be added to the warehouse will come from daily information or historical information. The historical information may be stored in a legacy system, and is challenging to extract.

The design of the data warehouse is important as well. It is important for designers to make sure the design is consistent with the queries that will be conducted within the warehouse. To do this successfully, it is important for designers to understand the database schema. It is crucial to make sure the data warehouse is designed correctly, as it is difficult to recreate some forms of data. Another important aspect of data warehouses is data acquisition. Data acquisition can be defined as transferring data from a source to the warehouse. Data acquisition is one of the most expensive parts of building a data warehouse. This process will often be conducted with an ETL tool.

As of this time, there are just over 50 ETL tools being sold. It may cost a company millions of dollars in order to transfer data from sources to the warehouse. Once the initial data has been transferred to the data warehouse, the process must be repeated consistently. Data acquisition is a continous process, and the goal of a company is to make sure the warehouse is updated on a regular basis. When the warehouse is updated, it is often hard to determine which information in the source has changed since the previous update. The process of dealing with this issue is called changed data capture. This process has become a separate field, and there are a number of products currently be sold to deal with it.

It is important for data to be cleaned before it can be placed in the warehouse. The data cleansing process is usually done during the data acquisition phase. Any data that is placed in a warehouse before being clean will pose a danger to the system, and it cannot be used.

The reason for this is because the data may not be correct if it is not cleaned, and a company may make incorrect decisions based on it. This could lead to a number of problems. For example, all the information within a data warehouse that means the same thing must be stored in the same form. If there is information that reads "MS" and "Microsoft," even though they mean the same thing, only one of them can be used to recognize the element within the data warehouse.

« « What You Should Know About Publishing Your Essays
Creative Problem Solving » »

Author Description

Avatar

Editorial Team at Exforsys is a team of IT Consulting and Training team led by Chandra Vennapoosa.

Free Training

RSSSubscribe 394 Followers
  • Popular
  • Recent
  • How Does a Data Warehouse Differ From a Database

    February 11, 2007 - 0 Comment
  • How To Use Data Warehouses Strategically

    September 11, 2006 - 0 Comment
  • The Benefits of Data Warehouses

    February 4, 2007 - 0 Comment
  • Creating an Efficient Process for Data Warehouses

    February 13, 2007 - 0 Comment
  • How To Manage Current and Historical Information Within Your Data Warehouse

    September 11, 2006 - 0 Comment
  • The Disadvantages of a Data Warehouse

    February 4, 2007 - 0 Comment
  • Understanding Quality Management For Data Warehouses

    February 13, 2007 - 0 Comment
  • How To Properly Manage a Data Warehouse

    September 16, 2006 - 0 Comment
  • Rules to Use With Your Data Warehouse

    February 5, 2007 - 0 Comment
  • Data Warehousing Introduction

    July 30, 2006 - 0 Comment
  • Data Warehouses Non Technical Issues

    June 23, 2007 - 0 Comment
  • How To Evaluate The Software For your Data Warehouse

    February 15, 2007 - 0 Comment
  • Understanding The Challenges of Using Data Warehouses

    February 15, 2007 - 0 Comment
  • Understanding Quality Management For Data Warehouses

    February 13, 2007 - 0 Comment
  • Creating an Efficient Process for Data Warehouses

    February 13, 2007 - 0 Comment
  • How Does a Data Warehouse Differ From a Database

    February 11, 2007 - 0 Comment
  • How Data Is Stored Within a Data Warehouse

    February 11, 2007 - 0 Comment
  • How To Rate Your Data Warehouse

    February 10, 2007 - 0 Comment
  • What You Should Know About Building a Data Warehouse

    February 10, 2007 - 0 Comment
  • Fundamental Themes For Your Data Warehouse

    February 8, 2007 - 0 Comment

Exforsys e-Newsletter

ebook
 

Related Articles

  • Data Warehouses Non Technical Issues
  • How To Evaluate The Software For your Data Warehouse
  • Understanding The Challenges of Using Data Warehouses
  • Understanding Quality Management For Data Warehouses
  • Creating an Efficient Process for Data Warehouses

Latest Articles

  • Project Management Techniques
  • Product Development Best Practices
  • Importance of Quality Data Management
  • How to Maximize Quality Assurance
  • Utilizing Effective Quality Assurance Strategies
  • Sitemap
  • Privacy Policy
  • DMCA
  • Trademark Information
  • Contact Us
© 2023. All Rights Reserved.IT Training and Consulting
This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish.AcceptReject Read More
Privacy & Cookies Policy

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT