What Is A Data Warehouse?

Published: Jun 07, 2023

A data warehouse is typically used for data mining and business intelligence. Its scope can be all-encompassing or limited to a single subject. In recent years, enterprise e-business has moved beyond single-system concerns such as whether a process runs smoothly or whether transaction records are stored completely. Enterprises now pay more attention to integrating heterogeneous information systems and to collecting and presenting data effectively, both of which have an increasingly concrete impact on operating efficiency. The term "data warehouse" borrows from physical warehousing: where a physical warehouse stores raw materials and finished products, a data warehouse consolidates the abstract file data scattered across information systems into a single physical data store.

The Difference Between Database, Data Warehouse and Data Warehouse System

A data warehouse is a database that stores large amounts of data, but it is not the same as an ordinary database. An operational database stores data tied to day-to-day operations. Once that data has accumulated for a period of time, it is organized and transferred to a separate system, the data warehouse, for analysis. "Data warehouse" usually refers to the database that stores the integrated data, while "data warehouse system" generally refers to the entire decision support system, including software, hardware, data, and reports.
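The transfer from an operational database to a warehouse can be sketched as follows. This is a minimal illustration using Python's built-in `sqlite3` module, not a production design; the table names `orders` and `fact_orders`, the sample rows, and the `load_orders` helper are all hypothetical.

```python
import sqlite3

# Hypothetical operational database: holds the live order records.
operational = sqlite3.connect(":memory:")
operational.execute(
    "CREATE TABLE orders ("
    "order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL, order_date TEXT)"
)
operational.executemany(
    "INSERT INTO orders VALUES (?, ?, ?, ?)",
    [
        (1, "alice", 120.0, "2023-05-01"),
        (2, "bob", 80.0, "2023-05-01"),
        (3, "alice", 50.0, "2023-05-02"),
    ],
)

# Hypothetical warehouse: a separate database used only for analysis.
warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE fact_orders ("
    "order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL, order_date TEXT)"
)


def load_orders(source, target, up_to_date):
    """Copy orders dated on or before up_to_date into the warehouse.

    Rows already present in the warehouse are skipped (INSERT OR IGNORE).
    Returns the number of rows read from the source.
    """
    rows = source.execute(
        "SELECT order_id, customer, amount, order_date "
        "FROM orders WHERE order_date <= ?",
        (up_to_date,),
    ).fetchall()
    target.executemany("INSERT OR IGNORE INTO fact_orders VALUES (?, ?, ?, ?)", rows)
    target.commit()
    return len(rows)


load_orders(operational, warehouse, "2023-05-02")

# Analysis runs against the warehouse, not the operational system.
daily_totals = warehouse.execute(
    "SELECT order_date, SUM(amount) FROM fact_orders "
    "GROUP BY order_date ORDER BY order_date"
).fetchall()
print(daily_totals)
```

Keeping the analytical query on the warehouse side means reporting does not compete with the operational system for resources, which is the separation the paragraph above describes.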

The term "data warehouse" was coined by Bill Inmon in 1990, and he is known as the father of the data warehouse. In the book "What is a Data Warehouse", he describes the data collected in a data warehouse as having four characteristics: subject-oriented, integrated, time-variant, and non-volatile. With these characteristics, a data warehouse can supply data to management decision-making systems. Another leading figure in the field, Ralph Kimball, defines a data warehouse in "The Data Warehouse Toolkit" as a copy of transaction data specifically structured for query and analysis.

"Subject-oriented" means that the data warehouse gathers information related to a specific subject, not just the company's current operating information. "Integrated" means that the data in the warehouse is merged from different sources and kept consistently organized. "Time-variant" means that the warehouse identifies each piece of stored data with a specific point in time. "Non-volatile" means that data in the warehouse only accumulates and is not removed, which lets management observe the business continuously over time.

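The time-variant and non-volatile characteristics can be illustrated with an append-only snapshot table. The sketch below is only one way to model them; the `customer_snapshot` table, the customer tiers, and both helper functions are hypothetical names chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Each row is a snapshot of a customer's tier as of a given date (time-variant).
conn.execute(
    "CREATE TABLE customer_snapshot ("
    "customer TEXT, tier TEXT, snapshot_date TEXT, "
    "PRIMARY KEY (customer, snapshot_date))"
)


def record_snapshot(customer, tier, snapshot_date):
    """Append a new snapshot; existing rows are never updated or deleted (non-volatile)."""
    conn.execute(
        "INSERT INTO customer_snapshot VALUES (?, ?, ?)",
        (customer, tier, snapshot_date),
    )


def tier_as_of(customer, date):
    """Return the tier from the latest snapshot on or before the given date, or None."""
    row = conn.execute(
        "SELECT tier FROM customer_snapshot "
        "WHERE customer = ? AND snapshot_date <= ? "
        "ORDER BY snapshot_date DESC LIMIT 1",
        (customer, date),
    ).fetchone()
    return row[0] if row else None


record_snapshot("alice", "basic", "2023-01-01")
record_snapshot("alice", "gold", "2023-04-01")

print(tier_as_of("alice", "2023-02-15"))
print(tier_as_of("alice", "2023-06-01"))
```

Because the earlier "basic" row is kept rather than overwritten, the same table can answer questions about both the current state and any past state.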
Types of Data Warehouse

Data warehouses can be divided into three types: the enterprise data warehouse (EDW), the operational data store (ODS), and the data mart. Some classifications add two more alongside the enterprise data warehouse and the data mart: the virtual data warehouse and the hybrid data warehouse.

  1. Enterprise Data Warehouse

    The enterprise data warehouse contains information for the entire enterprise and is organized into several subjects, such as customer, product, and business. It can be used for decision support and includes both real-time and aggregated information.

  2. Operational Data Store

    "Operational" is defined in contrast to the informational nature of a data warehouse. An ODS provides detailed data, particularly recently consolidated data, and can meet the needs of real-time reporting. It can only analyze very recent data and cannot support analysis of longer-term history. Bill Inmon published "The Operational Data Store" in 1995, describing the data in an ODS as subject-oriented and integrated. Unlike a data warehouse, however, the data in an ODS is volatile: it mainly holds current values and does not contain historical or cumulative data. ODS data can be collected and integrated in real time, and ODSs are graded according to how frequently their data is synchronized, which determines the transfer and storage schedule.

  3. Data Mart

    A data mart is defined in much the same way as a data warehouse. The difference is scope: a data warehouse covers the data and personnel of the entire company, while a data mart contains only a specific range of data and targets the members of a particular work group. A group of data marts can make up an enterprise data warehouse, and an enterprise data warehouse can be divided into data marts. If a company runs several data marts side by side and they define the same dimension differently, the data marts turn into data islands. Data islands are a serious problem for the enterprise as a whole: integration is limited to the departmental group and cannot extend to company-wide information, so cross-departmental analysis is impossible and data from different job functions cannot be linked. The existing data marts can only keep accumulating side by side and cannot be integrated.

    Today, data warehouse construction still mostly starts with data marts, because the dimensional model used by data marts is easier to understand than the entity-relationship model and supports faster analysis. The choice still depends on the needs of the enterprise and its users.

  4. Virtual Data Warehouse

    The enterprise works directly with its existing operational databases, aided by intermediary tools that process the data effectively. This approach is quicker to build, has a high chance of success, and supports real-time data analysis.

  5. Hybrid Data Warehouse

    If the data marts are presented as virtual data warehouses, the result is a hybrid data warehouse. It requires less storage space than an enterprise data warehouse. Because the data already resides in a standardized data environment, reorganizing it is simpler than reading operational data through application programs, and it does not affect the operational data. A hybrid data warehouse can also address the data island problem encountered with data marts and can respond flexibly to different needs through its virtual layer.

Benefits of Data Warehouse

A data warehouse integrates data across sources, so that data in different databases can be linked together. Building individual information systems does solve the need for routine output and immediate storage of data. But once an enterprise wants to retrieve integrated statistics from those systems, it runs into the problem of disparate data sources: it cannot access several systems at once, and further automated processing and analysis is not possible. A data warehouse can serve as a single window for extracting data, and automatic conversion by the information system reduces the errors that arise when files are exchanged manually.
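Cross-source integration of this kind can be sketched in a few lines. The example below assumes two hypothetical source systems (a CRM and a billing system) that name and format the customer key differently; the field names, sample records, and the `normalize_id` and `integrate` helpers are invented for illustration.

```python
from datetime import datetime

# Hypothetical extracts from two separate systems with different field names and formats.
crm_rows = [{"cust_id": "C001", "name": "Alice Chen", "signup": "2023/01/15"}]
billing_rows = [
    {"customer": "c001", "total_billed": "350.50"},
    {"customer": "c002", "total_billed": "90.00"},
]


def normalize_id(raw_id):
    """Apply one agreed definition of the customer key across all sources."""
    return raw_id.strip().upper()


def integrate(crm, billing):
    """Merge both sources into one record per customer, keyed by the normalized ID."""
    merged = {}
    for row in crm:
        key = normalize_id(row["cust_id"])
        record = merged.setdefault(key, {"customer_id": key})
        record["name"] = row["name"]
        # Convert the source-specific date format to ISO 8601.
        record["signup_date"] = (
            datetime.strptime(row["signup"], "%Y/%m/%d").date().isoformat()
        )
    for row in billing:
        key = normalize_id(row["customer"])
        record = merged.setdefault(key, {"customer_id": key})
        record["total_billed"] = float(row["total_billed"])
    return merged


customers = integrate(crm_rows, billing_rows)
for record in customers.values():
    print(record)
```

Agreeing on a single key definition, as `normalize_id` does here, is also what keeps separate data marts from drifting into data islands: records from different departments can only be linked if they define the shared dimension the same way.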

Summary

Data warehouses initially only needed to support the review of aggregated data. Later, individual transaction records were also kept in the warehouse so that the relationship between customer groups and products could be analyzed. Today, in addition to aggregate and transaction data, data warehouses retain detailed data for analyzing customers' shopping behavior.

This progression shows that companies used to want to know only their total turnover, but now care more about how customers make choices during the transaction process.
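The difference between the two kinds of question can be seen by querying the same data at two levels of detail. This is a small sketch with a hypothetical `sales` table and made-up rows, using Python's built-in `sqlite3` module.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (customer TEXT, product TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [
        ("alice", "tea", 5.0),
        ("alice", "tea", 5.0),
        ("alice", "coffee", 8.0),
        ("bob", "coffee", 8.0),
    ],
)

# Aggregate view: only the total turnover.
total_turnover = conn.execute("SELECT SUM(amount) FROM sales").fetchone()[0]
print(total_turnover)

# Detailed view: what each customer chose, and how often.
purchases = conn.execute(
    "SELECT customer, product, COUNT(*) FROM sales "
    "GROUP BY customer, product ORDER BY customer, product"
).fetchall()
print(purchases)
```

The second query is only possible because the individual transaction rows were retained rather than just the totals.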

Data warehouses are often discussed alongside data mining and business intelligence. In marketing, a data warehouse helps a company understand customer habits and predict customer behavior in order to run appropriate promotions. Internally, it supports the evaluation of operations, allowing senior executives to identify the root causes of poor performance from specific data and evidence.

Source: iThome
