On Data Warehousing - Inmon - Corporate Information Factory (CIF) Overview


  Originally written on April 12, 2007; migrated here on October 15, 2009.


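Translation is always painful work. When I see someone else's poor translation I quietly curse it, and only when I translate something myself do I understand how painful it is...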

As for DW2.0 and CIF, I have no strong intuition about either. In the end, a data warehouse is still a data warehouse...

 

Corporate Information Factory (CIF) Overview

[Figure: The Corporate Information Factory and the Web Environment (CIF)]


Summary Descriptions

Operational Systems are the internal and external core systems that support the day-to-day business operations. They are accessed through application program interfaces (APIs) and are the source of data for the data warehouse and operational data store. (Encompasses all operational systems including ERP, relational and legacy.)

Data Acquisition is the set of processes that capture, integrate, transform, cleanse, reengineer and load source data into the data warehouse and operational data store. Data reengineering is the process of investigating, standardizing and providing clean consolidated data.
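To make these steps a little more concrete, here is a minimal Python sketch of capturing rows from two sources, standardizing them and consolidating duplicates before loading. The record layout, the field names and the `standardize` and `acquire` helpers are my own assumptions for illustration. They are not part of the CIF definition.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CustomerRecord:
    """Hypothetical consolidated record; the field names are illustrative."""
    customer_id: str
    name: str
    country: str


def standardize(raw: dict) -> CustomerRecord:
    """Transform and cleanse one raw source row into the common format."""
    return CustomerRecord(
        customer_id=str(raw["id"]).strip(),
        name=" ".join(str(raw["name"]).split()).title(),
        country=str(raw.get("country", "")).strip().upper(),
    )


def acquire(*sources: list) -> list:
    """Capture rows from every source, standardize them, and keep one
    clean record per customer_id (the last source seen wins)."""
    consolidated = {}
    for source in sources:                              # capture
        for raw in source:
            record = standardize(raw)                   # transform + cleanse
            consolidated[record.customer_id] = record   # integrate
    return list(consolidated.values())                  # rows ready to load


# Two assumed source systems that describe the same customer differently.
erp_rows = [{"id": " 42 ", "name": "ada   lovelace", "country": "gb"}]
legacy_rows = [
    {"id": 42, "name": "Ada Lovelace", "country": "GB "},
    {"id": 7, "name": "alan turing"},
]
warehouse_rows = acquire(erp_rows, legacy_rows)
```

Letting the last source win is only one possible consolidation rule. A real acquisition layer would more likely rank sources by trust or by record timestamp.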

The Data Warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data used to support the strategic decision-making process for the enterprise. It is the central point of data integration for business intelligence and is the source of data for the data marts, delivering a common view of enterprise data.

Primary Storage Management consists of the processes that manage data within and across the data warehouse and operational data store. It includes processes for backup and recovery, partitioning, summarization, aggregation, and archival and retrieval of data to and from alternative storage.

Alternative Storage is the set of devices used to cost-effectively store data warehouse and exploration warehouse data that is needed but not frequently accessed. These devices are less expensive than disks and still provide adequate performance when the data is needed.
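The archival and retrieval between primary and alternative storage can be sketched roughly as follows. Both stores are modelled as plain dictionaries, and the `last_access` field, the one-year idle threshold and the function names are assumptions I made up for the example.

```python
from datetime import date, timedelta


def archive_cold_rows(primary, alternative, today, max_idle_days=365):
    """Move rows not accessed within `max_idle_days` to alternative storage."""
    cutoff = today - timedelta(days=max_idle_days)
    cold_keys = [k for k, row in primary.items() if row["last_access"] < cutoff]
    for key in cold_keys:
        alternative[key] = primary.pop(key)


def fetch(key, primary, alternative, today):
    """Read a row, retrieving it from alternative storage if it was archived."""
    if key not in primary:
        primary[key] = alternative.pop(key)
    primary[key]["last_access"] = today
    return primary[key]["value"]


primary = {
    "order-1": {"value": 100, "last_access": date(2005, 1, 1)},
    "order-2": {"value": 250, "last_access": date(2007, 3, 1)},
}
alternative = {}
today = date(2007, 4, 12)

archive_cold_rows(primary, alternative, today)
archived_keys = sorted(alternative)   # keys that moved to the cheaper tier
value = fetch("order-1", primary, alternative, today)  # transparently restored
```

The caller of `fetch` never needs to know which tier held the row, which matches the idea that the data is still available when it is needed.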

Data Delivery is the set of processes that enable end users and their supporting IS group to build and manage views of the data warehouse within their data marts. It involves a three-step process consisting of filtering, formatting and delivering data from the data warehouse to the data marts.
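The three steps of filtering, formatting and delivering could look something like the sketch below. The row layout (`region`, `product`, `amount`) and the choice to summarize by product are assumptions for the sake of the example. A real data mart would be a database, not a Python list.

```python
def filter_rows(warehouse_rows, region):
    """Step 1: keep only the rows the business unit needs."""
    return [row for row in warehouse_rows if row["region"] == region]


def format_rows(rows):
    """Step 2: reshape detail rows into the mart's summarized layout."""
    totals = {}
    for row in rows:
        totals[row["product"]] = totals.get(row["product"], 0) + row["amount"]
    return [{"product": p, "total_amount": t} for p, t in sorted(totals.items())]


def deliver(rows, data_mart):
    """Step 3: load the formatted rows into the target data mart."""
    data_mart.extend(rows)
    return data_mart


warehouse = [
    {"region": "EMEA", "product": "widget", "amount": 10},
    {"region": "EMEA", "product": "gadget", "amount": 5},
    {"region": "APAC", "product": "widget", "amount": 7},
    {"region": "EMEA", "product": "widget", "amount": 3},
]
emea_mart = deliver(format_rows(filter_rows(warehouse, "EMEA")), [])
```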

The Data Mart is customized and/or summarized data derived from the data warehouse and tailored to support the specific analytical requirements of a business unit or function. It utilizes a common enterprise view of strategic data and provides business units more flexibility, control and responsibility. The data mart may or may not be on the same server or location as the data warehouse.

The Operational Data Store (ODS) is a subject-oriented, integrated, current, volatile collection of data used to support the tactical decision-making process for the enterprise. It is the central point of data integration for business management, delivering a common view of enterprise data.
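The difference between "current, volatile" (ODS) and "time-variant, non-volatile" (data warehouse) is easier to see in code. The two toy classes below are only a sketch under my own assumptions: the class names, method names and the in-memory storage are invented for illustration and do not come from the CIF material.

```python
from datetime import date


class OperationalDataStore:
    """Current and volatile: an update overwrites the previous value."""

    def __init__(self):
        self._current = {}

    def upsert(self, key, value):
        self._current[key] = value

    def get(self, key):
        return self._current[key]


class DataWarehouse:
    """Time-variant and non-volatile: every load appends a dated snapshot."""

    def __init__(self):
        self._history = []

    def load(self, key, value, as_of):
        self._history.append((as_of, key, value))

    def as_of(self, key, when):
        """Return the latest value recorded on or before `when`, else None."""
        matches = [(d, v) for d, k, v in self._history if k == key and d <= when]
        return max(matches, key=lambda m: m[0])[1] if matches else None


ods = OperationalDataStore()
dw = DataWarehouse()

# The same customer moves from Paris to Berlin.
ods.upsert("cust-1", "Paris")
dw.load("cust-1", "Paris", date(2007, 1, 1))
ods.upsert("cust-1", "Berlin")
dw.load("cust-1", "Berlin", date(2007, 4, 1))

current_city = ods.get("cust-1")                     # only the latest state
city_in_feb = dw.as_of("cust-1", date(2007, 2, 1))   # history is preserved
```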

Meta Data Management is the process for managing information needed to promote data legibility, use and administration. Contents are described in terms of data about data, activity and knowledge.

The Exploration Warehouse is a DSS architectural structure whose purpose is to provide a safe haven for exploratory and ad hoc processing. An exploration warehouse utilizes data compression to provide fast response times with the ability to access the entire database.

The Data Mining Warehouse is an environment created so analysts may test their hypotheses, assertions and assumptions developed in the exploration warehouse. Specialized data mining tools containing intelligent agents are used to perform these tasks.

Activities are the events captured by the enterprise legacy and/or ERP systems as well as external transactions such as Internet interactions.

Statistical Applications are set up to perform complex, difficult statistical analyses such as exception, means, average and pattern analyses. The data warehouse is the source of data for these analyses. These applications analyze massive amounts of detailed data and require a reasonably performing environment.
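As a very small illustration of mean and exception analysis over detailed data, here is a sketch using Python's standard `statistics` module. The z-score rule with a threshold of 2.0 and the sample figures are my own assumptions. The source does not prescribe any particular method.

```python
from statistics import mean, pstdev


def exceptions(values, threshold=2.0):
    """Return values whose distance from the mean exceeds `threshold`
    population standard deviations (a simple z-score rule)."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return []  # all values identical, nothing stands out
    return [v for v in values if abs(v - mu) / sigma > threshold]


# Hypothetical daily sales figures pulled from the warehouse.
daily_sales = [100, 102, 98, 101, 99, 100, 250]
average = mean(daily_sales)
outliers = exceptions(daily_sales)
```

On real warehouse volumes this kind of calculation would normally run inside the database or a dedicated statistics package, not over an in-memory list.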

Analytic Applications are pre-designed, ready-to-install, decision support applications. They generally require some customization to fit the specific requirements of the enterprise. The source of data is the data warehouse. Examples of these applications are risk analysis, database marketing (CRM) analyses, vertical industry "data marts in a box," etc.

External Data is any data outside the normal data collected through an enterprise's internal applications. There can be any number of sources of external data such as demographic, credit, competitor and financial information. Generally, external data is purchased by the enterprise from a vendor of such information.
