欢迎来到留学生英语论文网

当前位置:首页 > 论文范文 > Computer Science

A Proposed Data Stage Tool as ETL Tool in Data Warehouse

发布时间:2017-03-01
该论文是我们的学员投稿,并非我们专家级的写作水平!如果你有论文作业写作指导需求请联系我们的客服人员

Title of the Research Project

A proposed Data Stage tool as ETL tool in data warehouse


Executive Summary

It is possible to identify problem that will serve as a point of departure for the present research proposal. As has been noted and is clarified in the later review of literature of review section. One problem area is that since the processes in ETL (Extraction-Transformation-Loading) is complex, the implementation takes more time (Shaker H.Ali El-Sappagh, 2011). The second problem area involves while data is extracted from different data sources, many incorporate information are also extracted which cause data warehouse becomes heavy and requires processing power.

Specifically, this research with focus on two primary objective. The first objective is to improve performance of data abstraction by integrating Data Stage to ETL(Extraction-Transformation-Loading) tool . The second objective are to propose a novel approach of Data Stage tool as ETL tool.

For this study, secondary research based on recent literature related to ETL(Extraction-Transformation-Loading) tool and data warehouse will used. I used secondary research because within the recent journal I analyse the limitation of ETL(Extraction-Transformation-Loading) tool. Move over, the descriptive research will utilised. Thus this study will use the descriptive approach. I gather the information related to the studies of ETL(Extraction-Transformation-Loading) tool and data warehouse to makes use of Data Stage tool as an ETL(Extraction-Transformation-Loading) tool in data warehouse that brings some beneficial to user. Moreover, in this studies also will employ qualitative research method. I have choose qualitative research method for my research because to explain as an ETL(Extraction-Transformation-Loading) tool and the relationship of Data Stage tool as an ETL(Extraction-Transformation-Loading) tool in data warehouse.

After completion of my research activities, the findings will be helpful for the user or client. ETL(Extraction-Transformation-Loading) tool are able to integrate data efficiently when Data Stage tool are applied on ETL(Extraction-Transformation-Loading) tool. ETL(Extraction-Transformation-Loading) tool can extract, integrate and transform data even it is complex or consist large volume of data. ETL(Extraction-Transformation-Loading) tool are to be improve the speed and flexibility of data integration when Data Stage tool is applied.

The proposal research will give user an efficiency of implementation in ETL(Extraction-Transformation-Loading) tool which by having Data stage as an ETL(Extraction-Transformation-Loading) tool, it reduced maintenance with GUI tool. With Data stage that have parallel processing engine which provides unlimited performance and scalability which saves time of the user. The Data stage server performs very well on both Windows and Unix servers.

On the other hand, in business field, Data stage tool provides wide range of licensing option. In addition team communication and documentation of the jobs is supported by data flows and transformation self-documenting engine in HTML( Hypertext Mark-up Language) format. It also have ability to join data both at the source, and at the integration server and to apply any business rule within a single interface without having to write any procedure code.

Introduction

Data Stage is a tool is a software that helps ETL(Extraction-Transformation-Loading) tool to integrate data more effectively and efficiently. Data Stage able to collect, transform, validate and load data from different sources. Data Stage able to run in windows and Unix.

Justification of Research

ETL(Extraction-Transformation-Loading) tool consumes time and requires more resources and efforts. (Shaker H. Ali El-Sappagh, 2011), therefore the ETL(Extraction-Transformation-Loading) processes becomes more complexity. Data Stage tool is the one that make data transform effectively. Precisely, Data Stage tool make user easier to directly access file system in distributed data warehouse. Data Stage tool provides trustable, powerful and scalable platform to ETL(Extraction-Transformation-Loading) tool which it can collect, integrate and transform data even it is complex. ETL(Extraction-Transformation-Loading) tool itself couldn’t transform data effectively. The question that awake me in this research whether is it secure and reliable. It was secure since security controls allow user to have their “private” area which can only access by themselves and also have area that user can shared with others in a group.

Research Objectives

Specifically, this research with focus on two primary objective. The first objective is to improve performance of data abstraction by integrating Data Stage to ETL(Extraction-Transformation-Loading) tool . The second objective are to propose a novel approach of Data Stage tool as ETL(Extraction-Transformation-Loading) tool.

Literature Review

In this part, we will see some research that deals with ETL(Extraction-Transformation-Loading) tool. SIRIUS (Supporting the Incremental Refreshment of Information Warehouses) is a project developed at information technology department in Zurich University (A.VAVOURAS,"A Metadata-Driven Approach for DataWarehouse Refreshment", Phd Thesis, DER UNIVERSITÄT ZÜRICH,ZÜRICH, 2002. ). This researcher represent modelling and execution of ETL(Extraction-Transformation-Loading) by approaching metadata whereby it describe the features to implement ETL(Extraction-Transformation-Loading). Researcher use JAVA code to implement. Finally it was a successful implementation but requires sufficient amount of time. SIRIUS are also succeed in detecting changes in the database.

ARKTOS is a framework that are create to model and execute ETL(Extraction-Transformation-Loading). Indeed, ARKTOS provides primitives to capture ETL(Extraction-Transformation-Loading) tasks frequently used (P. Vassiliadis, Z.Vagena, S. Skiadopoulos, and N.Karayannidis, "ARKTOS: A Tool For Data Cleaning and Transformation in DataWarehouse Environments", Bulletin of the Technical Committee on Data Engineering, 23(4), 2000). Precisely, ETL(Extraction-Transformation-Loading) process consist of GUI( Graphical User Interface) and two language which is XADL( XML) and SADL(SQL). During execution, ARKTOS able to take action when error occur during implementation. There are few errors that can be solve by ARKTOS which is first violation of the primary key, violation of the uniqueness , violation of references , null existence for the elimination of missing values field mismatch and format mismatch related to domain errors and data format errors.

The next approaches deals with ETL(Extraction-Transformation-Loading) management. Authors represent ETL as K matrices which apply multiplication operation that explain the connection between input and output fields and how it produce in the attribute dependency. Moreover author creates an algorithm to detect if error occur in ETL (Extraction-Transformation-Loading) process when delete event occur in source or inside ETL(Extraction-Transformation-Loading). Unfortunately, the changes that occur during deletion detection are not addresses to user.

Research Methodology

The study intends to investigate of using Data Stage tool as ETL(Extraction-Transformation-Loading) tool in data warehouse. For this study, secondary research based on recent literature related to ETL(Extraction-Transformation-Loading) tool and data warehouse will used. I used secondary research because within the recent journal I analyse the limitation of ETL(Extraction-Transformation-Loading) tool. Data Stage tool have the ability to increase the performance when it corporates with ETL(Extraction-Transformation-Loading) tools because Data Stage consist of parallel framework that supports integration of data.

Move over, the descriptive research will utilised. Thus this study will use the descriptive approach. The descriptive type of research utilises observation in the study. To illustrate the descriptive type of research, (Cress Well, 1994) guided the researcher when he stated descriptive method of research is to gather information about the present existing condition. I gather the information related to the studies of ETL(Extraction-Transformation-Loading) tool and data warehouse to makes use of Data Stage tool as an ETL(Extraction-Transformation-Loading) tool in data warehouse that brings some beneficial to user. ETL(Extraction-Transformation-Loading) tool consumes more time and resources during data integration. Data Stage tool have flexible platform that can integrates all type of data which reduce the time taken.

This studies also will employ qualitative research method because it will try to find and build theories that will explain the relationship of one variable with another variable through qualitative elements in research. I have choose qualitative research method for my research because to explain as an ETL(Extraction-Transformation-Loading) tool and the relationship of Data Stage tool as an ETL(Extraction-Transformation-Loading) tool in data warehouse. Data Stage tool allow ETL(Extraction-Transformation-Loading) tool to do ETL(Extraction-Transformation-Loading) process much more efficiently because it help to improve speed and flexibility

References

  1. http://www.sciencedirect.com/science/article/pii/S131915781100019X
  2. http://www.academia.edu/4092386/Ontology-Based_Extraction-Transformation-Loading_ETL_Processes_Model_in_Data_Warehouse_Environments
  3. http://www.sciencedirect.com/science/article/pii/S2212017313001965
  4. http://www.etltools.net/etl-tools-comparison.html
  5. www.jatit.org/volumes/Vol54No2/3Vol54No2.pdf

上一篇:Performance Evaluation and Enhancement of Mobile Node Using MIK 下一篇:Mobile Ad Hoc Networking