Improving Quality Of Service In XML Data Stream Processing Using Load Shedding

ResearchCommons/Manakin Repository

Improving Quality Of Service In XML Data Stream Processing Using Load Shedding

Show full item record

Title: Improving Quality Of Service In XML Data Stream Processing Using Load Shedding
Author: Dash, Ranjan
Abstract: In recent years, we have witnessed the emergence of new types of systems that deal with large volumes of streaming data. Examples include nancial data analy- sis on feeds of stock tickers, sensor-based environmental monitoring, network tra c monitoring and click stream analysis to push customized advertisements or intrusion detection. Traditional database management systems (DBMS), which are very good at managing large volumes of stored data, fall short of serving this new class of appli- cations, which require low-latency processing on live data from push-based sources. Data Stream Management Systems (DSMS) are fast emerging to address this new type of data and processing requirements. A common but challenging issue in DSMS, is to deal with unpredictable data arrival rate. Data arrival may be fast and bursty at times that surpass available system capability to handle. When input rates exceed system capacity, the Quality of Service (QoS) of system outputs falls below the acceptable levels. The problem of system overloading is more acute in XML data streams than its counterpart in rela- tional streams, as XML streams have to spend extra resources on input processing and result construction. The main focus of this thesis is to nd out suitable ways to process this high volume of data streams dealing with the spikes in data arrival gracefully, under limited or xed system resources in the XML stream context. One established method is to shed load by selectively dropping tuples under these condi- tions. This method helps to improve the observed latency of the results but degrades the answer quality. In this dissertation, we rst de ne the QoS in the context of XML stream pro- cessing and then various mechanisms to improve the QoS, specially the method of load shedding. We provide a general solution framework for implementing Load Shed- ding using Synopses, while minimizing the loss in result accuracy. Then, we present speci c situations where issue of QoS is very critical, such as cases of aggregation and join queries. Finally, we provide techniques to handle load shedding in these cases that provide high QoS in the XML data stream systems. In the nal part of this thesis, we investigate issue of processing aggregation (group-by) join queries on data streams that provide exact results and we extend our solutions to address some of the OLAP issues in data streams.
Date: 2012-07-25

Files in this item

Files Size Format View
Dash_uta_2502D_11635.pdf 1.489Mb PDF View/Open

This item appears in the following Collection(s)

Show full item record


My Account


About Us