Stream data load prediction for resource scaling using online support vector regression (Q2632511)
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| English | Stream data load prediction for resource scaling using online support vector regression | scientific article | |
Statements
Stream data load prediction for resource scaling using online support vector regression (English)
14 May 2019
Summary: A distributed data stream processing system must handle real-time streaming data loads that change over time and can surge suddenly. Elastic resource allocation for such a system is a fundamental and challenging problem, because a fixed allocation strategy results in either wasted resources or reduced QoS (quality of service). Spark Streaming is an emerging system that performs real-time stream data analytics using a micro-batch approach. In this paper, we first propose a stream data load prediction scheme based on an improved SVR (support vector regression). We then design a Spark-based performance model, the maximum sustainable throughput of time window (MSTW), to find the optimal number of virtual machines. Finally, we present TWRES (time window resource elasticity scaling algorithm), a resource scaling algorithm that combines the MSTW constraint with streaming data load prediction. The evaluation results show that TWRES can improve resource utilization and mitigate SLA (service level agreement) violations.
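The pipeline the summary describes (predict the next load, then size the cluster against a throughput limit) can be illustrated with a minimal sketch. This is not the paper's improved SVR, MSTW model, or TWRES algorithm, whose details are not given here. As a stand-in, it uses a plain linear regressor with an epsilon-insensitive loss trained by stochastic gradient descent, and it assumes that sustainable throughput grows linearly with the number of virtual machines. All names and parameter values (`OnlineLinearSVR`, `vms_needed`, `per_vm_throughput`, the lag count, the VM limits) are hypothetical.

```python
import math
from collections import deque


class OnlineLinearSVR:
    """Stand-in predictor: linear model with epsilon-insensitive loss,
    updated by one SGD step per sample (not the paper's improved SVR)."""

    def __init__(self, n_lags, epsilon=0.05, lr=0.05, l2=1e-4):
        self.w = [0.0] * n_lags   # one weight per lagged load value
        self.b = 0.0
        self.epsilon = epsilon    # half-width of the no-penalty tube
        self.lr = lr              # learning rate
        self.l2 = l2              # weight-decay strength

    def predict(self, x):
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b

    def update(self, x, y):
        err = self.predict(x) - y
        # Subgradient of the epsilon-insensitive loss: zero inside the tube.
        g = 0.0 if abs(err) <= self.epsilon else math.copysign(1.0, err)
        self.w = [wi - self.lr * (g * xi + self.l2 * wi)
                  for wi, xi in zip(self.w, x)]
        self.b -= self.lr * g


def vms_needed(predicted_load, per_vm_throughput, min_vms=1, max_vms=32):
    """Smallest VM count whose combined throughput covers the predicted load.

    Assumes throughput scales linearly with VM count (hypothetical)."""
    need = math.ceil(max(predicted_load, 0.0) / per_vm_throughput)
    return max(min_vms, min(max_vms, need))


def run(loads, n_lags=3, scale=1000.0, per_vm_throughput=250.0):
    """Process loads (records/s) one time window at a time.

    Returns a list of (predicted_load, vm_count) pairs, one per window
    once n_lags past observations are available."""
    model = OnlineLinearSVR(n_lags)
    window = deque(maxlen=n_lags)
    plan = []
    for load in loads:
        y = load / scale                     # normalise for stable SGD steps
        if len(window) == n_lags:
            x = list(window)
            pred = model.predict(x) * scale  # forecast before seeing the load
            plan.append((pred, vms_needed(pred, per_vm_throughput)))
            model.update(x, y)               # then learn from the true value
        window.append(y)
    return plan
```

Calling `run(observed_loads)` on a list of per-window arrival rates yields one forecast and one VM count per window. In a real deployment the per-VM throughput would have to be measured rather than assumed, and the predictor would be replaced by the scheme the paper proposes.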
streaming processing
dynamic prediction
auto-scaling
online support vector regression
time window maximum throughput