Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Stream data load prediction for resource scaling using online support vector regression - MaRDI portal

Stream data load prediction for resource scaling using online support vector regression (Q2632511)

From MaRDI portal
scientific article
Language Label Description Also known as
English
Stream data load prediction for resource scaling using online support vector regression
scientific article

    Statements

    Stream data load prediction for resource scaling using online support vector regression (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    14 May 2019
    0 references
    Summary: A distributed data stream processing system handles real-time, changeable and sudden streaming data load. Its elastic resource allocation has become a fundamental and challenging problem with a fixed strategy that will result in waste of resources or a reduction in QoS (quality of service). Spark Streaming as an emerging system has been developed to process real time stream data analytics by using micro-batch approach. In this paper, first, we propose an improved SVR (support vector regression) based stream data load prediction scheme. Then, we design a spark-based maximum sustainable throughput of time window (MSTW) performance model to find the optimized number of virtual machines. Finally, we present a resource scaling algorithm TWRES (time window resource elasticity scaling algorithm) with MSTW constraint and streaming data load prediction. The evaluation results show that TWRES could improve resource utilization and mitigate SLA (service level agreement) violation.
    0 references
    streaming processing
    0 references
    dynamic prediction
    0 references
    auto-scaling
    0 references
    online support vector regression
    0 references
    time window maximum throughput
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references