Apache Spark
From MaRDI portal
Software:40132
Source code repository: https://github.com/apache/spark
Related Items (72)
Equivalence classes and conditional hardness in massively parallel computations
Computation Against a Neighbour: Addressing Large-Scale Distribution and Adaptivity with Functional Programming and Scala
GSA for machine learning problems: a comprehensive overview
Semantic Foundations for Deterministic Dataflow and Stream Processing
Full likelihood inference from the site frequency spectrum based on the optimal tree resolution
Novel data-driven method for non-probabilistic uncertainty analysis of engineering structures based on ellipsoid model
Love and Hate During Political Campaigns in Social Networks
Temporal concatenation for Markov decision processes
MuLOT: multi-level optimization of the canonical polyadic tensor decomposition at large-scale
Triclustering in Big Data Setting
A cloud computing-based intelligent forecasting method for cross-border e-commerce logistics costs
GEODIS: towards the optimization of data locality-aware job scheduling in geo-distributed data centers
Iterative selection of categorical variables for log data anomaly detection
Statistical challenges of big brain network data
Scheduling Parallel-Task Jobs Subject to Packing and Placement Constraints
Fregel: a functional domain-specific language for vertex-centric large-scale graph processing
Computational fluid dynamics simulation based on hadoop ecosystem and heterogeneous computing
Least-Square Approximation for a Distributed System
A Novel Hybrid Sampling Algorithm for Solving Class Imbalance Problem in Big Data
Boosting evolutionary algorithm configuration
Unnamed Item
Large scale implementations for Twitter sentiment classification
A novel interval-valued data driven type-2 possibilistic local information c-means clustering for land cover classification
Privacy-preserving computation in cyber-physical-social systems: a survey of the state-of-the-art and perspectives
A distributed \(K\)-means segmentation algorithm applied to \textit{Lobesia botrana} recognition
Scaling up Bayesian variational inference using distributed computing clusters
A survey on the distributed computing stack
Distribution Policies for Datalog.
Parallel Weighted Random Sampling
KATZ centrality with biogeography-based optimization for influence maximization problem
Modern Datalog Engines
Big data: from collection to visualization
A safe reinforced feature screening strategy for Lasso based on feasible solutions
A Bayesian perspective of statistical machine learning for big data
A semi-parallel framework for greedy information-theoretic feature selection
Distributed cooperative learning over time-varying random networks using a gossip-based communication protocol
Unnamed Item
Performance Comparison of Machine Learning Platforms
Regression Neural Networks with a Highly Robust Loss Function
An effective and efficient MapReduce algorithm for computing BFS-based traversals of large-scale RDF graphs
A greedy feature selection algorithm for big data of high dimensionality
\(k\)-means, Ward and probabilistic distance-based clustering methods with contiguity constraint
Spark solutions for discovering fuzzy association rules in big data
An experience in using machine learning for short-term predictions in smart transportation systems
Genetic programming \(+\) proof search \(=\) automatic improvement
From distributed coordination to field calculus and aggregate computing
Unnamed Item
Optimal control in dynamic food supply chains using big data
Widening: using parallel resources to improve model quality
Property-Based Testing for Spark Streaming
A three-way cluster ensemble approach for large-scale data
Randomized Gradient Boosting Machine
Big data time series forecasting based on pattern sequence similarity and its application to the electricity demand
Minimum distance histograms with universal performance guarantees
Translating Scala Programs to Isabelle/HOL
Using machine learning with PySpark and MLib for solving a binary classification problem: case of searching for exotic particles
MLP-ANN-based execution time prediction model and assessment of input parameters through structural modeling
Evidential instance selection for \(K\)-nearest neighbor classification of big data
User-Defined Tensor Data Analysis
A Distributed and Incremental SVD Algorithm for Agglomerative Data Analysis on Large Networks
Parametric Gaussian process regression for big data
Elephant against Goliath: performance of big data versus high-performance computing DBSCAN clustering implementations
Distribution policies for Datalog
Mining maximal frequent patterns in transactional databases and dynamic data streams: a Spark-based approach
Unnamed Item
An intuitive fuzzy approach for evaluating financial resiliency of supply chain
Link prediction in multiplex networks using intralayer probabilistic distance and interlayer co-evolving factors
Combining Interval Time Series Forecasts. A First Step in a Long Way (Research Agenda)
A distributed ensemble of relevance vector machines for large-scale data sets on Spark
Traditional and context-specific spam detection in low resource settings
A new accelerated proximal boosting machine with convergence rate \(O(1/t^2)\)
A Detailed Study of the Distributed Rough Set Based Locality Sensitive Hashing Feature Selection Technique