Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Apache Spark - MaRDI portal

Apache Spark

From MaRDI portal
Software:40132



swMATH28418MaRDI QIDQ40132


No author found.

Source code repository: https://github.com/apache/spark




Related Items (72)

Equivalence classes and conditional hardness in massively parallel computationsComputation Against a Neighbour: Addressing Large-Scale Distribution and Adaptivity with Functional Programming and ScalaGSA for machine learning problems: a comprehensive overviewSemantic Foundations for Deterministic Dataflow and Stream ProcessingFull likelihood inference from the site frequency spectrum based on the optimal tree resolutionNovel data-driven method for non-probabilistic uncertainty analysis of engineering structures based on ellipsoid modelLove and Hate During Political Campaigns in Social NetworksTemporal concatenation for Markov decision processesMuLOT: multi-level optimization of the canonical polyadic tensor decomposition at large-scaleTriclustering in Big Data SettingA cloud computing-based intelligent forecasting method for cross-border e-commerce logistics costsGEODIS: towards the optimization of data locality-aware job scheduling in geo-distributed data centersIterative selection of categorical variables for log data anomaly detectionStatistical challenges of big brain network dataScheduling Parallel-Task Jobs Subject to Packing and Placement ConstraintsFregel: a functional domain-specific language for vertex-centric large-scale graph processingComputational fluid dynamics simulation based on hadoop ecosystem and heterogeneous computingLeast-Square Approximation for a Distributed SystemA Novel Hybrid Sampling Algorithm for Solving Class Imbalance Problem in Big DataBoosting evolutionary algorithm configurationUnnamed ItemLarge scale implementations for Twitter sentiment classificationA novel interval-valued data driven type-2 possibilistic local information c-means clustering for land cover classificationPrivacy-preserving computation in cyber-physical-social systems: a survey of the state-of-the-art and perspectivesA distributed \(K\)-means segmentation algorithm applied to \textit{Lobesia botrana} recognitionScaling up Bayesian variational inference using distributed computing clustersA survey on the distributed computing stackDistribution Policies for Datalog.Parallel Weighted Random SamplingKATZ centrality with biogeography-based optimization for influence maximization problemModern Datalog EnginesBig data: from collection to visualizationA safe reinforced feature screening strategy for Lasso based on feasible solutionsA Bayesian perspective of statistical machine learning for big dataA semi-parallel framework for greedy information-theoretic feature selectionDistributed cooperative learning over time-varying random networks using a gossip-based communication protocolUnnamed ItemPerformance Comparison of Machine Learning PlatformsRegression Neural Networks with a Highly Robust Loss FunctionAn effective and efficient MapReduce algorithm for computing BFS-based traversals of large-scale RDF graphsA greedy feature selection algorithm for big data of high dimensionality\(k\)-means, Ward and probabilistic distance-based clustering methods with contiguity constraintSpark solutions for discovering fuzzy association rules in big dataAn experience in using machine learning for short-term predictions in smart transportation systemsGenetic programming \(+\) proof search \(=\) automatic improvementFrom distributed coordination to field calculus and aggregate computingUnnamed ItemOptimal control in dynamic food supply chains using big dataWidening: using parallel resources to improve model qualityProperty-Based Testing for Spark StreamingA three-way cluster ensemble approach for large-scale dataRandomized Gradient Boosting MachineBig data time series forecasting based on pattern sequence similarity and its application to the electricity demandMinimum distance histograms with universal performance guaranteesTranslating Scala Programs to Isabelle/HOLUsing machine learning with PySpark and MLib for solving a binary classification problem: case of searching for exotic particlesMLP-ANN-based execution time prediction model and assessment of input parameters through structural modelingEvidential instance selection for \(K\)-nearest neighbor classification of big dataUser-Defined Tensor Data AnalysisA Distributed and Incremental SVD Algorithm for Agglomerative Data Analysis on Large NetworksParametric Gaussian process regression for big dataElephant against Goliath: performance of big data versus high-performance computing DBSCAN clustering implementationsDistribution policies for DatalogMining maximal frequent patterns in transactional databases and dynamic data streams: a Spark-based approachUnnamed ItemAn intuitive fuzzy approach for evaluating financial resiliency of supply chainLink prediction in multiplex networks using intralayer probabilistic distance and interlayer co-evolving factorsCombining Interval Time Series Forecasts. A First Step in a Long Way (Research Agenda)A distributed ensemble of relevance vector machines for large-scale data sets on SparkTraditional and context-specific spam detection in low resource settingsA new accelerated proximal boosting machine with convergence rate \(O(1/t^2)\)A Detailed Study of the Distributed Rough Set Based Locality Sensitive Hashing Feature Selection Technique


This page was built for software: Apache Spark