Learning heuristics for the TSP by policy gradient

From MaRDI portal

Publication:1626725

DOI: 10.1007/978-3-319-93031-2_12; OpenAlex: W2805798351; MaRDI QID: Q1626725

Alexandre Lacoste, Michel Deudon, Yossiri Adulyasak, Pierre Cournut, Louis-Martin Rousseau

Publication date: 21 November 2018

Full work available at URL: https://doi.org/10.1007/978-3-319-93031-2_12

zbMATH Keywords

neural networks; combinatorial optimization; traveling salesman; reinforcement learning; policy gradient

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05); Approximation methods and heuristics in mathematical programming (90C59); Combinatorial optimization (90C27); Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20)

Related Items (14)

Improving Variable Orderings of Approximate Decision Diagrams Using Reinforcement Learning ⋮ Learning the travelling salesperson problem requires rethinking generalization ⋮ A reinforcement learning approach to the orienteering problem with time windows ⋮ Reinforcement learning for combinatorial optimization: a survey ⋮ Guidelines for the computational testing of machine learning approaches to vehicle routing problems ⋮ Predicting the optimal period for Cyclic Hoist Scheduling Problems ⋮ The first AI4TSP competition: learning to solve stochastic routing problems ⋮ Adaptive solution prediction for combinatorial optimization ⋮ Learn and route: learning implicit preferences for vehicle routing ⋮ Deep learning-driven scheduling algorithm for a single machine problem minimizing the total tardiness ⋮ Research on improved ant colony optimization for traveling salesman problem ⋮ Generalization of machine learning for problem reduction: a case study on travelling salesman problems ⋮ Learning to Solve Large-Scale Security-Constrained Unit Commitment Problems ⋮ Neural large neighborhood search for routing problems
