Gradient methods for optimizing metaparameters in the knowledge distillation problem
From MaRDI portal
Publication:2689575
DOI10.1134/S00051179220100071zbMath1506.68092OpenAlexW4312589547MaRDI QIDQ2689575
M. Gorpinich, O. Yu. Bakhteev, Vadim V. Strijov
Publication date: 13 March 2023
Published in: Automation and Remote Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1134/s00051179220100071
machine learninggradient optimizationknowledge distillationmetaparameter assignmentmetaparameter optimization
Artificial neural networks and deep learning (68T07) Applications of mathematical programming (90C90) Learning and adaptive systems in artificial intelligence (68T05)
Uses Software
Cites Work
This page was built for publication: Gradient methods for optimizing metaparameters in the knowledge distillation problem