A platform for research: civil engineering, architecture and urbanism
Highway Value Iteration Networks
Value iteration networks (VINs) enable end-to-end learning for planning tasks by employing a differentiable "planning module" that approximates the value iteration algorithm. However, long-term planning remains a challenge because training very deep VINs is difficult. To address this problem, we embed highway value iteration -- a recent algorithm designed to facilitate long-term credit assignment -- into the structure of VINs. This improvement augments the "planning module" of the VIN with three additional components: 1) an "aggregate gate," which constructs skip connections to improve information flow across many layers; 2) an "exploration module," crafted to increase the diversity of information and gradient flow in spatial dimensions; 3) a "filter gate" designed to ensure safe exploration. The resulting novel highway VIN can be trained effectively with hundreds of layers using standard backpropagation. In long-term planning tasks requiring hundreds of planning steps, deep highway VINs outperform both traditional VINs and several advanced, very deep NNs.
Highway Value Iteration Networks
Value iteration networks (VINs) enable end-to-end learning for planning tasks by employing a differentiable "planning module" that approximates the value iteration algorithm. However, long-term planning remains a challenge because training very deep VINs is difficult. To address this problem, we embed highway value iteration -- a recent algorithm designed to facilitate long-term credit assignment -- into the structure of VINs. This improvement augments the "planning module" of the VIN with three additional components: 1) an "aggregate gate," which constructs skip connections to improve information flow across many layers; 2) an "exploration module," crafted to increase the diversity of information and gradient flow in spatial dimensions; 3) a "filter gate" designed to ensure safe exploration. The resulting novel highway VIN can be trained effectively with hundreds of layers using standard backpropagation. In long-term planning tasks requiring hundreds of planning steps, deep highway VINs outperform both traditional VINs and several advanced, very deep NNs.
Highway Value Iteration Networks
Wang, Yuhui (author) / Li, Weida (author) / Faccio, Francesco (author) / Wu, Qingyuan (author) / Schmidhuber, Jürgen (author)
2024
ICML 2024
Preprint
Electronic Resource
English
Pavement Resurfacing Planning for Highway Networks: Parametric Policy Iteration Approach
Online Contents | 2007
|Value Engineering on Highway Projects
British Library Online Contents | 2003
|Delivering Best Value Highway Performance
British Library Online Contents | 2004
|Value Engineering in Highway Projects
British Library Online Contents | 2000
|Analysis of cable networks using iteration technique
Engineering Index Backfile | 1967
|