Ablation Study Confirms Necessity of Dynamic Rates for RECKONING Performance

This content originally appeared on HackerNoon and was authored by The Tech Reckoning is Upon Us!

Table of Links

Abstract and 1. Introduction

A. Dataset

B. In-context Reasoning with Distractors

C. Implementation Details

D. Adaptive Learning Rate

E. Experiments with Large Language Models

D Adaptive Learning Rate

Table 8: An example of 6-hop reasoning from the CLUTRR-SG dataset.

Table 9: Example of distractors (black) and relevant knowledge (red) in the ProofWriter dataset.

Prior works [3, 4] show that a fixed learning rate shared across steps and parameters does not benefit the generalization performance of the system. Instead, [3] recommends learning a learning rate for each network layer and each adaptation step in the inner loop. The layer parameters can learn to adjust the learning rates dynamically at each step. To control the learning rate α in the inner loop adaptively, we define α as a set of adjustable variables: α = {α0, α1, …, αL}, where L is the number of layers and, for every l = 0, …, L, αl is a vector with N elements, given a pre-defined inner loop step number N. The inner loop update equation then becomes layer- and step-specific: at each step, layer l is updated with the corresponding element of αl as its learning rate.
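One way to write such a per-layer, per-step update, assuming plain gradient descent in the inner loop. The symbols θ (layer parameters), n (inner step index), and the generic inner-loop loss are notation assumed here for illustration, not taken from the text above:

```latex
\theta_l^{(n+1)} = \theta_l^{(n)} - \alpha_l^{(n)} \, \nabla_{\theta_l} \mathcal{L}_{\text{inner}}\big(\theta^{(n)}\big),
\qquad l = 0, \dots, L, \quad n = 0, \dots, N-1
```

Here α_l^{(n)} denotes the n-th element of the vector αl, so each layer has its own step size at each of the N inner steps.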

Are dynamic learning rates necessary for RECKONING’s performance? Following prior works on meta-learning [3, 4], we dynamically learn a set of per-step-per-layer learning rates for RECKONING. In this ablation study, we analyze whether dynamic learning rates for the inner loop effectively improve the outer loop reasoning performance. We fix all other experimental settings and set the number of inner loop steps to 4. As Figure 8 shows, when using a static learning rate (i.e., all layers and inner loop steps share a constant learning rate), the performance drops by a large margin (average drop of 34.2%). The drop is larger on questions requiring more reasoning hops (45.5% drop for 4-hop and 39.5% drop for 6-hop), demonstrating the importance of using a dynamic learning rate in the inner loop of our framework.
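The two conditions compared in this ablation can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the quadratic toy loss stands in for the real inner-loop objective, the function names (`toy_grad`, `inner_loop`, `static_rates`) are hypothetical, and the rate values are hand-picked here, whereas in RECKONING the dynamic rates are learned.

```python
from typing import List

Params = List[List[float]]  # one list of weights per layer
Rates = List[List[float]]   # rates[l][n]: rate for layer l at inner step n


def toy_grad(params: Params, targets: Params) -> Params:
    """Gradient of the toy loss 0.5 * sum((w - t)^2), which is w - t."""
    return [
        [w - t for w, t in zip(layer, tgt)]
        for layer, tgt in zip(params, targets)
    ]


def inner_loop(params: Params, targets: Params, rates: Rates) -> Params:
    """Run N gradient steps, using rates[l][n] for layer l at step n."""
    num_steps = len(rates[0])
    for n in range(num_steps):
        grads = toy_grad(params, targets)
        # Build new lists each step so the caller's params are not mutated.
        params = [
            [w - rates[l][n] * g for w, g in zip(layer, layer_grads)]
            for l, (layer, layer_grads) in enumerate(zip(params, grads))
        ]
    return params


def static_rates(num_layers: int, num_steps: int, value: float) -> Rates:
    """Ablation baseline: all layers and steps share one constant rate."""
    return [[value] * num_steps for _ in range(num_layers)]


# Two layers, four inner steps (the step count used in the ablation).
params = [[1.0, -1.0], [2.0]]
targets = [[0.0, 0.0], [0.0]]

# Hand-picked stand-in for a learned per-layer, per-step rate table.
dynamic = [[0.5, 0.5, 0.5, 0.5], [1.0, 0.0, 0.0, 0.0]]

adapted_dynamic = inner_loop(params, targets, dynamic)
adapted_static = inner_loop(params, targets, static_rates(2, 4, 0.5))
```

The static setting is just the special case where every entry of the rate table is the same constant, so the ablation changes only the contents of `rates` and leaves the update rule untouched. The toy example shows the mechanics only and says nothing about the size of the performance gap reported above.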

Figure 8: We study how much the dynamic learning rate in the inner loop contributes to the outer loop performance. We fix all the hyperparameters except the choice between a dynamic and a fixed learning rate. We conduct the analysis using the CLUTRR-SG dataset since it is more complex and difficult (lower random performance).

:::info Authors:

(1) Zeming Chen, EPFL (zeming.chen@epfl.ch);

(2) Gail Weiss, EPFL (antoine.bosselut@epfl.ch);

(3) Eric Mitchell, Stanford University (eric.mitchell@cs.stanford.edu);

(4) Asli Celikyilmaz, Meta AI Research (aslic@meta.com);

(5) Antoine Bosselut, EPFL (antoine.bosselut@epfl.ch).

:::

:::info This paper is available on arXiv under a CC BY 4.0 DEED license.

:::


APA

The Tech Reckoning is Upon Us! | Sciencx (2025-10-29T15:38:56+00:00) Ablation Study Confirms Necessity of Dynamic Rates for RECKONING Performance. Retrieved from https://www.scien.cx/2025/10/29/ablation-study-confirms-necessity-of-dynamic-rates-for-reckoning-performance/
