RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge

This article evaluates RECKONING’s generalizability on the real-world multi-hop logical reasoning task, FOLIO.


This content originally appeared on HackerNoon and was authored by The Tech Reckoning is Upon Us!

Abstract and 1. Introduction

  1. Background

  2. Method

  3. Experiments

    4.1 Multi-hop Reasoning Performance

    4.2 Reasoning with Distractors

    4.3 Generalization to Real-World knowledge

    4.4 Run-time Analysis

    4.5 Memorizing Knowledge

  4. Related Work

  5. Conclusion, Acknowledgements, and References

\ A. Dataset

B. In-context Reasoning with Distractors

C. Implementation Details

D. Adaptive Learning Rate

E. Experiments with Large Language Models

4.3 Generalization to Real-World knowledge

To investigate how generalizable our method is to real-world knowledge beyond the synthetic setting, we evaluate RECKONING on a more real-world multi-hop logical reasoning task, FOLIO [29], and report the result in Table 2. The dataset has a rich vocabulary, diverse logic patterns, and abundant language variations. It has been shown to challenge LLMs in both supervised fine-tuning and in-context learning settings. We fine-tune the GPT-2 model following the in-context reasoning setting as the baseline. As before, we train the GPT-2 model and RECKONING using the multi-task objective. We also compare to more advanced baselines, including GPT-3.5 (text-davinci-003 [55]) and ChatGPT(gpt-3.5-turbo[2]), two popular large language models with around 175B parameters. For these two large models, we evaluate both in the zero-shot and few-shot settings. In the few-shot setting, we prompt the model with 8 single-task examples randomly sampled from the training set to perform in-context learning. We find that RECKONING’s performance (which is initiated here from GPT-2) is better than the GPT-2 in-context reasoning baseline. Compared to the two advanced large language models, RECKONING outperforms them by a significant margin (12% 0-shot and 7% 8-shot). We conclude that RECKONING is effective and significantly benefits reasoning tasks using real-world knowledge.

\ Table 2: Evaluation results on FOLIO. We compare RECKONING against the FT-ICR baseline with GPT-2 and two popular large language models.

\

:::info Authors:

(1) Zeming Chen, EPFL (zeming.chen@epfl.ch);

(2) Gail Weiss, EPFL (antoine.bosselut@epfl.ch);

(3) Eric Mitchell, Stanford University (eric.mitchell@cs.stanford.edu)';

(4) Asli Celikyilmaz, Meta AI Research (aslic@meta.com);

(5) Antoine Bosselut, EPFL (antoine.bosselut@epfl.ch).

:::


:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

2 https://openai.com/blog/chatgpt


This content originally appeared on HackerNoon and was authored by The Tech Reckoning is Upon Us!


Print Share Comment Cite Upload Translate Updates
APA

The Tech Reckoning is Upon Us! | Sciencx (2025-10-24T16:41:01+00:00) RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge. Retrieved from https://www.scien.cx/2025/10/24/reckoning-reasoning-through-dynamic-knowledge-encoding-generalization-to-real-world-knowledge/

MLA
" » RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge." The Tech Reckoning is Upon Us! | Sciencx - Friday October 24, 2025, https://www.scien.cx/2025/10/24/reckoning-reasoning-through-dynamic-knowledge-encoding-generalization-to-real-world-knowledge/
HARVARD
The Tech Reckoning is Upon Us! | Sciencx Friday October 24, 2025 » RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge., viewed ,<https://www.scien.cx/2025/10/24/reckoning-reasoning-through-dynamic-knowledge-encoding-generalization-to-real-world-knowledge/>
VANCOUVER
The Tech Reckoning is Upon Us! | Sciencx - » RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge. [Internet]. [Accessed ]. Available from: https://www.scien.cx/2025/10/24/reckoning-reasoning-through-dynamic-knowledge-encoding-generalization-to-real-world-knowledge/
CHICAGO
" » RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge." The Tech Reckoning is Upon Us! | Sciencx - Accessed . https://www.scien.cx/2025/10/24/reckoning-reasoning-through-dynamic-knowledge-encoding-generalization-to-real-world-knowledge/
IEEE
" » RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge." The Tech Reckoning is Upon Us! | Sciencx [Online]. Available: https://www.scien.cx/2025/10/24/reckoning-reasoning-through-dynamic-knowledge-encoding-generalization-to-real-world-knowledge/. [Accessed: ]
rf:citation
» RECKONING: Reasoning through Dynamic Knowledge Encoding: Generalization to Real-World knowledge | The Tech Reckoning is Upon Us! | Sciencx | https://www.scien.cx/2025/10/24/reckoning-reasoning-through-dynamic-knowledge-encoding-generalization-to-real-world-knowledge/ |

Please log in to upload a file.




There are no updates yet.
Click the Upload button above to add an update.

You must be logged in to translate posts. Please log in or register.