:::info Authors:

(1) Corby Rosset, Microsoft Research (correspondence to corbyrosset@microsoft.com);

(2) Ching-An Cheng, Microsoft Research;

(3) Arindam Mitra, Microsoft Research;

(4) Michael Santacroce, Microsoft Research;

(5) Ahmed Awadallah, Microsoft Research (correspondence to hassanam@microsoft.com);

(6) Tengyang Xie, Microsoft Research (correspondence to tengyangxie@microsoft.com).

:::
Table of Links
2.1 RLHF Based on Reward Models
2.2 RLHF with General Preferences
3 Direct Nash Optimization and 3.1 Derivation of Algorithm 1
4 Practical Algorithm – Iterative Contrastive Self-Improvement
5 Experiments and 5.1 Experimental Setup
Appendix
A Extension to Regularized Preferences
C Additional Experimental Details
B Detailed Proofs
In this section, we provide detailed proofs for our theoretical results. Note that the definitions and assumptions presented here heavily adopt ideas related to version spaces and concentrability from the reinforcement learning theory literature (esp. Xie et al., 2021, 2023). Nevertheless, the descriptions herein are intentionally simplified to convey the core insights behind the algorithmic design; a full and exhaustive theoretical analysis falls outside the primary scope of this paper. We now make the following definitions and assumptions.
[The assumption and definition statements appear as images in the original and were not recovered here; see the arXiv version.]
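For orientation while those displays are missing: in this line of analysis (cf. Xie et al., 2023), a version space is typically the set of candidate policies whose empirical loss is near-optimal on the observed preference data. The following is a minimal illustrative sketch in notation we introduce here (the empirical loss L̂_n and tolerance ε are our labels), not the exact statement from the paper:

```latex
% Illustrative version-space sketch (our notation, not the paper's exact
% definition): the set of policies whose empirical log loss on n preference
% samples is within a tolerance \varepsilon of the empirical minimizer.
\Pi_{\varepsilon} \;:=\; \Big\{ \pi \in \Pi \,:\,
  \widehat{L}_{n}(\pi) \;\le\; \min_{\pi' \in \Pi} \widehat{L}_{n}(\pi') + \varepsilon \Big\}
```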
Definition 2 can be viewed as a natural extension of concentrability from the (offline) reinforcement learning literature to our setup.
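As a point of reference, classical concentrability in offline RL bounds the density ratio between the distribution a comparator policy induces and the data distribution, and Xie et al. (2021) refine it so that distribution shift is measured only through the function class. An illustrative sketch of both shapes, again in our own notation rather than the paper's exact Definition 2:

```latex
% Classical density-ratio concentrability from offline RL:
C_{\pi} \;:=\; \sup_{s,\,a} \frac{d^{\pi}(s, a)}{\mu(s, a)},
% and a function-class-relative refinement in the spirit of Xie et al. (2021),
% where shift is measured only through squared errors of functions in \mathcal{F}:
\qquad
C_{\pi}(\mathcal{F}) \;:=\; \max_{f \in \mathcal{F}}
  \frac{\mathbb{E}_{d^{\pi}}\big[(f - f^{\star})^{2}\big]}
       {\mathbb{E}_{\mu}\big[(f - f^{\star})^{2}\big]}.
```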
Proof of Theorem 2. We present the proof in two steps.
Step 1: From regression with log loss to a squared-error bound. By standard results on regression with the logarithmic loss, we know

[Displayed bound shown as an image in the original; not recovered here, see the arXiv version.]
Note that similar results also apply beyond finite Π; for simplicity, we omit the detailed discussion in our paper. For more in-depth discussion of regression with the logarithmic loss, the reader can refer to, e.g., Foster and Krishnamurthy (2021).
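To convey the shape of the missing display, a representative textbook-style form of this guarantee is the following; the constants, the sample size n, and the notation σ̂, σ* for the learned and true preference functions are illustrative choices of ours, not necessarily those of the paper. With probability at least 1 − δ, the empirical log-loss minimizer over a finite, realizable class Π satisfies:

```latex
% Representative log-loss (MLE) regression guarantee; notation and constants
% are illustrative, not the paper's exact statement. \hat{\sigma} is the
% empirical log-loss minimizer over a finite class \Pi containing the true
% preference function \sigma^{\star} (realizability).
\mathbb{E}_{x \sim \rho,\; (y^{1}, y^{2}) \sim \mu}
  \Big[ \big( \hat{\sigma}(x, y^{1}, y^{2}) - \sigma^{\star}(x, y^{1}, y^{2}) \big)^{2} \Big]
\;\lesssim\; \frac{\log\!\big( |\Pi| / \delta \big)}{n}
% Intuition: the log loss controls squared Hellinger distance, which for
% Bernoulli preference labels dominates the squared error on the left.
```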
[Step 2 and its accompanying displayed equations appear as images in the original and were not recovered here; see the arXiv version.]
On the other hand, we have
[The concluding chain of displayed inequalities was not recovered here; see the arXiv version.]
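Although those concluding displays are missing, arguments of this two-step form standardly close with a change of measure: the squared-error bound of Step 1 holds under the data distribution μ, and a concentrability coefficient such as the one in Definition 2 transfers it to the comparator's distribution. A minimal sketch, assuming a coefficient C bounding the relevant ratio of expectations (our notation, not the paper's exact final bound):

```latex
% Illustrative change-of-measure step (our notation; the paper's exact closing
% argument may differ). Jensen's inequality, then concentrability with
% coefficient C, then the Step 1 bound:
\mathbb{E}_{\pi^{\star}}\big[\, |\hat{\sigma} - \sigma^{\star}| \,\big]
\;\le\; \sqrt{ \mathbb{E}_{\pi^{\star}}\big[ (\hat{\sigma} - \sigma^{\star})^{2} \big] }
\;\le\; \sqrt{ C \cdot \mathbb{E}_{\mu}\big[ (\hat{\sigma} - \sigma^{\star})^{2} \big] }
\;\lesssim\; \sqrt{ \frac{C \, \log\!\big( |\Pi| / \delta \big)}{n} }.
```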
:::info This paper is available on arXiv under a CC BY 4.0 DEED license.
:::