From 91b06111d19f748d6b6895ebd26a10f483684314 Mon Sep 17 00:00:00 2001 From: aethrvmn Date: Fri, 4 Oct 2024 14:44:35 +0200 Subject: [PATCH] fixed single letter --- content/_index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/_index.md b/content/_index.md index 4c9baa5..c86a62b 100644 --- a/content/_index.md +++ b/content/_index.md @@ -11,7 +11,7 @@ Lyceum is a Reinforcement Learning (RL) playground designed for natural language ## Why Lyceum? *** -Modern NLP solutions like GPTs and BERTs have made great strides in language processing and generations, however they come with serious limitations. Even though a LLM can describe or make a game of chess, and even justify moved made, it is unable to *play* it. Why? Because there's no underlying mechanism for decision-making or reward-incentives during training. Transformers rely on static token distributions without real-time feedback, limiting their capacity to *actively* learn. +Modern NLP solutions like GPTs and BERTs have made great strides in language processing and generation, however they come with serious limitations. Even though a LLM can describe or make a game of chess, and even justify moved made, it is unable to *play* it. Why? Because there's no underlying mechanism for decision-making or reward-incentives during training. Transformers rely on static token distributions without real-time feedback, limiting their capacity to *actively* learn. Lyceum tries to address this gap by shifting the focus to active learning through reinforcement. In Lyceum, the agent doesn't just passively learn to generate language; it learns through *interaction*.