diff --git a/content/_index.md b/content/_index.md index 4c9baa5..c86a62b 100644 --- a/content/_index.md +++ b/content/_index.md @@ -11,7 +11,7 @@ Lyceum is a Reinforcement Learning (RL) playground designed for natural language ## Why Lyceum? *** -Modern NLP solutions like GPTs and BERTs have made great strides in language processing and generations, however they come with serious limitations. Even though a LLM can describe or make a game of chess, and even justify moved made, it is unable to *play* it. Why? Because there's no underlying mechanism for decision-making or reward-incentives during training. Transformers rely on static token distributions without real-time feedback, limiting their capacity to *actively* learn. +Modern NLP solutions like GPTs and BERTs have made great strides in language processing and generation, however they come with serious limitations. Even though a LLM can describe or make a game of chess, and even justify moved made, it is unable to *play* it. Why? Because there's no underlying mechanism for decision-making or reward-incentives during training. Transformers rely on static token distributions without real-time feedback, limiting their capacity to *actively* learn. Lyceum tries to address this gap by shifting the focus to active learning through reinforcement. In Lyceum, the agent doesn't just passively learn to generate language; it learns through *interaction*.