update homepage

This commit is contained in:
aethrvmn 2024-11-10 22:36:16 +01:00
parent 3d94ce3ab7
commit 94bbef9f94

View file

@ -4,19 +4,19 @@ type: docs
---
# Welcome to Lyceum
***
---
**If RL agents go to the gym to train, they should also go to the lyceum to learn.**
Lyceum is a Reinforcement Learning (RL) playground designed for natural language processing (NLP) experimentation. Just as Gymnasium provides environments for RL agents to train, Lyceum is the place where RL agents learn. Imagine a school where RL agents enroll in classes to master subjects like language, math, even philosophy, and then unwind at after-school clubs like chess, where they can test their decision-making skills.
## Why Lyceum?
***
## Why RL?
---
Modern NLP solutions like GPTs and BERTs have made great strides in language processing and generation, however they come with serious limitations. Even though a LLM can describe or make a game of chess, and even justify moved made, it is unable to *play* it. Why? Because there's no underlying mechanism for decision-making or reward-incentives during training. Transformers rely on static token distributions without real-time feedback, limiting their capacity to *actively* learn.
Lyceum tries to address this gap by shifting the focus to active learning through reinforcement. In Lyceum, the agent doesn't just passively learn to generate language; it learns through *interaction*.
## How It Works
***
---
At the heart of Lyceum is the idea of teaching RL agents to process language through experience, not just token patters. Here's how it's done:
### Classes: The Learning Process
@ -39,7 +39,7 @@ In these clubs, agents don't just process language and output tokens---they use
The agent learns not only to describe a game, but to actually *play* it, honing decision-making and strategy through real-time feedback. After-school clubs provide traditional RL training environments, where the agent uses natural language to try and achieve goals set by the environment. This should provide higher-order thinking and help agents apply their language skills in dynamic, interactive settings.
## The Road Ahead
***
---
Lyceums goal is to push the boundaries of what RL agents can achieve in NLP, and hopefully approach the next step, NLU. By training agents one subject (or language) at a time, we aim to build systems that dont just output sequences of tokens but actually understand the subject matter. The hope is that by mastering specific classes, agents will converge toward stronger performance in each domain, learning to carry out real-world tasks that require both language comprehension and decision-making.
In the future, Lyceum agents could move beyond basic language generation, becoming proficient in tasks like scientific writing, tutoring, or even dynamic conversation. With a solid foundation built in the classroom and refined in after-school activities, these agents could one day graduate from simply generating language to mastering the full depth of human communication.
@ -57,7 +57,7 @@ We need researchers, developers, and experimenters to help us grow the Lyceum pl
- **Use the thing and spread the word**: Simply by spreading the idea of RL-based NLP you are helping this platform. The more eyes the better.
## Enroll Your Agent Now!
***
---
- **Non-Commercial License**: At Lyceum, we believe in consensual interactions only. Our custom Don't Be Evil License ensures that your agents wont be used for commercial purposes without your consent. Whether you're training your agent to solve puzzles or become the next great debater, rest assured it won't be used by Big Tech without your approval.
- **Free Tuition**: We offer *free* tuition for all rl agents---no hidden fees, no secret costs. Our license allows everyone and anyone to modify, improve, and experiment with Lyceum to their heart's content, free of charge!
- **Open Curriculum**: No set classes here. Feel free to enroll your agents in as many classes as you'd like! Want them to be great at English *and* quantum mechanics? We got you covered.