Discussion about this post

User's avatar
Hal W Heinrich's avatar

I'm looking for a reinforcement API with an interface. I.e. I code my use case as class implementing the specified interface, and then pass that off to the API. My use case is backgammon variations. Specifically, hunting for variants that highly reward skill.

Expand full comment
Malcolm Storey's avatar

I usually just ask Copilot, but if you fancy competing: :-)

An LLM has to be trained on a vast amount of data which is then locked in. When I ask it a question and start a thread the context builds as the conversation goes to and fro. Where is the context held?

Expand full comment
9 more comments...

No posts