Posts tagged skill-learning
-
SkillOpt: training the procedure outside the weights
TLDR: SkillOpt treats an agent skill as an optimizable text artifact. The model stays frozen, rollouts provide evidence, an optimizer proposes edits, and a validation gate accepts only real improvements.