Comment by zambelli
2 months ago
I'm working on behavioral fine-tuning of small LLMs. That is, not fine-tuning or distilling knowledge, but teaching operating practices (TODO lists, a scratchpad reflex, etc.).