Comment by wakeless

7 hours ago

I did a similar set of evals myself utilising the baseline capabilities that Phoenix (elixir) ships with and then skillified them.

Regularly the skills were not being loaded and thus not utilised. The outputs themselves were fine. This suggested that at some stage through the improvements of the models that baseline AGENTS.md had become redundant.