Comment by falcor84
16 days ago
> If an llm is fed on losing / scammy rubbish, how could it possibly produce a return?
Rather than just relying on pretraining, you'd use RL on the trade outcomes.
16 days ago
RL could reasonably be expected to work if the market had some sort of discoverable, static behavior.
RL by backtesting cannot work because the real market is continuously changing: all the agents within it, both human and automated, are constantly updating their opinions and strategies.
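
To make the non-stationarity point concrete, here is a toy sketch, not a real trading setup. It assumes a made-up market with a single `persistence` parameter (the probability that a move repeats the previous one) and uses an epsilon-greedy bandit as a stand-in for RL. All names (`simulate_returns`, `learn_policy`, `pnl`) are hypothetical. The policy is learned on a trending backtest and then run on a "live" series where the other participants have adapted and the trend now reverses.

```python
import random

def simulate_returns(n, persistence, rng):
    """Toy series: each move repeats the previous sign with probability `persistence`."""
    sign, out = 1, []
    for _ in range(n):
        if rng.random() > persistence:
            sign = -sign
        out.append(sign * 0.01)
    return out

def pnl(returns, action):
    """action=+1 follows the previous move (momentum), -1 fades it (mean reversion)."""
    return sum(action * (1 if prev > 0 else -1) * cur
               for prev, cur in zip(returns, returns[1:]))

def learn_policy(returns, rng, episodes=200, eps=0.1, window=50):
    """Epsilon-greedy bandit over the two actions, rewarded by PnL on random backtest windows."""
    q = {+1: 0.0, -1: 0.0}
    counts = {+1: 0, -1: 0}
    for _ in range(episodes):
        explore = rng.random() < eps
        a = rng.choice([+1, -1]) if explore else max(q, key=q.get)
        start = rng.randrange(len(returns) - window)
        reward = pnl(returns[start:start + window], a)
        counts[a] += 1
        q[a] += (reward - q[a]) / counts[a]  # running mean of rewards per action
    return max(q, key=q.get)

rng = random.Random(0)
backtest = simulate_returns(2000, persistence=0.8, rng=rng)  # historical regime: trends persist
live = simulate_returns(2000, persistence=0.2, rng=rng)      # assumed shift: trends now reverse

policy = learn_policy(backtest, rng)
backtest_pnl = pnl(backtest, policy)
live_pnl = pnl(live, policy)
print("learned action:", policy)
print("backtest PnL:", round(backtest_pnl, 2))
print("live PnL:", round(live_pnl, 2))
```

The learner does exactly what it was asked to do: it finds the action that paid off in the historical data. It loses money afterwards only because the rule it discovered stopped being true, which is the failure mode described above.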