← Back to context

Comment by vonneumannstan

5 hours ago

>Of course, none of this scales. Some of our intro courses have a thousand students.

Any ideas are much appreciated.

Oral exams graded by LLMs? Scale with the improving models. Based on GPQA Diamond results they're mostly at PhD level for subject trivia anyway.

The problem here is that it will work for now, but how do you make sure the LLM talks to the student, and not a different LLM? I guess vision models FTW?

In the end, will be build a GAN loop?

Why am I now reminded of corewars?