Comment by whimsicalism

2 years ago

> I have built a Torch-like NN framework in C++ from scratch

Mm, care to share? I am skeptical when people make claims like this, even though it is very achievable. Many people are simply "playing house" when it comes to ML tech.

I would not believe everything you read even in the tech aligned press, it is very often false. Google Deepmind is not exclusively researching LLMs.

I never released my framework, and don't intend to (abandoned this a good few years ago), but it was more than "playing house" ... It was complete enough to build/train a convnet that worked with CIFAR-10, supporting both GPU via cuDNN and CPU via my own Tensor class with MKL/IPP BLAS/etc acceleration. The API was Torch-like where you build the graph (create nodes, then connect them), then run it. I was in process of writing a version 2 with support for RNNs and auto migration of tensors from CPU to/from GPU, but gave up since PyTorch had since appeared (obviously a better approach) and it became increasingly obvious how ridiculous it was for a one-man project to attempt to catch up to SOTA!