Comment by 4fterd4rk

8 hours ago

Great explanation, but the last question is quite simple. You determine the weights via brute force: you run a large amount of data through the network for which you have both the input and the correct output (handwriting and its text, in this case).

3 comments

ggambetta 7 hours ago

"Brute force" would be trying random weights and keeping the best performing model. Backpropagation is compute-intensive but I wouldn't call it "brute force".
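To make the distinction concrete, here is a minimal sketch, not anyone's actual training code. It assumes a toy one-input linear model instead of a handwriting network, and the names `random_search` and `gradient_descent` are made up for illustration. The first function guesses weights and keeps the best guess; the second follows the gradient of the loss, which is the quantity backpropagation computes for multi-layer networks.

```python
import random

# Toy labeled data: inputs with known correct outputs (y = 3x + 1).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [3.0 * x + 1.0 for x in xs]


def loss(w, b):
    """Mean squared error of the model y = w*x + b on the toy data."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def random_search(trials, seed=0):
    """Guess random weights and keep the best-performing pair."""
    rng = random.Random(seed)
    best = (0.0, 0.0)
    for _ in range(trials):
        cand = (rng.uniform(-10, 10), rng.uniform(-10, 10))
        if loss(*cand) < loss(*best):
            best = cand
    return best


def gradient_descent(steps, lr=0.05):
    """Repeatedly step the weights against the gradient of the loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Analytic gradients of the mean squared error w.r.t. w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


if __name__ == "__main__":
    print("random search:   ", loss(*random_search(200)))
    print("gradient descent:", loss(*gradient_descent(200)))
```

Both functions see exactly the same labeled data. The only difference is how the next candidate weights are chosen: blindly in one case, guided by the gradient in the other.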

Ygg2 7 hours ago
"Brute force" here is about the amount of data you're ingesting. It's no AlphaZero, which learns from scratch.
- jazzpush2 4 hours ago
  
  What? Either option requires sufficient data. Brute force implies iterating over all combinations until you find the best weights. Back-prop is an optimization technique.