
I wrote a home-brew neural network around 2006, just to see what would happen. I'd read no papers on it, and just kind of made it up as I went along. The result was basically a cube of "neurons" which had stronger and weaker trigger points to their neighbors and would propagate "spark" to one or more neighbors based on the strength and direction of spark they got from their other neighbors. Each of the connections and weights had a "certainty" attached to it which would go up when the output was closer to the ideal and would go down when the output was garbage. When those certainty values hit zero, the weights for that neuron would be re-randomized. I quickly realized that the only thing this could really do was try to transform one 10x10 pixel bitmap into a different one. But it was fascinating that it actually seemed to "learn" patterns as they got baked in.
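
Roughly, that certainty mechanism boils down to something like the sketch below. This is a toy Python reconstruction of the idea, not the original ActionScript; the depth, activation function, and step sizes are all made up for illustration.

    import numpy as np

    rng = np.random.default_rng(0)

    SIDE = 10                     # the 10x10 pixel bitmaps
    N_LAYERS = 4                  # made-up depth for the "cube"
    weights = [rng.normal(0, 1, (SIDE * SIDE, SIDE * SIDE)) for _ in range(N_LAYERS)]
    certainty = [np.ones(SIDE * SIDE) for _ in range(N_LAYERS)]  # one value per neuron

    def forward(bitmap):
        """Propagate 'spark' from the front face of the cube to the back face."""
        x = bitmap.reshape(-1).astype(float)
        for w in weights:
            x = np.tanh(w @ x)    # bounded activation standing in for spark strength
        return x.reshape(SIDE, SIDE)

    def update(inp, target, step=0.05):
        """Raise certainty when the output is closer to the ideal, lower it otherwise,
        and re-randomize any neuron whose certainty hits zero."""
        err = np.abs(forward(inp) - target).mean()
        delta = step if err < 0.5 else -step      # crude "closer to ideal" test
        for layer in range(N_LAYERS):
            certainty[layer] = np.clip(certainty[layer] + delta, 0.0, 1.0)
            dead = certainty[layer] <= 0.0
            if dead.any():                        # certainty hit zero: start those neurons over
                weights[layer][dead] = rng.normal(0, 1, (int(dead.sum()), SIDE * SIDE))
                certainty[layer][dead] = 1.0

The spark-and-direction propagation and the visualization are left out; the point is just the certainty-driven re-randomization.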

What would be called "attention" now, though, basically didn't exist in that system. Once you started training it on something new it lost everything.
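
For anyone who hasn't seen it spelled out, the "attention" in modern transformers is nothing exotic: each output is a weighted mix of value vectors, with the weights coming from query/key similarity. A minimal NumPy version of the standard scaled dot-product form (not tied to any particular model):

    import numpy as np

    def attention(Q, K, V):
        """Q: (n_queries, d), K: (n_keys, d), V: (n_keys, d_v)."""
        scores = Q @ K.T / np.sqrt(Q.shape[-1])        # query/key similarity
        scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
        w = np.exp(scores)
        w /= w.sum(axis=-1, keepdims=True)             # softmax over the keys
        return w @ V                                   # weighted mix of values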

For anyone wondering, it was written in ActionScript 3, and ridiculously, each neuron was bound to a display class rendered as a semitransparent cube that lit up as the inputs propagated through it. A thoroughly ridiculous side project.

But other than scaling that from 1000 neurons to billions, I'm curious what has changed about the concepts of pathing or tolerance to make these models better? Maybe my concept of the principle behind modern LLMs is too archaic or rooted in a cartoon understanding of our own wetware that I tried to reproduce.

[edit: I'm describing an ancient home project... for anyone downvoting this, I'm more than receptive to hearing your reasons why it's stupid. I'm the first to admit it seems stupid!]




A large percentage of the RWKV community aren't experts, and are here doing weird, dumb, or crazy homebrew experiments.

So keep doing weird experiments


> But other than scaling that from 1000 neurons to billions, I'm curious what has changed about the concepts of pathing or tolerance to make these models better? Maybe my concept of the principle behind modern LLMs is too archaic or rooted in a cartoon understanding of our own wetware that I tried to reproduce.

Realistically... the problem is a data problem. The math is mostly there. Transformers et al. would have been figured out in no time had we had the sort of data tools we have today. I'm talking about the ease with which you can take gigabytes of data, throw it in S3, and analyze it in minutes.

That, combined with cheap and accessible compute (cloud) and the maturation of CUDA meant it was all a matter of time before this took place.
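
To make the data-tooling point concrete: "throw gigabytes in S3 and analyze it in minutes" really is a few lines these days. A minimal sketch (the bucket, key, and column names are hypothetical; assumes pandas with s3fs installed):

    import pandas as pd  # reading s3:// paths requires the s3fs package

    # Hypothetical bucket and key; any Parquet dataset on S3 works the same way.
    df = pd.read_parquet("s3://my-bucket/corpus/text.parquet")

    # A quick first pass at the data: rough token counts per document.
    df["n_tokens"] = df["text"].str.split().str.len()
    print(df["n_tokens"].describe())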


> Transformers et al. would have been figured out in no time had we had the sort of data tools we have today.

I disagree. Even things that seem obvious in retrospect take some time to be figured out.

ResNets, batch norm, dropout, etc. are examples of this.

And I don't think transformers are obvious.


What determined whether its output was "ideal" for the certainty measure to go off of? Backprop?


I'd need to go back and look at the code, but it was something primitive but similar to backprop. There was an evaluation routine that compared what lit up on the back of the cube to the desired output. The farther off it was, the more certainty got docked from any of the [connected] neurons one layer up from the bad pixel, and that dragged down the certainties on the next layer, and so on. It didn't have the ability to back-check which neurons were involved in which specific output node, but I guess it was a scattershot attempt at that.
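
A toy rendering of that docking scheme (not the actual routine; the shapes, docking rate, and normalization are invented for illustration):

    import numpy as np

    def dock_certainty(certainty, weights, out, target, dock_rate=0.05):
        """certainty[l]: (n_l,) per-neuron certainties, front to back.
        weights[l]: (n_l, n_{l-1}) connections feeding layer l from the layer before it.
        out, target: the back-face bitmap and the desired one."""
        blame = np.abs(out - target).reshape(-1)   # how far off each back-face pixel is
        for layer in range(len(certainty) - 1, -1, -1):
            certainty[layer] = np.clip(certainty[layer] - dock_rate * blame, 0.0, 1.0)
            if layer > 0:
                # Spread the blame one layer up through whatever connections exist,
                # without tracking which neuron fed which specific output pixel
                # (the "scattershot" part).
                blame = np.abs(weights[layer]).T @ blame
                blame /= blame.max() + 1e-9        # keep the docking bounded
        return certainty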




