My crazy scientist idea - collaborators wanted!
-
W∴ Balboos wrote:
I still don't see what would really be learned from a zillion randomized one-byte instructions. If they're all essentially Z80 instructions 'as is', then they would all be 'true' - and so the result for the training set would be, Caribbean-style: "It's All OK".
Not all instructions are one byte long; you could insert 2-byte or 3-byte instructions into the training samples. And there would be multiple outputs, as described earlier. Imagine an emulator like this:
case clear_a_instruction:
    setflag(z);
    clearreg(a); /* write data value 0 to register a */
    break;
case clear_z_instruction:
    setflag(z);
    break;
...The neural network would have the following outputs (each 1 bit):
- an output to say that we are writing data,
- 3 outputs, each a float, to say what the source is (7 possible registers),
- an output to say we want to set the zero flag,
- 8 outputs for data.
So when our neural net obtains a byte with the value "clear_a_instruction", its output would be: a) the setflag(z) output set between 0.5 and 1, which means it fires; b) the write-to-register output set between 0.5 and 1, which means it fires; c) the three register outputs set so that they select 'A'; d) the data bits set below 0.5, which gives binary 0. So you would do a write operation with data = 0, destination = register A, and set the zero flag. Which is what this instruction is supposed to do.
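The decoding step described above can be sketched in Python. The exact output layout here is my own assumption for illustration (one write-enable output, three register-select outputs, one zero-flag output, eight data outputs), not something the post pins down; the only rule taken from the post is that anything above 0.5 counts as firing:

```python
def decode_outputs(outputs):
    """Interpret raw network outputs (floats in 0..1) as a micro-operation.

    Assumed layout: [write_enable, reg_sel2, reg_sel1, reg_sel0,
                     set_zero_flag, d7, d6, d5, d4, d3, d2, d1, d0]
    """
    fires = [o > 0.5 for o in outputs]      # anything above 0.5 "fires"
    write_enable = fires[0]
    # Three binary register-select lines encode the destination register.
    reg = (fires[1] << 2) | (fires[2] << 1) | fires[3]
    set_zero_flag = fires[4]
    # Eight data outputs thresholded into a byte, most significant bit first.
    data = 0
    for bit in fires[5:13]:
        data = (data << 1) | bit
    return write_enable, reg, set_zero_flag, data

# "clear A" pattern: write fires, register A (0) selected, Z flag fires,
# all data outputs low -> data byte 0.
outs = [0.9, 0.1, 0.1, 0.1, 0.8] + [0.2] * 8
print(decode_outputs(outs))  # (True, 0, True, 0)
```

A driver around this would then apply the decoded write and flag change to its register file, which is exactly what the emulator's `case clear_a_instruction` arm does directly.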
Well - I admit I'm not following - or it seems that the neural net has already been trained. That is, effectively, a rule set.
Tomaž Štih wrote:
Not all instructions are one byte long; you could insert 2-byte or 3-byte instructions into the training samples.
You did say a gigabyte of random data, did you not? One does not deliberately insert things into random data and keep calling it random. Perhaps you're trying to seed code in a sea of random data so that the Neural Net will learn how to find it? I could see that as a way to search for messages or embedded code. But, as I started out - I guess I'm not following your vision.
"The difference between genius and stupidity is that genius has its limits." - Albert Einstein
"If you are searching for perfection in others, then you seek disappointment. If you are seeking perfection in yourself, then you will find failure." - Balboos HaGadol Mar 2010
-
W∴ Balboos wrote:
Well - I admit I'm not following - or it seems that the neural net already has been trained. That is, effectively, a rule set.
My fault, I am not a native speaker. Perhaps it is better to prepare a small sample and come back later. Just one last try, with a simplified case: a fictional processor with just four 8-bit registers and only 8-bit instructions, which affect these registers and nothing else. So...
1. Neural network outputs can range from 0 to 1. These values are floats. This means a single output can represent a value between 0 and 255, or even between 0 and 65535 (if you want to represent values from 0 to 255 as values between 0 and 1, you use n/255 for value n, so 255 maps to 1).
2. This means that a single neural network output can represent an 8- or 16-bit register. For simplicity let us assume our fictional processor has only four 8-bit registers: A, B, C, and D. So we have four outputs.
3. Now let us feed the neural network 10 sequential bytes. Since the values fed to the net are 0-255, we could use one single input for that (...or 8 inputs fed a binary pattern; it is a matter of choice). Structure of the ANN: let us have 8 inputs, 4 outputs, and (let's invent it right now:) eight hidden layers, each with 8 neurons.
4. We now produce emulation software (not a neural network, but a real emulator) for this fictional processor, and we execute these 10 sequential bytes on the emulator. The emulator will interpret the instructions and correctly set the four registers after each instruction.
5. Now we can train our network. We feed it one byte, and we feed the same byte to the emulator too. The emulator provides the correct value for each register after the instruction; the neural network provides its output. We calculate the error (as the difference between the correct value and the ANN output) and back-propagate it into the network, i.e. update the weights.
6. Now we feed the neural net 100,000 random instructions (bytes) and do the same with the emulator, back-propagating after each byte fed into the network.
The assumption is that after this training the neural network will actually know how to emulate our fictional processor and will produce the correct register values (4 outputs) based on the input (8 inputs). T.
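The steps above can be sketched as a runnable toy in Python. Everything specific here is invented for the sketch, not taken from the post: the fictional instruction set (bits 0-1 of the opcode select the register, bit 7 chooses between setting it to 255 and clearing it to 0), the single hidden layer (rather than the eight layers suggested above), and the learning rate. The shape of the loop, though, is exactly steps 4-6: a table-driven emulator supplies the target register values, and the network is trained by back-propagating the difference.

```python
import math
import random

random.seed(0)

# Step 4: a real (table-driven) emulator for an invented instruction set.
# Assumed semantics, just for the sketch: bits 0-1 of the opcode select one
# of the four registers; bit 7 decides between "set to 255" and "clear to 0".
def emulate(opcode):
    regs = [0, 0, 0, 0]
    regs[opcode & 3] = 255 if opcode & 0x80 else 0
    return regs

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

# A tiny hand-rolled MLP: 8 inputs (opcode bits) -> 8 hidden -> 4 outputs.
class Net:
    def __init__(self, sizes=(8, 8, 4)):
        self.w = [[[random.uniform(-0.5, 0.5) for _ in range(m)]
                   for _ in range(n)] for m, n in zip(sizes, sizes[1:])]
        self.b = [[0.0] * n for n in sizes[1:]]

    def forward(self, x):
        self.acts = [x]
        for layer, biases in zip(self.w, self.b):
            x = [sigmoid(sum(wi * xi for wi, xi in zip(row, x)) + bias)
                 for row, bias in zip(layer, biases)]
            self.acts.append(x)
        return x

    # Step 5: back-propagate the emulator-vs-network error into the weights.
    def backprop(self, target, lr=0.5):
        out = self.acts[-1]
        delta = [(y - t) * y * (1 - y) for y, t in zip(out, target)]
        for li in range(len(self.w) - 1, -1, -1):
            prev = self.acts[li]
            new_delta = [0.0] * len(prev)
            for j, row in enumerate(self.w[li]):
                self.b[li][j] -= lr * delta[j]
                for i in range(len(row)):
                    new_delta[i] += row[i] * delta[j]  # uses pre-update weight
                    row[i] -= lr * delta[j] * prev[i]
            delta = [d * a * (1 - a) for d, a in zip(new_delta, prev)]

# Step 6: feed random opcodes to both emulator and network, training as we go.
net = Net()
for _ in range(30000):
    opcode = random.randrange(256)
    bits = [(opcode >> k) & 1 for k in range(8)]
    target = [r / 255.0 for r in emulate(opcode)]  # the n/255 scaling above
    net.forward(bits)
    net.backprop(target)

# Check: does the network now agree with the emulator on every opcode?
hits = 0
for opcode in range(256):
    out = net.forward([(opcode >> k) & 1 for k in range(8)])
    for j in range(4):
        hits += (out[j] > 0.5) == (emulate(opcode)[j] == 255)
accuracy = hits / (256 * 4)
print(accuracy)
```

This invented instruction set is deliberately easy (each output is a simple conjunction of input bits); whether the same loop scales to real Z80 semantics is the open question of the thread.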
-
The other day I was playing with the idea of a generic processor emulator. Would someone like to join the experiment to implement it faster? Here's what I am up to:
1) I am going to produce a gigabyte of random bytes (I know, I know, you volunteer for that :doh: ),
2) I am going to treat these bytes as Z80 op-codes,
3) I am going to feed them into a neural network,
4) I am going to interpret them with an emulator and use the expected processor behavior (i.e. state of registers, pc, flags, memory reads/writes, stack reads/writes) to train the network,
5) I am going to wait for a week.
And if I am lucky I am going to end up with a Z80 neural network. But in reality /if it works/ I am going to end up with a generic interpreter generator / duplicator. ...and with it we are going to start "the era of the neuro-device", where software is just a bunch of weights (and info about the network structure)...a trained puppy, really...muwhahaha...
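The data-generation side of the plan (steps 1, 2, and 4) can be sketched in Python. The Z80 emulator is replaced here by a stub with made-up semantics (a real emulator would implement the actual op-codes, flags, and multi-byte instructions); the point is only the shape of the pipeline: random bytes in, labelled processor states out.

```python
import random

random.seed(42)

# Hypothetical interface: a step() function that executes one instruction
# at the current PC and returns the resulting register state. This stub
# only tracks a fake "A register", so the pipeline is runnable end to end.
def make_emulator():
    state = {"a": 0, "pc": 0}
    def step(memory):
        op = memory[state["pc"] % len(memory)]
        state["a"] = (state["a"] + op) & 0xFF   # placeholder semantics
        state["pc"] += 1
        return dict(state)                       # snapshot after instruction
    return step

# Steps 1-2: a buffer of random bytes, treated as op-codes.
memory = bytes(random.randrange(256) for _ in range(1000))

# Step 4: label every byte with the emulator's post-instruction state.
step = make_emulator()
dataset = [(memory[i], step(memory)) for i in range(len(memory))]
print(len(dataset))  # 1000 (byte, state) training pairs
```

Each `(byte, state)` pair is one training example for step 3; scaling the buffer from a kilobyte to the proposed gigabyte only changes the range, not the shape.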
-
That won't work. It's like yelling random words from the dictionary at a baby and expecting it to learn English! It'll all end in tears!
- I would love to change the world, but they won’t give me the source code.
-
Um...wouldn't it be better to start from a known-working Z80 program, rather than random data? If you're trying to train it to work as a Z80 then starting from stuff that only includes valid Z80 opcodes has got to be better than stuff that will immediately crash if fed to a real Z80?
Bad command or file name. Bad, bad command! Sit! Stay! Staaaay...
Funnily enough, random data would probably serve better, especially if backprop were used in place of the neural network. With AI, the best starting point is often not the one that makes logical (programming-type) sense, because it has to learn how to work processes out for itself, not how to take data and reprocess it.
I wanna be a eunuchs developer! Pass me a bread knife!
-
A bit like an infinite number of monkeys with typewriters. Let us know when it's finished.
That's the usual definition we use for AI procedures. The thing is, we settle for one line of Shakespeare, and use billions of iterations, rather than infinite monkeys.
-
Tomaž Štih wrote:
I am going to produce a gigabyte of random bytes
Well, as the saying goes, "garbage in, garbage out" ;) Marc
Imperative to Functional Programming Succinctly Contributors Wanted for Higher Order Programming Project! Learning to code with python is like learning to swim with those little arm floaties. It gives you undeserved confidence and will eventually drown you. - DangerBunny
Garbage in 1,000,000,000 times : garbage out 999,999,999 times is a good start. Iterate until it's: Garbage in 1,000,000,000 times : Whoa!
-
A long time ago in a laboratory far far away I used to do Monte Carlo simulations of molecular surface interactions. Although the data was random (the Monte Carlo part), it had rules to follow. Your proposal seems to be: just feed a ton of random data into a system and . . . well, that's what I don't get. Chaos begets chaos (localized pockets of order are implicit for true chaos as the sample grows large). Perhaps explain how your neural network will digest the information?
OK, let's simplify AI training*:
0. You have data. You might have an idea what it means to you, but it's really only ones and zeroes.
1. You stream this data into a neural network/backprop routine/whatever, which does something (it doesn't matter what) with the data and gives you output.
2a. If the output is useless (99.99999% of the first very many iterations, if you've got it right), you tell it "That output is not useful" (in a coded manner, of course).
2b. If the output is useful, you tell it "That output is useful."
3. Rinse and repeat.
Eventually, you get quasi-self-written code that does exactly what you want it to do, and actually does it better than you could code it, because, as in facial recognition, where you'd need a hundred billion lines of code to deal with every possible detail/angle/lighting effect/mm of hair growth, it can just look at a face and tell you who it is.
* As in training an AI, not training people to use it
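This loop can be run as a toy in Python: the classic "weasel" experiment, where the only feedback the system ever gets is a useful/not-useful score (here, how many characters match one line of Shakespeare, echoing the monkeys-with-typewriters remark earlier in the thread). The mutation rate and batch size are arbitrary choices for the sketch:

```python
import random
import string

random.seed(1)

TARGET = "METHINKS IT IS LIKE A WEASEL"   # our one line of Shakespeare
CHARS = string.ascii_uppercase + " "

# Steps 2a/2b: the only feedback is a usefulness score, nothing more.
def score(attempt):
    return sum(a == t for a, t in zip(attempt, TARGET))

# Step 1: the generator does something (it doesn't matter what) with its input.
def mutate(parent, rate=0.05):
    return "".join(random.choice(CHARS) if random.random() < rate else c
                   for c in parent)

best = "".join(random.choice(CHARS) for _ in TARGET)  # step 0: random data
generations = 0
while score(best) < len(TARGET):          # step 3: rinse and repeat
    generations += 1
    candidates = [best] + [mutate(best) for _ in range(100)]
    best = max(candidates, key=score)     # keep the most "useful" output
print(generations, best)
```

Starting from pure noise, a few hundred generations of nothing but "useful / not useful" feedback recover the target line, which is the whole argument in miniature.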
-
Mark_Wallace wrote:
OK, let's simplify AI training
That's pretty much how genetic algorithms do auto-programming. Only they combine useful programs in the hope that they'll become even more useful.
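What "combine useful programs" looks like in miniature, sketched in Python on the standard OneMax toy problem (fitness = number of 1 bits): the fittest candidates are recombined by crossover, in the hope the offspring are fitter still. Population size, mutation rate, and the truncation-selection scheme are arbitrary choices for the sketch:

```python
import random

random.seed(2)

GENES = 32

def fitness(genome):
    # "Usefulness" on the toy problem: count of 1 bits.
    return sum(genome)

def crossover(a, b):
    # Combine two useful "programs" at a random cut point.
    cut = random.randrange(1, GENES)
    return a[:cut] + b[cut:]

def mutate(genome, rate=0.01):
    return [g ^ 1 if random.random() < rate else g for g in genome]

population = [[random.randint(0, 1) for _ in range(GENES)] for _ in range(50)]
for generation in range(200):
    population.sort(key=fitness, reverse=True)
    if fitness(population[0]) == GENES:
        break
    parents = population[:10]   # keep only the most "useful" candidates
    population = parents + [mutate(crossover(random.choice(parents),
                                             random.choice(parents)))
                            for _ in range(40)]
print(fitness(max(population, key=fitness)))
```

The only difference from the pure feedback loop in the previous post is the `crossover` step: instead of mutating one survivor, pairs of survivors are spliced together.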
-
Tomaž Štih wrote:
Only they combine useful programs in hope that they'll become even more useful.
Ha! Not twenty years ago, they didn't, and nor do they have to now. (Some of us old buggers have been in the game a while, you know)