You understand things because you have developed all of the basic verbal behavior abilities that a human needs. Behavior is real. Verbal behavior is a special kind of behavior. It really exists also. (It can be measured/recorded.) You don’t have a measurable operational definition for the concept of “understanding”. But to the extent that you agree that “understanding” is something “real”, and you have interest in a machine being able to have understanding, it would behoove you to have a measurable operational definition of “understanding”. I merely suggest that is possible through utilizing the various components of verbal behavior.
So you think a 3 year old that toddles around the house and opens and closes doors does not have an “understanding” of what doors are even if it has not learned the word “door”.
von Neumann machines have memory locations in their CPUs called registers. A modern CPU might have 32 registers with 32 bits each.So that is only 128 bytes in the CPU. The contents of these registers can be manipulated and moved according to machine instructions in RAM. Now RAM these days can be in the gigabytes.But the CPU only executes one instruction at a time and there can only be 128 bytes of data in the CPU at a time. But what can that data mean?
Each pixel in a camera these days is three bytes per pixel. That makes 16,000,000 colors possible. So an 8 megapixel camera uses 24 megabytes. But only 128 bytes can fit into the CPU. So how does a program sitting in memory control a CPU that can only hold 128 bytes analyse a picture that is 24 megabytes. There is all kinds of finding edges and analysing shapes and things would get even more complex with stereo cameras trying to deal with three dimensions. Not even 43 pixels can fit into the CPU at one time.
Plenty of people say the CPU is the brain of the computer but it is the program in RAM that is any supposed intelligence. But all data is just a combination of bits. A given combination of bits can be a certain color but that exact same combination can be a temperature or a wind speed. It is all a matter of how the program treats the bits.
Here is the Kinect training:
The software memorised the responses to give to the shapes. He could have called them anything.