_xenoschema t1_j3pizle wrote on January 10, 2023 at 4:19 AM

Hey thanks for sharing all this - it's all very fascinating.

I'm interested in what kind of work you've been doing with models that use active inference.

Mental-Swordfish7129 t1_j3q1hej wrote on January 10, 2023 at 7:29 AM

It's a model that "chooses" its input stream from a 2d array of sensor data (cam, mics, and servo encoders) in real time using policies decoded from predictions of the bottom layer. Then, it processes this input up the hierarchy of identical layers. Higher layer predictions are used to modulate attention.

It may qualify as a general intelligence (idk) as any data can be encoded into the format of its input stream. What I mean is that I have a particular way of encoding video, audio, anything really, into a universal format which preserves the salient semantics.

Currently, it is greatly inhibited in what it can learn because I cannot feed it experiences at the rate it could take them. It has far more potential than realized knowledge.

jimmymvp t1_j3q5wmj wrote on January 10, 2023 at 8:26 AM

Sry, what's the "active" part here? Is the model actually generative? I'm aware of Karl Friston and the free-energy principle. Is the active part the input stream selection? I thought that the active part refers to learning, in a sense that I get to pick my training data along the way. Sounds like what you're doing is akin to Gato from DeepMind with tokenization and is about multi-modal policies (modulo the hierarchical processing and attention).

Is there a math writeup somewhere?

Mental-Swordfish7129 t1_j3q731g wrote on January 10, 2023 at 8:42 AM

Also, I do mean "active" in the ways you describe. The bottom layer actively controls the sensors via servos and a voice coil. The other layers actively modulate their input by masking it (ignoring it non-trivially).

Mental-Swordfish7129 t1_j3q6p7m wrote on January 10, 2023 at 8:37 AM

The model is generative. Each layer generates predictions about the patterns of the layers below. The bottom layer generates predictions about the sensory data, some of which is proprioception data.

I have never published anything. I do not have that much time and it would largely be redundant. You can look at Friston, et.al. for the math. I use nearly the same math and logic.

What I'm doing bears only a superficial similarity to Gato in my opinion, but I can't say I've looked into it deeply. I've been far too busy with life. I only have my tiny spare time for this project unfortunately.

jimmymvp t1_j3q74ms wrote on January 10, 2023 at 8:43 AM

So the active part is the self-predictive part?

Mental-Swordfish7129 t1_j3q81e2 wrote on January 10, 2023 at 8:56 AM

Active just means that it directly modifies its input stream. And, yes, it is also predicting what that input will be, so it is reasonable to say that it is, in part, self-predictive.

Crucially, its input stream also includes features that are not itself or have not been changed by itself. The proprioceptive signals help it learn which is which.

jimmymvp t1_j3q66hi wrote on January 10, 2023 at 8:30 AM

I meant more like research papers from top conferences in ML (neurips, iclr, icml)

Mental-Swordfish7129 t1_j3q7lbz wrote on January 10, 2023 at 8:49 AM

I don't think this model is within the realm ML (it's theoretical neuroscience; although there is much overlap) but does qualify as AI which is what was asked about in the post title.

There is an annual symposium called the International Workshop on Active Inference for about 3 years now where research is presented and the papers are linked there.

And of course the dozens of research papers you can find through Google Scholar on the topic.

Edit: I did find where a few active inference papers have been presented at NeurIPS.

[N] What's next for AI?

Mental-Swordfish7129 t1_j3leaf6 wrote on January 9, 2023 at 10:53 AM