Submitted by vsmolyakov t3_10766uz in MachineLearning
jimmymvp t1_j3ldd4r wrote
Reply to comment by Mental-Swordfish7129 in [N] What's next for AI? by vsmolyakov
Can you reference some works along these lines? "online unsupervised generative models implementing something akin to the free energy principle and active inference"
Mental-Swordfish7129 t1_j3leaf6 wrote
Here's a fairly accessible free e-book by the principal researcher on the topic, Karl Friston...
https://mitpress.mit.edu/9780262045353/active-inference/
He's got tons of papers. He's one of the most cited scientists alive.
Also, there are lectures and such on YouTube. Just search terms "free energy principle", "active inference", "predictive processing".
Some other good books are "Surfing Uncertainty" by Andy Clark and "The Predictive Mind" by Howhy.
_xenoschema t1_j3pizle wrote
Hey thanks for sharing all this - it's all very fascinating.
I'm interested in what kind of work you've been doing with models that use active inference.
Mental-Swordfish7129 t1_j3q1hej wrote
It's a model that "chooses" its input stream from a 2d array of sensor data (cam, mics, and servo encoders) in real time using policies decoded from predictions of the bottom layer. Then, it processes this input up the hierarchy of identical layers. Higher layer predictions are used to modulate attention.
It may qualify as a general intelligence (idk) as any data can be encoded into the format of its input stream. What I mean is that I have a particular way of encoding video, audio, anything really, into a universal format which preserves the salient semantics.
Currently, it is greatly inhibited in what it can learn because I cannot feed it experiences at the rate it could take them. It has far more potential than realized knowledge.
jimmymvp t1_j3q5wmj wrote
Sry, what's the "active" part here? Is the model actually generative? I'm aware of Karl Friston and the free-energy principle. Is the active part the input stream selection? I thought that the active part refers to learning, in a sense that I get to pick my training data along the way. Sounds like what you're doing is akin to Gato from DeepMind with tokenization and is about multi-modal policies (modulo the hierarchical processing and attention).
Is there a math writeup somewhere?
Mental-Swordfish7129 t1_j3q731g wrote
Also, I do mean "active" in the ways you describe. The bottom layer actively controls the sensors via servos and a voice coil. The other layers actively modulate their input by masking it (ignoring it non-trivially).
Mental-Swordfish7129 t1_j3q6p7m wrote
The model is generative. Each layer generates predictions about the patterns of the layers below. The bottom layer generates predictions about the sensory data, some of which is proprioception data.
I have never published anything. I do not have that much time and it would largely be redundant. You can look at Friston, et.al. for the math. I use nearly the same math and logic.
What I'm doing bears only a superficial similarity to Gato in my opinion, but I can't say I've looked into it deeply. I've been far too busy with life. I only have my tiny spare time for this project unfortunately.
jimmymvp t1_j3q74ms wrote
So the active part is the self-predictive part?
Mental-Swordfish7129 t1_j3q81e2 wrote
Active just means that it directly modifies its input stream. And, yes, it is also predicting what that input will be, so it is reasonable to say that it is, in part, self-predictive.
Crucially, its input stream also includes features that are not itself or have not been changed by itself. The proprioceptive signals help it learn which is which.
jimmymvp t1_j3q66hi wrote
I meant more like research papers from top conferences in ML (neurips, iclr, icml)
Mental-Swordfish7129 t1_j3q7lbz wrote
I don't think this model is within the realm ML (it's theoretical neuroscience; although there is much overlap) but does qualify as AI which is what was asked about in the post title.
There is an annual symposium called the International Workshop on Active Inference for about 3 years now where research is presented and the papers are linked there.
And of course the dozens of research papers you can find through Google Scholar on the topic.
Edit: I did find where a few active inference papers have been presented at NeurIPS.
Viewing a single comment thread. View all comments