Viewing a single comment thread. View all comments

PredictorX1 t1_j5rb8gp wrote

>I was in the understanding that two contiguous linear layers in a NN would be no better than only one linear layer.

This is correct: In terms of the functions they can represent, two consecutive linear layers are algebraically equivalent to one linear layer.

1