RetroPenguin_
RetroPenguin_ t1_j171fur wrote
Has anyone here found that the choice of pooling makes any difference?
RetroPenguin_ t1_iy4mjz5 wrote
Reply to comment by daking999 in [D] What method is state of the art dimensionality reduction by olmec-akeru
Yeah, can't be a preprocessing step for clustering etc. But nice for qualitative visualizations, I suppose.
RetroPenguin_ t1_jad51qy wrote
Reply to comment by abnormal_human in [R] Microsoft introduce Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context (i.e., few-shot), and follow instructions (i.e., zero-shot) by MysteryInc152
For the >10B closed-source models, I’d be really curious how many of those weights are zero at fp16 precision.
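If anyone wants to check this on an open checkpoint, here is a rough sketch of how the count could be done. It assumes the weights are available as a NumPy array; `fp16_zero_fraction` and the random `fake_weights` tensor are made-up stand-ins for illustration, not taken from any real model.

```python
import numpy as np

def fp16_zero_fraction(weights: np.ndarray) -> float:
    """Fraction of entries that are exactly zero after casting to float16.

    Counts true zeros as well as values too small for fp16 to represent,
    which round to +0.0 or -0.0 on the cast.
    """
    as_fp16 = weights.astype(np.float16)
    # -0.0 == 0 is True, so negative underflows are counted as well.
    return float(np.mean(as_fp16 == 0))

# Hypothetical stand-in for one layer of a real checkpoint (assumption:
# small Gaussian-initialised weights stored in float32).
rng = np.random.default_rng(0)
fake_weights = rng.normal(0.0, 0.02, size=100_000).astype(np.float32)
print(fp16_zero_fraction(fake_weights))
```

For a real model you would loop over every parameter tensor and sum the zero counts rather than averaging per-layer fractions, since layers differ a lot in size.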