Submitted by lambolifeofficial t3_zzn35o in MachineLearning
Disastrous_Elk_6375 t1_j2de4o2 wrote
Reply to comment by ThatInternetGuy in An Open-Source Version of ChatGPT is Coming [News] by lambolifeofficial
Can 4090s pool their VRAM? I always thought that LLMs need GPUs from the A/V series (e.g. A100/V100) so that they can pool memory. Am I wrong in thinking that?
zaptrem t1_j2e2lvb wrote
You can do pipeline parallelism via FairScale and HF Accelerate on any identical (and sometimes non-identical) GPUs.
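The idea behind pipeline parallelism is that the model's layers are split into stages, each resident on one GPU, and micro-batches flow through the stages in sequence — so no single card has to hold all the weights. A minimal pure-Python sketch of that flow (a toy stand-in, not actual FairScale or Accelerate code; the stage functions and scaling weights are hypothetical):

```python
# Toy illustration of pipeline parallelism: layers are grouped into stages
# (as FairScale's Pipe or HF Accelerate's device_map would place them on
# separate GPUs), and micro-batches pass through stage by stage.

def make_stage(weight):
    # Each stage applies only its own parameters; in a real setup this
    # would be a block of layers living on one GPU.
    def stage(xs):
        return [x * weight for x in xs]
    return stage

# Two "GPUs", each hosting half of the model.
stages = [make_stage(2), make_stage(3)]

def pipeline_forward(batch, stages, micro_batches=2):
    # Split the batch into micro-batches; in a real pipeline the stages
    # overlap work on different micro-batches to hide the pipeline bubble.
    size = len(batch) // micro_batches
    chunks = [batch[i * size:(i + 1) * size] for i in range(micro_batches)]
    outputs = []
    for chunk in chunks:
        for stage in stages:  # each hop models a GPU-to-GPU transfer
            chunk = stage(chunk)
        outputs.extend(chunk)
    return outputs

print(pipeline_forward([1, 2, 3, 4], stages))  # → [6, 12, 18, 24]
```

In the real libraries the splitting is automatic — e.g. Accelerate can infer a `device_map` from each GPU's free memory — which is why mismatched cards can sometimes still be combined.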
ThatInternetGuy t1_j2deqmr wrote
You'd need to deploy the inference model with Colossal-AI.