PLX 88096 - Opinions.
Does anyone use a PLX 88096 or something similar?
If so, could you tell me what tokens/s to expect from a PLX 88096 plus five RTX 5060 Ti 16GB cards running qwen3.6-35b-a3b?
I currently have four RTX 5060 Ti cards in an MZ32-AR0 Ver3.0 motherboard, which I use with qwen3.6-27b, but I'd like to add five more to use with qwen3.6-35b-a3b and mistral-nemo-instruct-2407.
What I actually want is to assemble two PLX systems, each with 4-5 RTX 5060 Ti cards, so that each PLX system runs one model.
However, I haven't found much information about how a PLX setup affects performance, in particular whether token generation would be too slow.
If anyone could shed some light on how the performance would be affected, I would be very grateful.