u/offzinho3k

r/Vllm

PLX 88096 - Opinions.

Does anyone use a PLX 88096 or something similar?
If so, could you tell me what tokens/s to expect from a PLX 88096 + five RTX 5060 Ti 16GB running qwen3.6-35b-a3b?

I currently have four RTX 5060 Ti cards in an MZ32-AR0 Ver3.0 motherboard, running qwen3.6-27b. I'd like to add five more cards to run qwen3.6-35b-a3b and mistral-nemo-instruct-2407.

My actual plan is to assemble two PLX systems, each with 4-5 RTX 5060 Ti cards, so that each PLX system hosts one model.

However, I haven't found much information about performance behind a PLX switch, in particular whether token generation would be too slow.

If anyone could shed some light on how the performance would be affected, I would be very grateful.

u/offzinho3k — 10 days ago

Hello friends, I've looked at several topics but haven't been able to reach a verdict.

I currently have the following configuration:
Motherboard: HUANANZHI H12D-8D
CPU: EPYC 7502
Memory: 8x Hynix DDR4 ECC 16GB 2666
Storage: 3x M.2 SSD Western Digital WD Black SN7100 2TB
GPU: 2x Asus Prime GeForce RTX 5060 Ti OC 16GB GDDR7
Power supply: Corsair AX1600i

I would like to expand the context I can run with qwen3.6-27B. Which GPUs would you recommend to replace the 5060 Ti cards?

I currently use it with Cursor for projects in Node.js, React, and TypeScript.
If anyone could recommend a GPU model that would perform well with qwen3.6-27B, with 2 or 4 GPUs working in parallel, I would be extremely grateful.
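
To frame the question, here is a rough sketch of the VRAM arithmetic for 2 or 4 cards of 16 GB each. It is only an estimate under stated assumptions: it counts weights only, takes the "27B" in the model name as roughly 27 billion parameters, and treats the bits-per-weight values as hypothetical quantization levels. The function names are made up for illustration. KV cache, activations, and runtime overhead all come out of whatever is left, so the real usable context will be smaller than these numbers suggest.

```python
# Rough VRAM budget: weights only. All inputs are assumptions for illustration,
# not measured values for any specific model or runtime.

def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def headroom_gb(num_gpus: int, vram_per_gpu_gb: float,
                params_billion: float, bits_per_weight: float) -> float:
    """VRAM left after weights, before KV cache, activations, and overhead.

    A negative result means the weights alone do not fit.
    """
    total_vram = num_gpus * vram_per_gpu_gb
    return total_vram - weight_gb(params_billion, bits_per_weight)


if __name__ == "__main__":
    for gpus in (2, 4):
        for bits in (4, 8, 16):
            left = headroom_gb(gpus, 16, 27, bits)
            print(f"{gpus} x 16 GB, {bits}-bit weights: {left:+.1f} GB left")
```

The headroom figure is what a longer context has to fit into, which is why total VRAM matters more here than the speed of any single card.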

u/offzinho3k — 16 days ago