u/One-Kraken

Priority Mail Express stuck at Jersey City Distribution Center (NJ to CA) - Missed delivery date, no updates?

I’m looking for some insight or similar experiences with a Priority Mail Express package. It was shipped from New Jersey and was supposed to be delivered to California by yesterday (guaranteed delivery date).
However, the tracking has been completely stuck at the Jersey City NJ Distribution Center with no new scans or updates for over 24 hours.
Since Express is supposed to be guaranteed, I'm getting a bit worried.
Has anyone else had Express packages get stuck in Jersey City recently?
Does Express usually bypass normal delays, or is Jersey City backed up right now?
Should I file for a refund/missing mail search immediately, or wait it out another day?
Appreciate any advice or insight from employees or anyone who has dealt with a package stalled at this facility with no new scans!

u/One-Kraken — 2 days ago

Gemma 4 on iOS: Anyone else stuck on CPU because of the "Buffer(31)" Metal crash?

Hey everyone,

I’m hitting a massive performance wall building an on-device AI app for the iPhone 17 Pro. I’m using MediaPipeTasksGenAI via CocoaPods to run Gemma 4 E2B, but the inference is incredibly slow.

Looking at the logs, it’s constantly defaulting to CPU fallback. The GPU initialization fails every time with this specific Metal compiler error:

'buffer' attribute parameter is out of bounds: must be between 0 and 30. device half4* src_tensor_buffer[[buffer(31)]]

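It seems like Gemma 4’s graph binds more buffers to a single kernel than the standard MediaPipe GPU delegate can handle, running past Metal’s limit of 31 buffer slots (indices 0 to 30) per shader function. As far as I can tell, that limit is part of Metal itself rather than something specific to Apple’s newer chips. It’s frustrating because the official Google AI Edge Gallery app is blazing fast on the same hardware.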
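
In case it helps anyone reproduce this, here is a minimal Python sketch for checking generated shader source for the same problem. It is not part of MediaPipe; it assumes you can get the generated Metal source as text (for example, copied out of the error log), and the function name and the sample kernel (including `dst_tensor_buffer` and `ComputeFunction`) are made up for illustration. The only fact it relies on is the 0 to 30 range stated in the compiler error.

```python
import re

# Highest buffer index Metal accepts, per the compiler error quoted above.
MAX_BUFFER_INDEX = 30

# Matches attributes such as [[buffer(31)]] in Metal Shading Language source.
BUFFER_ATTR = re.compile(r"\[\[\s*buffer\(\s*(\d+)\s*\)\s*\]\]")


def out_of_range_buffers(msl_source: str) -> list[int]:
    """Return the sorted, de-duplicated buffer indices that exceed the limit."""
    indices = {int(m.group(1)) for m in BUFFER_ATTR.finditer(msl_source)}
    return sorted(i for i in indices if i > MAX_BUFFER_INDEX)


# Hypothetical kernel signature, modeled on the line from the error message.
sample = """
kernel void ComputeFunction(
    device half4* dst_tensor_buffer[[buffer(0)]],
    device half4* src_tensor_buffer[[buffer(31)]]) {}
"""

print(out_of_range_buffers(sample))  # [31]
```

If this reports anything for a given kernel, that kernel cannot compile as written, which would explain the CPU fallback.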

Has anyone else run into this? If so, how did you fix it?

•	Did you pivot to the LiteRT-LM path (Google's newer engine) despite the lack of Swift bindings?

•	Or did you jump ship to MLX-Swift for a more native Apple Silicon approach?

Would love to hear if anyone has successfully bypassed this "Buffer 31" ceiling!

How are you currently handling on-device Gemma 4 inference in your projects?

u/One-Kraken — 1 month ago