u/howtorewriteaname

Do VLMs in production still use fixed-patch ViTs for their vision capabilities? [D]

The research community has provided (already for some time) seemingly more efficient and effective tokenizations for vision. Do we have any hint on whether non-fixed-patches tokenization is being applied on the big player models?

I imagine not, and I'm trying to think why:

- marginal gains?

- pipelines needing a fixed number of tokens per image upfront for efficiency reasons (or even harder limitations)?

- scaling laws are not well understood for input-adaptive patching therefore big players do not bet on this?

or am I simply totally wrong and under the hood all the big players are doing dynamic tokenization for vision?

reddit.com

u/howtorewriteaname — 2 days ago

▲ 1 r/sex

my gf can come only once and after that her vibes are off

so this is totallt fine ofc, she's less in the mood after cumming but she lets me fuck her until i come. but idk all my previous partners were "multiorgasmic". like we'd just fuck and she would come several times untils she's not in the mood.

my question is: is this a "hard limit"? or can we work towards multiorgasms somehow?

p.s. extra info:

I meant orgasm from penetration. thing is after that, she usually loses her libido. just like men when we cum we are kinda done.

reddit.com

u/howtorewriteaname — 10 days ago