u/SuggestionMission516


Google's latest creation: Gemini 3.5 Flash vs all

https://gemini.google.com/share/c2a187275e26 archive link
https://claude.ai/share/8383747a-aaf1-4f6c-a516-0e839f46a698
https://grok.com/share/bGVnYWN5_3c63e371-eb9d-46c3-8ba2-0c745c6795a2
https://chatgpt.com/share/6a0f1e13-a0c8-8328-b989-1ac51b92e81c

same prompt
"""
300+140=460

Is this correct?

Breakdown?
"""

Remember guys. #1 in Finance Agent v2. SOTA performance right here.

Edit: For control, I explicitly tested all other models with minimal thinking effort too.

u/SuggestionMission516 — 5 hours ago
▲ 61 r/codex

What an unlucky week! There's still a 6% chance GPT 5.5 just randomly became this dumb, so it's not *statistically significant (3.2%)* yet.

Well, I guess all my previously working harness setups were just down to chance too.

I believe in OpenAI, and I will continue supporting them. There's absolutely no way they gave us the quantized model!

/s

What a disappointment, OpenAI. The last month of codex has been great, you just had to do this.

Reference: https://marginlab.ai/trackers/codex/
Math:

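import numpy as np
from scipy import stats

n = 150    # number of tasks per run
p0 = 0.56  # baseline pass rate

# Standard error of a proportion under the null hypothesis
se = np.sqrt(p0 * (1 - p0) / n)
print(f'SE = {se:.5f}')
print()
print('Probability under H0 (true rate = baseline p0=56%):')

# Normal-approximation tail probability for each observed deviation
for thresh in [0.075, 0.0794, 0.063]:
    z = thresh / se
    p_one = 1 - stats.norm.cdf(z)
    p_two = 2 * p_one
    print(f' beyond +/-{thresh*100:.2f}%: z={z:.3f} -> one-sided {p_one*100:.2f}% , two-sided {p_two*100:.2f}%')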
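
The normal approximation above can be cross-checked against an exact binomial tail. This is only a sketch: it assumes the 150 tasks are independent pass/fail trials at the baseline rate, and the helper name `binom_lower_tail` is made up for illustration, not something from the tracker.

```python
from math import comb, floor

def binom_lower_tail(n: int, p0: float, drop: float) -> float:
    """Exact P(X <= k) for X ~ Binomial(n, p0).

    k is the largest pass count whose pass rate sits at least `drop`
    below the baseline p0 (assumption: a "drop" is measured this way).
    """
    k = floor(n * (p0 - drop))
    return sum(comb(n, i) * p0**i * (1 - p0) ** (n - i) for i in range(k + 1))

n, p0 = 150, 0.56  # same values as in the post
for drop in [0.075, 0.0794, 0.063]:
    p = binom_lower_tail(n, p0, drop)
    print(f"drop >= {drop*100:.2f} pts: exact one-sided p = {p*100:.2f}%")
```

Because pass counts are discrete, the exact tail will not match the z-based numbers digit for digit, but it should land in the same ballpark.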

u/SuggestionMission516 — 3 days ago
▲ 107 r/codex+1 crossposts

How to check:
Go to chatgpt.com, press F12, switch to the Network tab, then filter for user_segments.

I genuinely don't know how I got coding_power_user:true. Must be me spamming gsd flow nonstop and somehow triggering the algorithm lmao.

Also, is anyone here assigned professional_user?
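
If you'd rather not eyeball the Network tab, you can export it as a HAR file and search that instead. A rough sketch, assuming the standard HAR layout (`log.entries[].request.url` and `response.content.text`); the function name and the file name in the usage comment are hypothetical, and some browsers base64-encode response bodies, which this does not decode.

```python
import json

def find_user_segments(har: dict, needle: str = "user_segments") -> list[dict]:
    """Return HAR entries whose URL or response body mentions `needle`."""
    hits = []
    for entry in har.get("log", {}).get("entries", []):
        url = entry.get("request", {}).get("url", "")
        # Response text can be missing or None for some entries
        body = entry.get("response", {}).get("content", {}).get("text") or ""
        if needle in url or needle in body:
            hits.append({"url": url, "body": body})
    return hits

# Usage (hypothetical file name for the exported HAR):
# with open("chatgpt.com.har", encoding="utf-8") as f:
#     for hit in find_user_segments(json.load(f)):
#         print(hit["url"])
#         print(hit["body"][:500])
```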

u/SuggestionMission516 — 21 days ago

I don't see anything even after the session is complete. I haven't used the Pro model on web for a while, and now every new 5.5 Pro session no longer shows thinking traces.
I can still see the thinking traces for my old 5.4 Pro sessions in chat history.

Is it just me?

u/SuggestionMission516 — 21 days ago