u/Alarmed-Poet-5722

r/AskReddit r/deeplearning r/mathematics

If transformers struggle with math, is the real issue model size or the fact that we’re feeding them a notation they were never built to learn?

Human math notation is full of things transformers dislike: implicit structure, overloaded symbols, non‑canonical forms, and surface‑level transformations that hide the underlying graph.

I’m exploring whether small models reason better when math is represented in a canonical, explicit, graph‑native format. something closer to a transformer’s inductive biases than traditional notation.

Curious whether anyone has experimented with structured math tokenization, graph‑encoded expressions, or transformer‑friendly symbolic IRs in local models

reddit.com

u/Alarmed-Poet-5722 — 22 hours ago

▲ 0 r/mathematics+1 crossposts

Why might the way we represent mathematics shape what LLM are ultimately able to reason about — and is it possible to engineer a truly transformer‑native mathematical notation?

reddit.com

u/Alarmed-Poet-5722 — 1 day ago

u/Alarmed-Poet-5722

If transformers struggle with math, is the real issue model size or the fact that we’re feeding them a notation they were never built to learn?

Why might the way we *represent* mathematics shape what LLM are ultimately able to *reason about* — and is it possible to engineer a truly transformer‑native mathematical notation?

Why might the way we represent mathematics shape what LLM are ultimately able to reason about — and is it possible to engineer a truly transformer‑native mathematical notation?