u/Early-Importance8582

Personal continual learning for LLMs without GPU — position paper [OC]

I proposed two architectures for enabling LLMs to learn daily from personal interactions:

Internal KV-Sphere Architecture (IKSA)

Background Micro Fine-Tuning (BMFT)

Both work with zero GPU and zero catastrophic forgetting.

Full paper:

huggingface.co/spaces/Persak/continual_learning_position_paper

https://github.com/paras2l/Continual-Learning-in-Large-Language-Models-.git

https://zenodo.org/records/20234100?token=eyJhbGciOiJIUzUxMiIsImlhdCI6MTc3ODkzODg2NiwiZXhwIjoyNTM1NzUzNTk5fQ.eyJpZCI6IjY4OTMxZTBmLWM0YTQtNDg2ZC05OGJhLTk0ZDQ2ZTVjNDJkOSIsImRhdGEiOnt9LCJyYW5kb20iOiJkYmQwM2ExZjk4ZmZiNWM1NTFlNDZlN2QzNTY5ZTA0YiJ9.n5VgFWg5SsC5L6KvZGZhsSK_lll4syeSnvghb6uyAKBAZiOyd15Ov_Ps6awungKdfVsdEE0GuvOWggspQuQDfw

Twitter thread: https://x.com/ParasLashkarin/status/2055644988592247081?s=20

Looking for researchers to validate or disprove these ideas! — Paras Lashkari

u/Early-Importance8582 — 4 days ago
▲ 3 r/AIDeveloperNews+1 crossposts