u/Historical_Towel2793 — reddlx

Hello everyone,

As a part of my research problem, I want to fully access a model to implement LLM text watermarking (I need open-weight model to modify hidden states). After researching a bit, I found Qwen2.5-0.5B model that satisfies my requirements. I would like to know:

if this model is overall good in language reasoning
is 0.5B model too small? I cannot use large models due to complexity and hardware requirement and at the same time, I do not want weak semantics as I will be working with modifying vector embeddings
are there any better open-models that people usually use for research?

Any suggestions or information regarding llm text watermarking and go-to open models for these research problems is appreciated.

Thanks for your time!