u/Historical_Towel2793

Hello everyone,

As a part of my research problem, I want to fully access a model to implement LLM text watermarking (I need open-weight model to modify hidden states). After researching a bit, I found Qwen2.5-0.5B model that satisfies my requirements. I would like to know:

  1. if this model is overall good in language reasoning
  2. is 0.5B model too small? I cannot use large models due to complexity and hardware requirement and at the same time, I do not want weak semantics as I will be working with modifying vector embeddings
  3. are there any better open-models that people usually use for research?

Any suggestions or information regarding llm text watermarking and go-to open models for these research problems is appreciated.

Thanks for your time!

reddit.com
u/Historical_Towel2793 — 24 days ago