
I made my own organization on huggingface for soley releasing low size distills of bigger models
I recently started my own Hugging Face org called CoNDeNse-AI focused on making smaller, lightweight distilled AI models that are easier to run on normal hardware 🙌
Org: https://huggingface.co/CoNDeNse-AI
Most of the training is done on Kaggle using 2x T4 GPUs, so a big part of the project is figuring out how to get the best possible results from limited hardware. Because of this, we unfortunately can’t currently make proper distills based on newer/larger Qwen 3.5 base models since Kaggle struggles heavily with them during training and distillation.
Some current projects are:
- GLM-5.1-Qwen3-1.7B-CoNDeNse
- GLM-5.1-Qwen3-0.6B-CoNDeNse
- GLM-5.1-Qwen3-1.7B-CoNDeNse-GGUF
The 1.7B versions mainly focus on preserving reasoning, coding, and multilingual capabilities while reducing overhead, while the 0.6B variant is more focused on accessibility and lower-end hardware support. The GGUF release is aimed at easier local inference in things like llama.cpp and LM Studio 💻
The org is still very experimental, so alongside proper releases there are also research checkpoints, quantization tests, and random experiments that may or may not work 😅
Would love feedback from people working on low-resource training/distillation setups.