u/Strange-Dimension675

Wordlist generator graph based, Wordnet + LLM and WikipediaApi

Wordlist generator graph based, Wordnet + LLM and WikipediaApi

Hi all, I built a wordlist-generator that uses a semantic knowledge graph instead of pure string manipulation.

You feed it a list of keywords and it builds a hypernym DAG using WordNet, expands it with LLM-generated hyponyms, scores leaf pairs by semantic similarity (Wu-Palmer), and permutes synonyms to produce the final wordlist. For terms WordNet doesn't know (brand names, games, slang) an LLM iteratively finds a valid hypernym using a Wikipedia summary as context.

The wordlist use case is the obvious one, but honestly the core engine is just a semantic expander: given a few seed words, it grows a contextually coherent vocabulary around them. I can see it being useful for:

- NLP / ML — data augmentation, building domain-sp ecific vocabularies, corpus enrichment

- Ontology / knowledge graphs* quick concept mapping from a small seed set.

Supports OpenAI or local models via llama.cpp.

Code: https://github.com/ivegotanheadache/WonaBee

Curious if anyone sees other uses for this kind of approach and likes it

There is a Proof of Concept in the repo that’s shows how, when given [“Cyberpunk2077”, “Rabbit”] as input, it completely excludes the combination between the two due to semantic incorrelation

Wordlist generator based on WordNet graphs + LLM

Hi all, I built a wordlist-generator that uses a semantic knowledge graph instead of pure string manipulation.

You feed it a list of keywords and it builds a hypernym DAG using WordNet, expands it with LLM-generated hyponyms, scores leaf pairs by semantic similarity (Wu-Palmer), and permutes synonyms to produce the final wordlist. For terms WordNet doesn't know (brand names, games, slang) an LLM iteratively finds a valid hypernym using a Wikipedia summary as context.

The wordlist use case is the obvious one, but honestly the core engine is just a semantic expander: given a few seed words, it grows a contextually coherent vocabulary around them. I can see it being useful for:

- NLP / ML — data augmentation, building domain-sp ecific vocabularies, corpus enrichment

- Ontology / knowledge graphs* quick concept mapping from a small seed set.

Supports OpenAI or local models via llama.cpp.

Code: https://github.com/ivegotanheadache/WonaBee

Curious if anyone sees other uses for this kind of approach and likes it

reddit.com
▲ 3 r/GuyCry

I’m perfectly happy with my own life and myself but I miss having someone everyday at my side

Dunno but I’ve reached a point where I’m kinda satisfied by my academic results, I’m going through my project, doing sport, etc blabbla. But I miss the constant presence of my male best friend, now in a relationship. Yes I had a crush on him, he kinda has being rude with me just because people thought he was betraying his gf with me (obviously nothing similar happened) and we got distant. I’m sad.

reddit.com
u/Strange-Dimension675 — 9 days ago

I’d like to get a tattoo of the enso symbol, with an internal **無常** and external **不屈I like the two proverbs “ Mono no aware 物の哀れand **futo fukutsu (**不撓不屈**) but it would be too much for a simple tatto so I thought about only one kanji for each

reddit.com
u/Strange-Dimension675 — 23 days ago