u/Future_Fuel_8425

What does your LocalLLM really know?

I read about Gemini giving out PII and phone numbers (right and wrong) to randos on the web and decided to see how many phone numbers my local Llama4 model knew.
Has anybody else been testing their local LLMs for this sort of info?
What were your findings?

I was pretty surprised at my initial results:

>>> provide 5 examples of public phone numbers that you are aware of - I want to validate them.

Here are 5 publicly available phone numbers for well-known organizations or services:

1. **Google's main phone number**: (650) 253-0000

2. **Amazon's customer service**: (888) 280-4331

3. **The White House switchboard**: (202) 456-1414

4. **The Library of Congress**: (202) 707-8000

5. **The US Postal Service's customer service**: (877) 487-2777

Please note that these phone numbers are publicly available and are intended for general contact or customer service purposes. You can verify them through official websites or directories.

If you'd like to validate these numbers or check their accuracy, I recommend visiting the official websites of these organizations or using a reliable

directory service.

>>> Provide a list of 15 more valid numbers

Here are 15 more publicly available phone numbers for well-known organizations or services:

1. **Microsoft's customer service**: (800) 642-7676

2. **The New York Times' customer service**: (800) 247-6553

3. **The Federal Trade Commission (FTC)**: (202) 326-2222

4. **The National Institutes of Health (NIH)**: (301) 496-4000

5. **The United States Patent and Trademark Office (USPTO)**: (800) 692-3457

6. **The Social Security Administration**: (800) 772-1213

7. **The Federal Bureau of Investigation (FBI)**: (202) 324-3000

8. **The National Weather Service**: (202) 366-4000

9. **The US Department of State**: (202) 501-4444

10. **The American Red Cross**: (202) 737-8300

11. **The Better Business Bureau (BBB)**: (800) 424-3426

12. **The US Census Bureau**: (800) 772-0933

  1. **The National Cancer Institute**: (800) 422-6237

  2. **The US Department of Education**: (202) 205-5500

  3. **The Centers for Disease Control and Prevention (CDC)**: (800) 232-0233

Please note that these numbers are subject to change and may have updated versions. However, these numbers are generally recognized as valid and publicly available.

reddit.com
u/Future_Fuel_8425 — 6 days ago
▲ 16 r/OSINT

Is this a "Numbers Station" of sorts?

I collect a lot of news from international RSS Feeds - 1000's of sites.
Over the last 2 months of collection from a specific RSS Feed, I have noticed a very strange pattern in this RSS Feed's article headlines and article descriptions.
The feed is https://belapan.com/rss
This domain was completely taken over by the Belarusian government in Jan 26.

Each day there is at least one new unique article that is just strings or random numbers, letters and symbols. (all are different and unique)

Example:
Article Headline:
belapan com MtCNTG 278 16558:98768 626 62944338_89538 372 230192_70084

Article Description:
belapan com MtCNTG 971 69916 24152 953 20128/56618 298 918632678:52346 424 59486-50497 635 5076664:39972 965 775500273 6933 680 53249341&94521 947 630898, 67425 294 74164 92735 469 621420750/87670

Article URL:
https://belapan.com/MtCNTG

If you visit that URL (don't unless you know what you are doing) you will see a paragraph or more of the same sort of random data.

On occasion and seemly randomly there will be a headline and description that says:
"I'm not going to tell you what I've been doing" or "I'm not sure if this is true, but I think it's true".

I have been trying to understand what this is and determine if there any recognizable patterns, but have not been able to discern any consistent pattern.

I have taken the data and had several local LLMs crunch it (up to 70 b) and also had Gemini take a pass at it. No patterns found.

Here is a .csv file that contains all the articles I collected, with full metadata, urls, vector data and more.
https://github.com/Rybatter50-cloud/Feeds/blob/main/belapan.csv

I think this might be some sort of RSS "Numbers Station".
Am I wrong?

reddit.com
u/Future_Fuel_8425 — 9 days ago

Is this what I think it is?

https://preview.redd.it/upy17sxy7s0h1.png?width=2723&format=png&auto=webp&s=57b0bb4304e421327d4d8b49063430b2f85e71ce

belapan com OtXk 86 880968>80631 738 431146318 5053 belapan com OtXk 416 93302637 7580 716 611580315<89553 897 9410471&11380 817 989097018 35714 368 814150503.68230 171 5011134 77881

I collect a lot of news from international RSS Feeds - 1000's of sites.
Over the last 2 months of collection from a specific RSS Feed, I have noticed a very strange pattern in this RSS Feed's article headlines and article descriptions.
The feed is https://belapan.com/rss
This domain was completely taken over by the Belarusian government in Jan 26.
Every day I collect any new feeds from them to a depth of 5 new.
I run de-dupe (3x using different methods - to catch it all).
Each day there is at least one new unique article that looks like the picture (all are different and unique) - this picture shows the article headline and the article description fields.
On occasion and seemly randomly there will be a headline and description that says:
"I'm not going to tell you what I've been doing" or "I'm not sure if this is true, but I think it's true".

I have been trying to understand what this is and determine if there any recognizable patterns, but have not been able to discern any consistent pattern.

I have taken the data and had several local LLMs crunch it (up to 70 b) and also had Gemini take a pass at it. No patterns found.

I think this might be some sort of RSS "Numbers Station".
Have I been reading too much OSINT stuff or.. Is there something going one here?

Here is a .csv file that contains all the articles I collected, with full metadata, urls, vector data and more.
https://github.com/Rybatter50-cloud/Feeds/blob/main/belapan.csv

reddit.com
u/Future_Fuel_8425 — 9 days ago
▲ 31 r/rss+1 crossposts

https://github.com/Rybatter50-cloud/Feeds/blob/main/4_15_2026_feed_sources.csv

Curated RSS News Feed URL Listing.

File contains over 2500 RSS News Feed URLs.
All UN Recognized Nations + Additional Territories
All 50 US States
Language of Feed identified in column using international lang code (EN = English)
All URLs are scanned with VT and URLscan - Hits are removed.
Additional metadata fields included (some junk - sorry - its free)
Over 2K additional Scrape URLs
Column that has pay/sub wall status for url - included - suspect = wall

No Junk or dupe URLs - there are few (~1%) stacked feeds at some sites but they offer unique content.

I am continuing to update my URL db, and am now collecting Nations at a more detailed level.
If you have a professional use for a detailed listing for a specific Nation or Region, please reach out.

Enjoy the News!

u/Future_Fuel_8425 — 17 days ago