
OCR Buddy, my fully-local OCR extension, is closing in on 1000 users a week in. Here is what shipped from the Reddit community's feedback
I shipped OCR Buddy about a week ago and it is almost at 1000 users. I did not expect that pace, so mostly I want to say thanks, because most of what shipped since launch came from the Reddit community's feedback after I posted early versions here.
What it does: select any region on a page and pull out the text, code, or math, fully on-device. Nothing leaves the browser. It is built around faithful recognition, so it does not invent text that was not there.
What is new since launch:
- Page to Markdown: export a whole page as clean Markdown, copy or download
- Viewport and full-page capture: OCR the visible area or the entire scrolling page, with the tiles merged back together (handling seams and repeated sticky headers)
- Coloured text on light backgrounds, which was broken before, now reads correctly
- A few smaller fixes around restricted pages and first-run model download
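For anyone curious about the tile-merging step in full-page capture, here is a minimal sketch of the general idea. It is not the extension's actual code (which runs in the browser); it is written in Python for brevity. The function names (`merge_tiles`, `seam_overlap`, `common_prefix_len`) and the input shape (one list of recognised text lines per tile) are assumptions for illustration. It also assumes repeated lines match exactly, which real OCR output will not always do, so a real implementation would likely need fuzzy matching.

```python
from typing import List


def common_prefix_len(tiles: List[List[str]]) -> int:
    """Count the leading lines shared by every tile (a repeated sticky header)."""
    if len(tiles) < 2:
        return 0
    n = 0
    for lines in zip(*tiles):
        if len(set(lines)) != 1:
            break
        n += 1
    return n


def seam_overlap(merged: List[str], nxt: List[str]) -> int:
    """Largest k where the last k merged lines equal the first k lines of nxt."""
    for k in range(min(len(merged), len(nxt)), 0, -1):
        if merged[-k:] == nxt[:k]:
            return k
    return 0


def merge_tiles(tiles: List[List[str]]) -> List[str]:
    """Merge per-tile OCR lines into one list, top to bottom.

    Hypothetical helper: keeps the sticky header once (from the first tile)
    and drops lines duplicated across the seam between consecutive tiles.
    """
    if not tiles:
        return []
    header = common_prefix_len(tiles)
    merged = list(tiles[0])
    for tile in tiles[1:]:
        body = tile[header:]          # strip the repeated sticky header
        k = seam_overlap(merged, body)
        merged.extend(body[k:])       # skip lines already captured at the seam
    return merged


tiles = [
    ["Site Nav", "Intro", "Line A", "Line B"],
    ["Site Nav", "Line B", "Line C", "Line D"],
    ["Site Nav", "Line D", "Line E"],
]
result = merge_tiles(tiles)
```

In this example the "Site Nav" header is kept once and the lines that straddle each seam ("Line B", "Line D") appear only once in `result`. One known weakness of exact matching is that a page with genuinely repeated lines at a seam could be over-deduplicated.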
If you have more feedback, I would love to hear it and will try to implement it, as long as it is doable and fits the nature of the extension. (Also, a new version, 2.5.6, is coming soon with some small fixes.)
If you want to try it or look at the code (it is open source, MIT):
- Site: https://www.ocr-buddy.com/
- GitHub: https://github.com/Fanfulla/OCR-buddy
Happy to answer anything, especially on the in-browser side, which was the hardest part to get right.