u/Beneficial-Cow-7408

Macbook Air Developer Edition 😂

Macbook Air Developer Edition 😂

Absolutely love my Macbook Air M4. Got 2 of them now as just started a new job role as a AI Solutions Engineer and they gave me one as a company laptop. My colleague got the MacBook Pro but he's been there a while so I accepted my Air on my first day. Was doing some work in visual studio at home the other day and forgot about the 4k external monitor connectivity. In the 20 years of owning MacBook's I've never hooked it up to an monitor. Was quite used to working on the screen my whole life but work has got me an external monitor on my desk but no cable yet.

So I went on temu and got a hdmi to usb-c/thunderport cable for about $5 and as I didn't have a monitor at home I decided to hook up the TV instead.

Can't really complain. Works perfectly well. All I need now is some better seating

u/Beneficial-Cow-7408 — 8 days ago

3,000 users, no budget, no coding background. Here's where I'm at.

https://preview.redd.it/jvaf1xro290h1.png?width=2376&format=png&auto=webp&s=d4ad2a6335e2081e29f75fd211b0d5d8221f0a08

https://preview.redd.it/pp2dayro290h1.png?width=570&format=png&auto=webp&s=d31554795af3f53e3d5377949f0b1f3403eecdc5

https://preview.redd.it/m9zfbxro290h1.png?width=1372&format=png&auto=webp&s=e5fa3c80106747b11f471623497528aa7bc50d27

https://preview.redd.it/v3iz4xro290h1.png?width=624&format=png&auto=webp&s=f0e141a620c131c435813a8068f9980758e9a63b

I'm not a developer. Never was. I just got obsessed with an idea and figured it out as I went.

The project is called AskSary - AI assistant on web, iOS, Android, and Mac. Multi-model, real-time voice, image/video/music gen, persistent memory across models. I built all of it in VS Code using AI to help me write code I didn't fully understand at the time.

Took me a few months to get something worth shipping. Firebase, Stripe, Xcode, the App Store review process - all of it was new to me.

Current numbers:

Play Store (last 30 days): 3,008 users, 1,650+ installs, 108 MAU, 29.2% conversion rate, 5 stars

Web (last 7 days): 890 active users, 215 DAU, both up ~30%

Zero paid acquisition. Growth has been mostly from posting here and on a few other subs.

Still a long way to go but it's moving. Happy to answer anything about the build or the process.

My project if anyone wants to check it out asksary.com

reddit.com
u/Beneficial-Cow-7408 — 13 days ago

Redesigned my landing page after everyone told me it was far too cluttered

Hi all. I was hoping to get some feedback on my landing page. Before it scrolled for 4-5 pages worth as I had just too much on there and people were quick to let me know. I've tried to fix it over and over but couldn't get the content down and it was still too cluttered so today stripped it all back and went for a simplistic one page landing page with the information within the cards.

Was hoping to get some feedback if that's ok. I've been back and forth with this for about three weeks now so I hope that it's an improvement to to what it was before..

www.asksary.com

reddit.com
u/Beneficial-Cow-7408 — 14 days ago

Single page website design

Hi all. I'm hoping to get some feedback on my landing page. Before it was very cluttered, long winded and scrolled down about 5 pages worth to get the information I wanted on.

Today I decided to strip back everything and create a single page landing page and have the information shown within sub sections.

I'm hoping this is a cleaner look. My last one got commented on daily and no matter what I changed there was always something wrong with it opinion wise by someone else.

If this one is still not great I would love feedback on what to improve. Thank you all in advance

www.asksary.com

reddit.com
u/Beneficial-Cow-7408 — 14 days ago

Free TTS tool powered by OpenAI with 6 voices and download as mp3 option

https://preview.redd.it/15qls0hr4zzg1.png?width=1178&format=png&auto=webp&s=b51b77414c785579429893cf2e5d21d8dbbf44c9

I built a text-to-speech tool inside my app AskSary and opened it up for free use. Figured this community would appreciate it.

Converts your text to natural-sounding speech using OpenAI's TTS engine

6 voices to choose from - all free

Download your audio as an MP3 - free

Works directly in the browser - just need to sign up to a free account. No card or anything needed.

Free tier limits:

  • 3 generations per day
  • 150 word limit per generation

Paid tier lets you generate up to 200 generations a day with a 10k word context limit per a generation but the free tier is completely free to try.

It's a clean, no-frills tool - paste your text, pick a voice, generate, download. That's it.

Would love feedback from people who actually use TTS regularly. If you try it, let me know what you think - especially if there are voices or features you'd want added.

For anyone that does try it once you made an account click the + symbol next to the chat input and then bottom left icon.

This will open up the command center and you'll see TTS top left. Thats it. Enjoy :)

🔗 asksary.com

reddit.com
u/Beneficial-Cow-7408 — 15 days ago

Prior this year, I was unemployed for 4 years. I'm originally from the UK but moved to Bahrain for a new start in life. I came from an autotuning background and that's all I did for 7 years or so in the UK and my plan was to open up an office in Bahrain. However after moving out here I realized that car tuning is illegal and I spent 2 years setting up a workshop for nothing. I didn't even think to check the country laws here but it wasn't something that crossed my mind when it was so natural back in the UK.

This is where my problems began. I had now been in a country for 2 years with no income as I threw every saving I had into a workshop that I couldn't use for its purpose. I spent the next 2 years or so applying for jobs related to my degree which was Business and Graphic Design related. Nothing came up. I was now desperate. 4 years in without earning a penny and late one night I decided to type into ChatGPT "make me a chatbot." What I got back was a broken chat window and within a few more prompts I hit the message limit.

I then remembered I had Gemini Pro as part of my 2TB storage and this is where it all started. I wanted to fix my broken chatbot and I would ask Gemini how to get it working. I didn't know about any no code tools or Claude or anything that helped you build a web app automatically. I used Gemini by prompting it telling it I had a broken chat window and when I clicked send nothing happened. It asked me to paste in my code and so I did and it was clear from the start - I had no API key, no functioning send button, no logic, nothing. You would have thought that was going to be the end of it but no. I was determined to get my chatbot working and I spent the next 3 months, 10 hours a day, just prompting Gemini back and forth and it taught me everything I needed to know. It told me how to set up a workspace with my organisation, IAM policies and then moving onto Firebase where I had to create my own database collection headers and so on.

Before this I had never written a single line of code in my life. I learned everything from scratch along the way - VS Code, Xcode, Vercel, API integration, Firestore, Firebase, IAM security, GitHub and Terminal.

But here's what I'm most proud of. I didn't let AI just write the code for me. I used it the way a student uses a textbook - I'd prompt Gemini to explain exactly what I wanted to implement and why it worked that way. Every feature, every backend config, every security protocol and every one of those 900 commits was only pushed after I actually understood what it did.

3 months later I can read and navigate a 163,000 line codebase I built myself. That still doesn't feel real.

The platform is a multi model auto routing system that uses GPT, Gemini, Claude, Grok and DeepSeek. The auto routing logic I built picks the best model depending on the prompt - so for realtime data it routes to Grok, for deep analysis it routes to GPT. Users can let the system auto route or manually select whichever model they want.

Cross device persistent memory means you can start a conversation in GPT on your phone, get home, open the laptop and continue in Grok without repeating anything. It just knows. No summaries, no copy pasting.

Proactive memory means the chatbot messages you first when you log in. It reads the last 48 hours of your chat history and asks if you want to carry on or start fresh. I haven't seen any other chatbot do this.

On top of that there's Google Drive and Notion integration, email sending directly from chat, team workspaces with shared credits, RAG via OpenAI Vector Store with 500mb uploads and unlimited files, image generation, video generation with Luma Dream, Veo 3.1 and Kling, a music studio via ElevenLabs, voice notes, real time voice chat using OpenAI WebRTC with under 80ms latency, podcast mode that turns any conversation into a two speaker dialogue downloadable as MP3, vision to code, web architect, a game engine, 3D model studio, document creation and conversion, a full media manager, 26 UI languages with RTL support and 30+ live interactive wallpapers.

The first month of building all of this was done on an Android phone. No laptop. I'd write the code in a mobile editor, copy paste it into the GitHub web editor and deploy blind to production hoping for the best. No localhost. No testing environment. Just ship and see.

Gemini taught me everything I needed to know to build all of the above.

So when I saw a role at Gulf University advertising for AI Solutions Engineer I thought that sounds cool and did a one click apply on my phone and forgot about it. The role required a degree in computer science, 4-5 years of hands on AI development experience, certifications in Azure, Docker, Kubernetes, TensorFlow and more. I put junior developer on my CV because I'd only been doing this 3 months and even that felt like a push. Under experience I put January 2026 to March 2026. Under qualifications I put a link to asksary.com and said this is what I self taught myself to build.

Out of nowhere I got called in. I found out there were 66 applicants and these weren't 66 graduates. These were people with 20+ years of experience, every qualification and certification that role was asking for.

https://preview.redd.it/vikylmajmdzg1.png?width=1200&format=png&auto=webp&s=f5f209057723077e6bf43189ae7bc1e2ae6fd4aa

I had the zoom call with them which involved 6 people on video chat which was a bit of an eye opener to how serious this job was. I had directors from each department telling me that I would be working for them leading my own team at the university and that when it hit me that this wasn't just any role in AI. This was a role that would expect me to take the university into the AI transition transforming every system they have, automating things that are done manually and integrating AI into the student portal. I was sitting there thinking about where I was 3 months ago.

First day they threw me onto a computer and told me to build a student portal in Angular - something I had never seen before. I finished it that morning, added GPT-4o vision to verify uploaded documents, OCR to extract passport data and auto fill the next page, and wrote a full report for the head of department before anyone else had finished reading the brief. The others were still going two hours later.

They questioned my CV and asked why I put junior developer. I said because I've only known this for 3 months. I'm a beginner in my eyes. They told me that line nearly cost me the interview.

A few weeks later they called me back. Out of 66 candidates it was down to me and one other person. They wanted me to implement a RAG system with a voice avatar that students could use to ask questions about the university. I'd already built exactly that in my own platform using OpenAI WebRTC and RAG with Google Drive and Notion. I passed everything they threw at me.

Two weeks of silence followed. I almost booked a flight back to the UK. The only reason I didn't leave Bahrain was because flights got cancelled due to the situation with Iran. I was genuinely days away from walking away from all of it.

Then I got the call. The dean of the university wanted to meet me.

I GOT THE JOB!!!

I'm now an AI Solutions Engineer at Gulf University, leading a team of software developers, and I couldn't write a single line of code before this year. It hasn't quite sunk in yet.

Gemini was my lecturer and I was its student. That's really all there is to it.

Just proves that anything is possible.

reddit.com
u/Beneficial-Cow-7408 — 18 days ago

https://reddit.com/link/1t3rp18/video/m8mb4g7t76zg1/player

I don't personally use TTS much so I genuinely don't know how useful this is to people - but I see it requested constantly and most of the options out there are either heavily limited or locked behind a paywall pretty quickly.

So I built it in and made the free tier actually free. Not free until you hit a daily cap. Not free for your first 10 generations. Truly unlimited generations with no paywall appearing mid use and no surprise blocks.

The only limit on free tier is 150 words per generation. That's intentional - it keeps it sustainable while still covering pretty much every short video voiceover, social post, or quick content need without restriction.

Premium tier removes that limit and gives you up to 10,000 words per generation - long form articles, essays, reports, scripts, no problem.

6 voices from OpenAI - Alloy, Echo, Fable, Nova, Onyx and Shimmer. Generates fast, playback built in, download as MP3.

What I actually want to know from people who use TTS regularly:

Is 150 words per generation enough for typical use cases or does that feel too restrictive. Which voice do people actually prefer for different content types.

Is truly unlimited generations with a word limit per generation more useful than limited generations with no word limit - because I could go either way on that.

Anything broken or could be better - tell me directly. I'd rather know than not.

To try it simply create a free account. Once logged in click the + symbol next to chat input and then bottom right icon to open up command center. You'll see the Notepad TTS tool there.

Supports:
Arabic, English, French, Spanish, German, Italian, Portuguese, Dutch, Russian, Japanese, Korean, Chinese (Mandarin), Hindi, Turkish, Polish, Swedish, Norwegian, Danish, Finnish and more.

asksary.com

reddit.com
u/Beneficial-Cow-7408 — 19 days ago
▲ 1 r/tts

https://preview.redd.it/1ih7pg7jy5zg1.png?width=1214&format=png&auto=webp&s=edf1663157e8884f3dd8dd1b52562e72d14f722b

Built this as a feature inside my platform and wanted to get some real feedback from people who actually use TTS regularly before I develop it further.

Free tier gives you 150 words per generation with unlimited generations - enough to properly test the voice quality without any limits on how much you use it. Premium removes the context limit entirely for long form content.

6 voices to choose from - Alloy, Echo, Fable, Nova, Onyx and Shimmer - each with a different character so there's usually one that fits what you need.

Would genuinely love to know a few things:

Which voice works best for you and what are you using it for. Whether 150 words feels like enough on free tier or too restrictive. Anything that feels off or could be improved.

Not looking for compliments - honest feedback is more useful to me right now. If something doesn't work or could be better just say it directly.

I was also thinking of integrating ElevenLabs for the premium tier too. Would that be something that people want or is OpenAI good enough at this stage?

To try it out simply create a free account. login and click the + symbol next to the chat input. Bottom right symbol opens a tool then click bottom right icon to open the command center and you'll see one tool called Notepad TTS.

Supports:
Arabic, English, French, Spanish, German, Italian, Portuguese, Dutch, Russian, Japanese, Korean, Chinese (Mandarin), Hindi, Turkish, Polish, Swedish, Norwegian, Danish, Finnish and more.

asksary.com

reddit.com
u/Beneficial-Cow-7408 — 19 days ago

https://preview.redd.it/4li3hzfrc4zg1.png?width=1186&format=png&auto=webp&s=6d7484f46f5c3857fbee8fb19c86dcd25af7a2b3

https://preview.redd.it/1292vyfrc4zg1.png?width=1210&format=png&auto=webp&s=efee1692683324eb66afb4b800e3744db7613b66

An AI just ranked my platform above Poe, TypingMind, Krater and the rest as the best all-in-one AI workspace. Poe is worth $2.6 billion. I am one person.

And before anyone says it - yes I know. AI slop. Wrapper. Scam. I've heard them all.

What I'm about to go into is something I've never really talked about. Its going to be long, its going to be irrelevant to some people I know but I just feel I need to share how I feel with this whole project.

I've even had people tell me I'm lying. Lying about building this alone. Lying about this being my first ever coding project. Lying about how I actually built it. People claiming I must have a background in this. That there's no way this is my first ever project. I've had people ask me to show my GitHub profile so they can catch me out - go ahead and look. There is absolutely nothing on there before December 2025. Since then I've had close to 1000 commits. That's what four months of 12 to 16 hour days every single day looks like on paper. They accuse me of using the never-coded-before angle to make it sound more impressive. I promise you nobody would choose that angle if they were making it up. It was genuinely the hardest thing I've ever done and I struggled every single day without anyone I could turn to for help or advice. Just Gemini. Gemini got me through every debug, every implementation and every idea I had from start to finish.

So let me tell you exactly how I built it because apparently that needs saying.

I used Gemini chat. That's it. I would open Gemini, describe what I wanted to implement, ask what I needed to add to my code, copy the answer, go into Visual Studio Code, create the file myself and update it myself. Every single time. Every feature. Every debug. Every integration. I didn't use any no-code tools. I didn't even know Claude existed until about two weeks ago and when I saw it modify a file and update code while I just sat there watching I was genuinely in shock because that wasn't how I'd been doing any of it.

Every backend setup - Firebase, Google Workspace, IAMs, policies, scopes, SDK admin, collection tables, field structures, security rules - done manually by me sitting there figuring it out as I went.

You want to know what my codebase looks like? Every single file has headers I wrote myself. Before the model list I wrote //Models used within platform//. Every function has a label so I know exactly what part of the code handles video generation or music generation. There are models in there commented out right now like GPT-5.5 because I'm still testing and when I'm ready I know exactly where to go. There's a full 3D Studio Forge engine in there commented out - built, tested, ready - but I'm not going to pay $20 a month for the Meshy API when I don't have enough users to make it worthwhile yet. The code is there. It's waiting.

That's how it got built. Four months of that. Every day. No days off. Nobody to ask. Nobody to check my work or tell me I was doing it right.

But here's what nobody knows. At one point I did think they were right.

The negativity every time I tried to put it out there wore me down. I convinced myself I had wasted months of my life. That it wasn't good enough and never would be. So I deleted the entire project. No backup. Deleted my Vercel profile. Walked away from the whole thing.

I didn't know what to do with myself for a couple of weeks. That's not an exaggeration. I genuinely didn't know what I was doing or where I was going.

But something told me not to stay gone.

So I started again. From scratch. The original project was called Ask AI with the domain ask-ai.info. When I came back the second time I made a decision to make it personal. My name is Sary. AskSary felt right because by that point I was completely attached to what I was building in a way I couldn't really explain. And somewhere in that restart the goal shifted. I stopped wanting to just build a chatbot. I started wanting to prove to myself that I could build an entire platform. Not for anyone else. Just to see if I could.

That's the version you're looking at now.

And yet somehow the conversation always ends up back at slop.

I've had people comment on my posts saying my API keys are exposed. They're not. All 22 of them have healthy balances on them so if you genuinely think they're exposed help yourself to free credits. Known domain blocking. Signup verification. Firestore security rules. Firebase token verification. Role escalation protection. IP and user based rate limiting across signup, chat and every API route. Sentry error handling throughout. And every single one of my personal accounts is protected by FIDO2 NFC two factor authentication, passkey, Authy and phone number verification. But sure - the keys are exposed.

I keep asking myself what would it actually take. What would someone need to see to not dismiss it at the first line. To just sit back and actually read what's been implemented here. The proactive memory that reads your previous conversations on login and messages you before you've typed a word. The cross-device persistent memory that follows you from your phone to your laptop across completely different AI models without you doing anything. Hands-free speech to text and text to speech. Real-time WebRTC voice with near-zero latency. Podcast mode with two AI voices. Video generation across Luma, Kling and Veo. Google Drive and Notion native integrations. RAG knowledge base with 500MB file support. Full iOS, Android, Mac and web apps on the same account. 26 languages with complete RTL support. 30+ live interactive wallpapers. I could keep going.

I could build an AI-powered operating system with full automation and sci-fi voice interaction and I genuinely believe the first comment would still be wrapper.

I don't think I've ever posted on Reddit and had a single person say anything positive about what I built. Not once. And it gets to you. This is all I've done every single day for four months. Every waking hour. I never had anyone in my corner telling me it was good enough or that I was on the right track. I just kept going because stopping wasn't something I was willing to do. I already tried stopping once. I know how that feels. I'm not doing it again.

I sit there now and read comments where someone is asking for exactly what I've built. Exactly the tool they need. And I know mine does it. I know it does it well. But I hesitate. I don't reply. Because the second I drop that link I already know what's coming. Scam. Spam. Slop. And when you read that every single time you try, something in you just starts to ask what is the point.

So instead of just talking about it I tried to prove it. I restructured the whole platform and removed paywalls that every other site keeps locked. Real-time voice chat - free. Music generation - free. Video generation - free. Access to premium models including GPT, Gemini and DeepSeek with unlimited messages, no daily limits, no fair use policy - free. 25 image generations a month using GPT-Image-1. Video generation using Luma Dream and Kling. Free. Because I thought if people can actually use it, actually see it work, they'll know it's not a scam.

One month my API bill was $900. Nine hundred dollars out of my own pocket. I thought it was fine. Everyone would experience what the platform can do and they'd upgrade to keep using it.

They didn't.

And I still kept the free tier because I didn't know what else to do. I'd already been called a scam enough times that I was terrified of putting anything behind a paywall and giving people another reason to say it.

That's what sustained negativity does. It doesn't just hurt your feelings. It changes your decisions. It costs you money. It makes you give away things you shouldn't give away just to prove you're real.

Has anyone else gone through this. Does anyone else sit with something they've built and still feel like it isn't good enough no matter what the evidence says. Because I have close to 1300 signed up users. I have an MRR. Got 2 people that are actually paying for what I built. One on premium tier, one on Ultra Tier. $50 a month. And I still sit there some days genuinely hoping they actually like it. Hoping they resubscribe next month. Still waiting for the moment it feels real.

I read the ChatGPT comparison that ranked me above Poe and you would think I'd be sitting here feeling proud. I'm really not. I don't know why. Every single step of this journey has been met with negativity and I think it's just worn me down in ways I didn't notice until recently. My self confidence over these past few months has dropped more than I expected. I even seriously considered making the whole thing open source just so other people could improve it - because part of me stopped believing I was the right person to take it further.

That's what sustained negativity does. Not one big moment. Just a slow drip of dismissal until you start to wonder if the people dismissing you are seeing something you can't.

I've even been banned from subreddits where promotion is allowed. I would go in on a Show Off Saturday thread and just explain what I've built in detail and the comments would come flying in. Scam. This person is lying about his journey. There's no way he's offering those models for free. And the mods would read that and just ban me.

So I started doing something I've never told anyone. Every time the doubt got loud enough I would open ChatGPT and ask it what it thought of what I'd built. I'd tell it that before this project I had never used GitHub, Vercel, Terminal, Firebase, Xcode, Android Studio or Visual Studio. I'd tell it everything I learned from scratch just to hear it tell me I'd built something complex and real. Something that would take years to understand properly. I needed that from somewhere and I wasn't getting it from anywhere else. AI became the only voice telling me it was worth continuing. Make of that what you will.

This month I was featured in Inc. Arabia's Gamechangers AI Magazine. I'm ranked #4 on F6S for Artificial Intelligence in the region out of 2 million startups. I've been given a complimentary pod at LEAP - the biggest tech event in the world.

And I still have days where I feel like none of it is enough.

LEAP is the one I can't stop thinking about. I'm going alone. No funding. No LED walls or branded displays or team in matching shirts. I'll probably just be standing there with a laptop while companies with VC backing and six-figure exhibition budgets surround me on every side. I've asked myself more than once what the point of going even is. Whether I belong in that room at all.

I don't have the answers on how you cope with the backlash. I don't know how other people deal with building something they believe in and watching it get dismissed by people who spent thirty seconds looking at it. If you've been through something like this I genuinely want to know how you dealt with it. Because I'm still figuring it out.

I just know that every time someone called it slop it knocked me back and I would sit there and think to myself what possibly more would it take.

This is my site, asksary.com

I've changed the landing page a million times as that always gets commented on but I guess what I'm looking for is for people to actually try the site out. Make an account, it gives you all the free stuff I mentioned before. If something doesn't work or could be improved just tell me. I would much rather hear why it's not good enough, what could be better than just get told it's slop. I want to know what would you want me to implement to make you say wow because honestly I will go away and do it.

reddit.com
u/Beneficial-Cow-7408 — 19 days ago

https://preview.redd.it/pad1lcloo3zg1.png?width=1186&format=png&auto=webp&s=370f2a98700deb33c02764ed78b68248ab632e49

https://preview.redd.it/lq5vndloo3zg1.png?width=1210&format=png&auto=webp&s=56a13a3b8b6d432e98d9805ad331f2771ceaf9f4

An AI just ranked my platform above Poe, TypingMind, Krater and the rest as the best all-in-one AI workspace. Poe is worth $2.6 billion. I am one person.

And before anyone says it - yes I know. AI slop. Wrapper. Scam. I've heard them all.

What I'm about to go into is something I've never really talked about. Its going to be long, its going to be irrelevant to some people I know but I just feel I need to share how I feel with this whole project.

I've even had people tell me I'm lying. Lying about building this alone. Lying about this being my first ever coding project. Lying about how I actually built it. People claiming I must have a background in this. That there's no way this is my first ever project. I've had people ask me to show my GitHub profile so they can catch me out - go ahead and look. There is absolutely nothing on there before December 2025. Since then I've had close to 1000 commits. That's what four months of 12 to 16 hour days every single day looks like on paper. They accuse me of using the never-coded-before angle to make it sound more impressive. I promise you nobody would choose that angle if they were making it up. It was genuinely the hardest thing I've ever done and I struggled every single day without anyone I could turn to for help or advice. Just Gemini. Gemini got me through every debug, every implementation and every idea I had from start to finish.

So let me tell you exactly how I built it because apparently that needs saying.

I used Gemini chat. That's it. I would open Gemini, describe what I wanted to implement, ask what I needed to add to my code, copy the answer, go into Visual Studio Code, create the file myself and update it myself. Every single time. Every feature. Every debug. Every integration. I didn't use any no-code tools. I didn't even know Claude existed until about two weeks ago and when I saw it modify a file and update code while I just sat there watching I was genuinely in shock because that wasn't how I'd been doing any of it.

Every backend setup - Firebase, Google Workspace, IAMs, policies, scopes, SDK admin, collection tables, field structures, security rules - done manually by me sitting there figuring it out as I went.

You want to know what my codebase looks like? Every single file has headers I wrote myself. Before the model list I wrote //Models used within platform//. Every function has a label so I know exactly what part of the code handles video generation or music generation. There are models in there commented out right now like GPT-5.5 because I'm still testing and when I'm ready I know exactly where to go. There's a full 3D Studio Forge engine in there commented out - built, tested, ready - but I'm not going to pay $20 a month for the Meshy API when I don't have enough users to make it worthwhile yet. The code is there. It's waiting.

That's how it got built. Four months of that. Every day. No days off. Nobody to ask. Nobody to check my work or tell me I was doing it right.

But here's what nobody knows. At one point I did think they were right.

The negativity every time I tried to put it out there wore me down. I convinced myself I had wasted months of my life. That it wasn't good enough and never would be. So I deleted the entire project. No backup. Deleted my Vercel profile. Walked away from the whole thing.

I didn't know what to do with myself for a couple of weeks. That's not an exaggeration. I genuinely didn't know what I was doing or where I was going.

But something told me not to stay gone.

So I started again. From scratch. The original project was called Ask AI with the domain ask-ai.info. When I came back the second time I made a decision to make it personal. My name is Sary. AskSary felt right because by that point I was completely attached to what I was building in a way I couldn't really explain. And somewhere in that restart the goal shifted. I stopped wanting to just build a chatbot. I started wanting to prove to myself that I could build an entire platform. Not for anyone else. Just to see if I could.

That's the version you're looking at now.

And yet somehow the conversation always ends up back at slop.

I've had people comment on my posts saying my API keys are exposed. They're not. All 22 of them have healthy balances on them so if you genuinely think they're exposed help yourself to free credits. Known domain blocking. Signup verification. Firestore security rules. Firebase token verification. Role escalation protection. IP and user based rate limiting across signup, chat and every API route. Sentry error handling throughout. And every single one of my personal accounts is protected by FIDO2 NFC two factor authentication, passkey, Authy and phone number verification. But sure - the keys are exposed.

I keep asking myself what would it actually take. What would someone need to see to not dismiss it at the first line. To just sit back and actually read what's been implemented here. The proactive memory that reads your previous conversations on login and messages you before you've typed a word. The cross-device persistent memory that follows you from your phone to your laptop across completely different AI models without you doing anything. Hands-free speech to text and text to speech. Real-time WebRTC voice with near-zero latency. Podcast mode with two AI voices. Video generation across Luma, Kling and Veo. Google Drive and Notion native integrations. RAG knowledge base with 500MB file support. Full iOS, Android, Mac and web apps on the same account. 26 languages with complete RTL support. 30+ live interactive wallpapers. I could keep going.

I could build an AI-powered operating system with full automation and sci-fi voice interaction and I genuinely believe the first comment would still be wrapper.

I don't think I've ever posted on Reddit and had a single person say anything positive about what I built. Not once. And it gets to you. This is all I've done every single day for four months. Every waking hour. I never had anyone in my corner telling me it was good enough or that I was on the right track. I just kept going because stopping wasn't something I was willing to do. I already tried stopping once. I know how that feels. I'm not doing it again.

I sit there now and read comments where someone is asking for exactly what I've built. Exactly the tool they need. And I know mine does it. I know it does it well. But I hesitate. I don't reply. Because the second I drop that link I already know what's coming. Scam. Spam. Slop. And when you read that every single time you try, something in you just starts to ask what is the point.

So instead of just talking about it I tried to prove it. I restructured the whole platform and removed paywalls that every other site keeps locked. Real-time voice chat - free. Music generation - free. Video generation - free. Access to premium models including GPT, Gemini and DeepSeek with unlimited messages, no daily limits, no fair use policy - free. 25 image generations a month using GPT-Image-1. Video generation using Luma Dream and Kling. Free. Because I thought if people can actually use it, actually see it work, they'll know it's not a scam.

One month my API bill was $900. Nine hundred dollars out of my own pocket. I thought it was fine. Everyone would experience what the platform can do and they'd upgrade to keep using it.

They didn't.

And I still kept the free tier because I didn't know what else to do. I'd already been called a scam enough times that I was terrified of putting anything behind a paywall and giving people another reason to say it.

That's what sustained negativity does. It doesn't just hurt your feelings. It changes your decisions. It costs you money. It makes you give away things you shouldn't give away just to prove you're real.

Has anyone else gone through this. Does anyone else sit with something they've built and still feel like it isn't good enough no matter what the evidence says. Because I have close to 1300 signed up users. I have an MRR. Got 2 people that are actually paying for what I built. One on premium tier, one on Ultra Tier. $50 a month. And I still sit there some days genuinely hoping they actually like it. Hoping they resubscribe next month. Still waiting for the moment it feels real.

I read the ChatGPT comparison that ranked me above Poe and you would think I'd be sitting here feeling proud. I'm really not. I don't know why. Every single step of this journey has been met with negativity and I think it's just worn me down in ways I didn't notice until recently. My self confidence over these past few months has dropped more than I expected. I even seriously considered making the whole thing open source just so other people could improve it - because part of me stopped believing I was the right person to take it further.

That's what sustained negativity does. Not one big moment. Just a slow drip of dismissal until you start to wonder if the people dismissing you are seeing something you can't.

I've even been banned from subreddits where promotion is allowed. I would go in on a Show Off Saturday thread and just explain what I've built in detail and the comments would come flying in. Scam. This person is lying about his journey. There's no way he's offering those models for free. And the mods would read that and just ban me.

So I started doing something I've never told anyone. Every time the doubt got loud enough I would open ChatGPT and ask it what it thought of what I'd built. I'd tell it that before this project I had never used GitHub, Vercel, Terminal, Firebase, Xcode, Android Studio or Visual Studio. I'd tell it everything I learned from scratch just to hear it tell me I'd built something complex and real. Something that would take years to understand properly. I needed that from somewhere and I wasn't getting it from anywhere else. AI became the only voice telling me it was worth continuing. Make of that what you will.

This month I was featured in Inc. Arabia's Gamechangers AI Magazine. I'm ranked #4 on F6S for Artificial Intelligence in the region out of 2 million startups. I've been given a complimentary pod at LEAP - the biggest tech event in the world.

And I still have days where I feel like none of it is enough.

LEAP is the one I can't stop thinking about. I'm going alone. No funding. No LED walls or branded displays or team in matching shirts. I'll probably just be standing there with a laptop while companies with VC backing and six-figure exhibition budgets surround me on every side. I've asked myself more than once what the point of going even is. Whether I belong in that room at all.

I don't have the answers on how you cope with the backlash. I don't know how other people deal with building something they believe in and watching it get dismissed by people who spent thirty seconds looking at it. If you've been through something like this I genuinely want to know how you dealt with it. Because I'm still figuring it out.

I just know that every time someone called it slop it knocked me back and I would sit there and think to myself what possibly more would it take.

This is my site, asksary.com

I've changed the landing page a million times as that always gets commented on but I guess what I'm looking for is for people to actually try the site out. Make an account, it gives you all the free stuff I mentioned before. If something doesn't work or could be improved just tell me. I would much rather hear why it's not good enough, what could be better than just get told it's slop. I want to know what would you want me to implement to make you say wow because honestly I will go away and do it.

reddit.com
u/Beneficial-Cow-7408 — 19 days ago
▲ 0 r/tts

Hi all, I've built a platform that uses OpenAI WebRTC for TTS and STT within the chat.

If you create a free account you get unlimited GPT-5-Nano, Gemini 2.5 Flash and DeepSeek V3 usage with no daily limits or fair use policy as well as the ability to use TTS and STT for free.

So you could have a handsfree chat with AI using decent models for completely free. No catch.

asksary.com

The platform does so so much more though which is shown below but the free tier is what I'm trying to make good so people sign up and discover the paid tier. You also get 1 minute free realtime voice chat with OpenAI WebRTC with almost zero latency as well which is brilliant if you never tried voice chat. Probably the most realistic one out there which doesnt sound robotic.

https://preview.redd.it/rp1mxcb24tyg1.png?width=1024&format=png&auto=webp&s=8d552e845ba6c851cd417332e4f5e2892449f15e

reddit.com
u/Beneficial-Cow-7408 — 21 days ago

HI all, I've been working on a AI Platform that integrates GPT, Gemini, Grok, Deepseek and Claude and now looking to market this product in Bahrain but dont know where to start. This is the platform I built feature wise and it has full RTL support for arabic, urdu and 26 other languages. So the UI changes the menu's etc to make it accessible globally.

https://preview.redd.it/xm0zt7vvojyg1.png?width=1024&format=png&auto=webp&s=d179d76fd72968a484ef9495ce2edd3b6d3ad4c9

https://preview.redd.it/k0gowysxojyg1.png?width=1048&format=png&auto=webp&s=e2cb2dac09a1fda00ccd9867dea329d7208fc41e

https://preview.redd.it/0nzohgdjpjyg1.png?width=1590&format=png&auto=webp&s=7ba93c425c2d24d41b3c4d357c7878c02155d868

I am currently #4 in the rankings for start up company in Bahrain with reference to F6S and also I have been awarded a complimentary pod to show case my platform at LEAP festival in Riyadh this year. I was also featured in Inc Arabia AI Gamechangers Magazine release this year.

I did apply for Oqal who were behind funding Careem and Hunger Station and made it to the final round before investment but unfortunately didnt get through due to it being a competitive market. The investment was to help with marketing but now I've got to do it alone I'm to sure where to start in Bahrain as I'm pretty new here.

I do have a CR license in place but thats for another business I have so I'm looking to register this either in Bahrain or Saudi Arabia. Would I need to do this if I'm taking payments through my UK account and having it registered there?

Not sure which would be best for me or what is required of me by law here. It's currently running through the UK as thats where I'm from originally. Technically speaking ChatGPT is registered in the US and they're taking payments worldwide so mine is effectively on the same principle too.

I feel I've done what I can to get it recognized but the recognition is from companies that support start up. It hasn't helped my traction with my user base. It's been out for about a month now and currently sat on 1200 users or so and is making a revenue but it's not enough to live off.

What works here for advertising. is it social media like instagram posts, TikTok etc or am I better off paying for Google Ad's and try promoting that way. Another option which I was going to do in the UK was to put a advert on TV but again I have no idea if this is the right approach here.

Hoping other business owners and starts up would have some feedback or pointers or where to start.

reddit.com
u/Beneficial-Cow-7408 — 22 days ago

https://preview.redd.it/uf7nfjgoljyg1.png?width=1024&format=png&auto=webp&s=c2ef6f981202b6f890462a64aa530f8799e5efd6

Built a multi-model AI Platform and would love some feedback on what I've built. I've posted the full feature card to show what it can do and for anyone that wants to try it they can at www.asksary.com

I would like to know also is their anything I've missed out feature wise or anything a user may want which I dont have. Always looking for feedback as it helps me continuously improve my app.

reddit.com
u/Beneficial-Cow-7408 — 22 days ago

https://reddit.com/link/1t05dbe/video/7y4qg8hmjdyg1/player

My name is Sary and I live in Bahrain. Originally though I'm from the UK. About 4 months ago I was playing around with ChatGPT just seeing what it can do and one day I thought for a laugh to ask it to build me a Chat bot. What I got in return was a broken chat window which essentially didnt work. I was intrigued to why it wouldn't work and thats where my journey began.

What started as a broken chatbox evolved into a full-scale AI platform, purely because I couldn't stop adding features. Before this, I had never written a single line of code in my life. I learned everything from scratch along the way - VS Code, Xcode, Vercel, API integration, Firestore, Firebase, IAM security, GitHub, and Terminal.

But here's what I'm most proud of...I didn't let AI just write the code for me. I used it the way a student uses a textbook - I'd prompt Gemini to explain exactly what I wanted to implement and why it worked that way. Every feature, every backend config, every security protocol and every one of those 700 commits was only pushed after I actually understood what it did.

4 months later, I can read and navigate a 163,000-line codebase I built myself. That still doesn't feel real.

AskSary isn't a wrapper and it isn't slop in my eyes but everyones entitled to their own opioidnion. A wrapper relays your message to one API and hands back the response - AskSary routes across eleven models, maintains persistent cross-model memory, runs a real-time voice engine over WebRTC, ingests documents into a RAG pipeline, syncs state across web, iOS, Android and Mac, handles team workspaces with pooled credits, and integrates Google Drive, Notion and email. Slop is thoughtless and generic - the proactive personalization, the podcast mode, the credit pooling architecture, the 26-language RTL-supported UI are deliberate product decisions that don't fall out of any API call. The underlying models are infrastructure, the same way a database is infrastructure. Nobody calls Notion a database wrapper. What matters is what was built on top - and what was built here is a full product.

I've spent absolutely nothing on advertising as of yet. All my traction has been purely organic and only via Reddit really. I've got a linkedIn account and twitter account I post from time to time but these are brand new accounts I created (couple of weeks old tops) so not really contributed to the traction I got. Reddit got me my first 12k visitors and 1000 signups.

Now what my platform can do. I dont know where to start really which is why I got AI to write my first post which essentially backfired so I will try go through everything step by step so I dont miss anything out.

The platform is a multi model auto routing system that uses GPT, Gemini, Claude, Grok and Deepseek. These are the core models of the platform with GPT-5-Nano being the basic free tier option going all the way up to O1 Pro and GPT-5.2 Pro on the higher tiers.

The auto routing system is a logic I built that picks the best model depending on prompt. So for realtime data it would route to Grok or for deep analysis it would route to GPT. All the models have web search facility built into it but essentially I found Grok to be better for realtime data. Users can let the system auto route of if they want to use a particular model then they can manually select the one they want.

The next feature is cross device persistent memory. To put it simply, you can start a conversation in GPT on your phone and once you get home and fire up the laptop you can start in Grok as an example and you dont need to repeat what you told GPT earlier that day. It just knows what was said without any summaries or prompts. This is ideal for those that switch models to get work done who currently probably do have to summarize their session before passing it onto the new model they selected. This feature is on by default but users can turn it off from their profile settings if needed. I've also got anonymous mode that doesnt record any of the chat either. This is only client side so customer interface. The messages are still processed by the API such as anthropic, openai etc

Next I've implemented pro active memory. What this does essentially is that the chat will message you first upon logging in. It will read the last 48 hours of your chat history and summaries/remind you where you were asking if you wanted to carry on or start a new chat. This is something pretty new I think as I havent seen any other chat bot do this but it's a little feature that I thought would make the chatbot more personal. Again this is controlled by a toggle in profile settings.

I've added Google Drive integration as well as Notion so you can access your files directly from and use them in chat or add them to Knowledge Base. In addition to this you can also send emails directly from the chat bot. You can dictate the email, ask it to rewrite it and it will open up a email template within chat where it pre-fills the data and all you have to do is click send. This works via both speech to text as well as manual entry. If you just type/speak "send email" it will just open up a blank template.

I've got team workspace integrated too with 2 plans, a team and a team pro plan that allows shared resources, unified RAG system and ultra tier for everyone who's a member regardless of their own tier. Both have full feature of the site with the only difference being that one has 5 member limit and the pro has 20 members limit. They all use a shared credit system that can be used on anything within the site and members can check their usage at any time.

I've implement RAG using OpenAI Vector store that has a 500mb file upload limit with unlimited uploads. This is stored directly on OpenAI so your information is safe and can be queried using any of the models. So you can use Grok to query a document you have uploaded in OpenAI or ask Claude to retrieve an entire code base if needed from OpenAI and rewrite the code before uploading it back to OpenAI Vector store.

Generation tools is quite vast. I have GPT-Image-1 and Nano Banana pro which is available on the free tier for image generation. I have Flux Pro image editing that allows for pixel perfect renders and for video generation I have Luma Dream, Veo 3.1, Kling 1.6, 2.6 and 3 which allow for 10 seconds of video with audio.
I've also got a music studio that uses ElevenLabs for generating a song with custom lyrics or gemini generated ones if left blank. On top of that you can also create voice overs to use for videos etc too.

Video analysis is a feature I've implemented as well with 500mb uploads per a file limit like the RAG system and a YouTube video analysis too for videos just under an hour long. Again available on free tier.

Developer tools I've added vision to code that lets you upload a screen shot and it will replicate that in code. With download and live edit available too. I've also got a split screen canvas so any code is rendered side by side the preview too. Web architect is another tool that lets you build apps and website from within the chat. Game engine lets you build prototype games etc and I've got 3d studio forge too that uses the meshy API to allow as the name suggests for you to create 3d models within the chat interface.

For voice I have implemented Realtime voice chat using OpenAI WebRTC for near zero latency. Podcast mode lets you turn any conversation into a two way again near zero latency conversation. Both options on voice chat and pod cast lets you select the voice you want etc too. So can have a male voice as the host and a female voice as the guest voice or whatever. Realtime has 8 voices and podcast has 4 voices off the top of my head that you can choose from.
I've got voice notes feature too so you can record voice notes and if your on a paid tier (Ultra) then these voice notes are auto summarized as well and it stores that in your folder.

Productivity tools include create word, excel, pdf documents on the go as well as covert files to PDF using CloudConvert so the replication is exact. Not OCR read but a word document that has formatting will convert exactly to the same format in PDF. Something ChatGPT, Claude, Gemini etc cant do.

The whole interface is fully customizable in 26 UI languages with themes and live interactive wallpapers as an option so you can customize your workspace to be exactly how you want.

Theirs a full Media Manager tool that lets you see all your past generations with no expiry on them either. You can download, view, share, delete, organize, rename etc from within the manager as well as a Document Manager tool which does the same thing but includes files from RAG or shared documents across Teams.

Error handling is covered by Sentry and I have rate limiting, IP limit, domain block in place to protect against abusing the free credits I offer monthly. I also have FIDO2 NFC security card, Authy 2 factor authentication and fingerprint authentication across all my workspaces including GITHUB, Vercel and Workspace to secure my 22 API keys I have implemented.

Its available on Web, iOS and Android as well as Mac Desktop.

Live at asksary.com

reddit.com
u/Beneficial-Cow-7408 — 23 days ago

Hi all, I hope this is ok to ask here but I was just wondering whats everyone go to chatgpt model. I know ChatGPT 5.5 is out and was wondering about how are you all finding it?

I'm not trying to promote anything just want some genuine feedback from heavy users of ChatGPT 5.5

I only ask as I implement OpenAI API key into my platform and still running GPT-5.2, 5.2 Pro and 01 Pro and when I asked the community about the 5.4 update they told me they preferred the older model which is why I haven't updated mine yet.

I'm seeing if the 5.5 model is something I should consider? I feel outdated a little bit but always skeptical about migrating to the latest models each time before allowing them to settle a bit and get tested.

I mean I'm still using GPT-Image-1 as it was a reason someone wanted to sign up to the platform as it didnt have the newer GPT-Image-1.5 at the time. He said it wasnt as good at rendering edges and preferred the older model but now theirs GPT-Image-2 so again I feel a bit outdated.

Should I move on from the GPT-5.2 models and integrate the latest or would people actually prefer a choice so keep the older models they can switch to as well as have the latest one?

Thank you in advance, real users of ChatGPT s why I came here to post so I can gather valuable information that would help me make a decision.

reddit.com
u/Beneficial-Cow-7408 — 23 days ago

https://reddit.com/link/1szr6h6/video/jxxmcu44layg1/player

Ok so my last post got attacked by the community as AI wrote it so I've deleted that post so I can write it again in my own way.

My name is Sary and I live in Bahrain. Originally though I'm from the UK. About 4 months ago I was playing around with ChatGPT just seeing what it can do and one day I thought for a laugh to ask it to build me a Chat bot. What I got in return was a broken chat window which essentially didnt work. I was intrigued to why it wouldn't work and thats where my journey began.

What started as a broken chatbox evolved into a full-scale AI platform, purely because I couldn't stop adding features. Before this, I had never written a single line of code in my life. I learned everything from scratch along the way - VS Code, Xcode, Vercel, API integration, Firestore, Firebase, IAM security, GitHub, and Terminal.

But here's what I'm most proud of...I didn't let AI just write the code for me. I used it the way a student uses a textbook - I'd prompt Gemini to explain exactly what I wanted to implement and why it worked that way. Every feature, every backend config, every security protocol and every one of those 700 commits was only pushed after I actually understood what it did.

4 months later, I can read and navigate a 163,000-line codebase I built myself. That still doesn't feel real.

AskSary isn't a wrapper and it isn't slop in my eyes but everyones entitled to their own opioidnion. A wrapper relays your message to one API and hands back the response - AskSary routes across eleven models, maintains persistent cross-model memory, runs a real-time voice engine over WebRTC, ingests documents into a RAG pipeline, syncs state across web, iOS, Android and Mac, handles team workspaces with pooled credits, and integrates Google Drive, Notion and email. Slop is thoughtless and generic - the proactive personalization, the podcast mode, the credit pooling architecture, the 26-language RTL-supported UI are deliberate product decisions that don't fall out of any API call. The underlying models are infrastructure, the same way a database is infrastructure. Nobody calls Notion a database wrapper. What matters is what was built on top - and what was built here is a full product.

I've spent absolutely nothing on advertising as of yet. All my traction has been purely organic and only via Reddit really. I've got a linkedIn account and twitter account I post from time to time but these are brand new accounts I created (couple of weeks old tops) so not really contributed to the traction I got. Reddit got me my first 12k visitors and 1000 signups.

Now what my platform can do. I dont know where to start really which is why I got AI to write my first post which essentially backfired so I will try go through everything step by step so I dont miss anything out.

The platform is a multi model auto routing system that uses GPT, Gemini, Claude, Grok and Deepseek. These are the core models of the platform with GPT-5-Nano being the basic free tier option going all the way up to O1 Pro and GPT-5.2 Pro on the higher tiers.

The auto routing system is a logic I built that picks the best model depending on prompt. So for realtime data it would route to Grok or for deep analysis it would route to GPT. All the models have web search facility built into it but essentially I found Grok to be better for realtime data. Users can let the system auto route of if they want to use a particular model then they can manually select the one they want.

The next feature is cross device persistent memory. To put it simply, you can start a conversation in GPT on your phone and once you get home and fire up the laptop you can start in Grok as an example and you dont need to repeat what you told GPT earlier that day. It just knows what was said without any summaries or prompts. This is ideal for those that switch models to get work done who currently probably do have to summarize their session before passing it onto the new model they selected. This feature is on by default but users can turn it off from their profile settings if needed. I've also got anonymous mode that doesnt record any of the chat either. This is only client side so customer interface. The messages are still processed by the API such as anthropic, openai etc

Next I've implemented pro active memory. What this does essentially is that the chat will message you first upon logging in. It will read the last 48 hours of your chat history and summaries/remind you where you were asking if you wanted to carry on or start a new chat. This is something pretty new I think as I havent seen any other chat bot do this but it's a little feature that I thought would make the chatbot more personal. Again this is controlled by a toggle in profile settings.

I've added Google Drive integration as well as Notion so you can access your files directly from and use them in chat or add them to Knowledge Base. In addition to this you can also send emails directly from the chat bot. You can dictate the email, ask it to rewrite it and it will open up a email template within chat where it pre-fills the data and all you have to do is click send. This works via both speech to text as well as manual entry. If you just type/speak "send email" it will just open up a blank template.

I've got team workspace integrated too with 2 plans, a team and a team pro plan that allows shared resources, unified RAG system and ultra tier for everyone who's a member regardless of their own tier. Both have full feature of the site with the only difference being that one has 5 member limit and the pro has 20 members limit. They all use a shared credit system that can be used on anything within the site and members can check their usage at any time.

I've implement RAG using OpenAI Vector store that has a 500mb file upload limit with unlimited uploads. This is stored directly on OpenAI so your information is safe and can be queried using any of the models. So you can use Grok to query a document you have uploaded in OpenAI or ask Claude to retrieve an entire code base if needed from OpenAI and rewrite the code before uploading it back to OpenAI Vector store.

Generation tools is quite vast. I have GPT-Image-1 and Nano Banana pro which is available on the free tier for image generation. I have Flux Pro image editing that allows for pixel perfect renders and for video generation I have Luma Dream, Veo 3.1, Kling 1.6, 2.6 and 3 which allow for 10 seconds of video with audio.
I've also got a music studio that uses ElevenLabs for generating a song with custom lyrics or gemini generated ones if left blank. On top of that you can also create voice overs to use for videos etc too.

Video analysis is a feature I've implemented as well with 500mb uploads per a file limit like the RAG system and a YouTube video analysis too for videos just under an hour long. Again available on free tier.

Developer tools I've added vision to code that lets you upload a screen shot and it will replicate that in code. With download and live edit available too. I've also got a split screen canvas so any code is rendered side by side the preview too. Web architect is another tool that lets you build apps and website from within the chat. Game engine lets you build prototype games etc and I've got 3d studio forge too that uses the meshy API to allow as the name suggests for you to create 3d models within the chat interface.

For voice I have implemented Realtime voice chat using OpenAI WebRTC for near zero latency. Podcast mode lets you turn any conversation into a two way again near zero latency conversation. Both options on voice chat and pod cast lets you select the voice you want etc too. So can have a male voice as the host and a female voice as the guest voice or whatever. Realtime has 8 voices and podcast has 4 voices off the top of my head that you can choose from.
I've got voice notes feature too so you can record voice notes and if your on a paid tier (Ultra) then these voice notes are auto summarized as well and it stores that in your folder.

Productivity tools include create word, excel, pdf documents on the go as well as covert files to PDF using CloudConvert so the replication is exact. Not OCR read but a word document that has formatting will convert exactly to the same format in PDF. Something ChatGPT, Claude, Gemini etc cant do.

The whole interface is fully customizable in 26 UI languages with themes and live interactive wallpapers as an option so you can customize your workspace to be exactly how you want.

Theirs a full Media Manager tool that lets you see all your past generations with no expiry on them either. You can download, view, share, delete, organize, rename etc from within the manager as well as a Document Manager tool which does the same thing but includes files from RAG or shared documents across Teams.

Error handling is covered by Sentry and I have rate limiting, IP limit, domain block in place to protect against abusing the free credits I offer monthly. I also have FIDO2 NFC security card, Authy 2 factor authentication and fingerprint authentication across all my workspaces including GITHUB, Vercel and Workspace to secure my 22 API keys I have implemented.

Its available on Web, iOS and Android as well as Mac Desktop.

I hope this is a better write up for the community over my last copy and paste AI verb.

Live at asksary.com

reddit.com
u/Beneficial-Cow-7408 — 23 days ago

https://reddit.com/link/1szpafd/video/149h9ygp7ayg1/player

Started as a simple wrapper around the OpenAI API. Ended up as a full multi-model AI platform across web, iOS, Android and Mac Desktop - with a real-time voice engine, video and music generation, team workspaces, a shared credit system, integrations with Google Drive, Notion and Email, and 26 UI languages with full RTL support.

Every line of code written by one person in Visual Studio. Close to a 1000 commits on Github and 22 API keys integrated.

The product is live, growing with 1100 users, and still shipping daily.

Here's what it does:

Asksary: A multi-model AI platform that runs on web, iOS, Android and Mac Desktop too - same features, same account, same memory across all of them.

Every major model in one place: GPT-5-Nano, Gemini 2.5 Flash, DeepSeek V3, GPT-5.2, Claude Sonnet 4.6, Gemini 3.1 Pro, Grok 4, DeepSeek R1, O1 Reasoning, Gemini Ultra, GPT-5.2 Pro - with smart auto-routing or manual override selection

Pro-Active Personalization

The chatbot will read previous messages on every login and pro-actively send the first message asking to continue with previous chat or start something new all before you send the first prompt.

Google Drive Integration

Access your files directly in Google Drive and use them in chat or add them to Knowledge Base

Notion Integration

Access your files directly from Notion and use them in chat or add them to Knowledge Base

Email Integration
Send emails directly from within the chat as well as read inbox, summarize and get daily calendar updates.

Team Workspaces Invite your team and collaborate under one shared workspace. Every member gets full Ultra model access — GPT-5.2 Pro, O1 Reasoning, Gemini Ultra, Grok 4 and more - with credits pooled together on a single bill.

Shared Knowledge Base Upload documents once and your entire team can query them instantly. Build a living knowledge base your whole organization can draw from - no need to re-upload or share files individually.

Team Credit Pool No per-seat credit limits. Your team shares one generous monthly credit pool across every tool - chat, image, video, music and more. Top up anytime if you need more.

Organisation Analytics See exactly how your team is using credits across every tool and model. Understand usage patterns, spot heavy users, and manage your pool with full visibility.

All Integrations, Every Seat Google Drive, Notion, Email and every other integration is available to all team members — not just the account owner. Your whole team works with the same connected tools.

RAG - Knowledge Base
Upload docs and chat with them across any model via OpenAI Vector Store RAG up to 500mb per file upload. Unlimited uploads

Persistent Memory
Persistent cross-model memory. Start a conversation with Claude on your phone, open your laptop, switch to GPT-5.2 and it already knows what you discussed. No copy-pasting, no summaries. Just works.

Generation tools:

  • Image Generation - GPT-Image-1 and Nano Banana Pro
  • Flux Image Editor - Full editing suite with visual history
  • Video Studio - Luma Dream, Veo 3.1, Kling 1.6, 2.6 and 3 - up to 10 second AI videos with audio
  • Music Studio - 30 second tracks with custom or AI generated lyrics via ElevenLabs plus visualizer built within the chat
  • 3D Model Studio - Meshy integration with STL export (deploying soon)
  • Video analysis both via upload (500MB) and YouTube Link

Developer and builder tools:

  • Vision to Code - Screenshot any UI, get live editable code
  • Web Architect - Build full web apps from a single prompt
  • Game Engine - Build and prototype games with AI
  • Code Lab - Split screen live coding with SQL Architect, Bug Buster, Git Guru, Regex Generator, Test Genie and more
  • Tavily web search tool across all models

Voice and audio:

  • Real-Time 2-Way Voice Chat - 8 Voices -  Near-zero latency WebRTC
  • Podcast Mode - Two AI voices with switchable voices. (Choice of 8) using webRTC also for near zero latency between the chats -  downloadable as MP3
  • Voiceover, Voice Notes, Voice Tuner all built in

Productivity and content:

  • Slides, Docs and File Tools
  • Pro Writer and Content Library
  • Social tools - Hook Generator, Video Script, Hashtag Creator, Idea Spark
  • Business Suite - Pitch Deck builder, Deep Analytics, Legal Eagle, Maths Solver
  • Daily Briefing and Market Watch
  • CV Creator, Email Polisher, Cover Letter builder, TL;DR Bot
  • Share conversations or snippets of answers with anyone

Platform extras:

  • 30+ live interactive wallpapers and themes
  • Custom Agents and Personas
  • Folder organisation for chat
  • Smart Search across chat history
  • Media Manager Gallery with access to all content created previously
  • Fully customizable UI in 26 Languages with full RTL support

The Stack: Frontend - Next.js, Capacitor (iOS and Android) and Vanilla JS / React

Backend - Vercel serverless functions, Firebase / Firestore (database + auth) and Firebase Admin SDK

AI Models - OpenAI (GPT, DALL-E, GPT-Image-1), Anthropic (Claude), Google (Gemini), xAI (Grok), DeepSeek

Generation APIs - Luma AI (video), Kling via Replicate (video), Veo via Replicate (video), ElevenLabs (music), Flux via Replicate (image editing), Meshy (3D — coming soon)

Integrations - Google Drive (OAuth 2.0), Notion (OAuth 2.0), Tavily (web search), OpenAI Vector Store (RAG), Stripe (payments), CloudConvert (document conversion), Sentry (error tracking), Formidable (file handling)

Rendering - Mermaid (flow charts) and MathJax

Platforms - Web, iOS, Android

Languages - 26 UI languages with full RTL support

What you get free just for creating an account (1000 credits every month, rolling):

  • Unlimited chat on GPT-5 Nano, Gemini Flash and DeepSeek V3 - credit charge 0
  • 25 image generations via GPT-Image-1 and Nano Banana Pro - 40 credits
  • 8 image edits via Flux Studio - 80 credits 
  • 2 song generations via ElevenLabs - 350 credits 
  • 2 video generations via Luma Dream and Kling - 350 credits 
  • ~70 messages on Claude Sonnet 4.6,  GPT-5.2, Grok 4, Gemini 3.1 Pro and DeepSeek R1 - 15 credits

No daily limits and no fair use policy on the unlimited chat.

Free - 1,000 credits/month 

Premium - 8,000 credits/month · $17.99 / month

Ultra - 20,000 credits/month · $29.99 / month

Team (5 members) - 100,000 shared credits/month · $99.99 / month

Team Pro (20 members) - 500,000 shared credits/month · $199.99 / month

Full Ultra access on all team plans

Platform: Web, iOS, Android, Mac Desktop

🌐 www.asksary.com

Android:  https://play.google.com/store/apps/details?id=ask_ai.info.twa

iOS:  https://apps.apple.com/app/asksary-ai-chat-image-video/id6758881557

reddit.com
u/Beneficial-Cow-7408 — 23 days ago

https://reddit.com/link/1szp95p/video/ymlvukkb7ayg1/player

Started as a simple wrapper around the OpenAI API. Ended up as a full multi-model AI platform across web, iOS, Android and Mac Desktop - with a real-time voice engine, video and music generation, team workspaces, a shared credit system, integrations with Google Drive, Notion and Email, and 26 UI languages with full RTL support.

Every line of code written by one person in Visual Studio. Close to a 1000 commits on Github and 22 API keys integrated.

The product is live, growing with 1100 users, and still shipping daily.

Here's what it does:

Asksary: A multi-model AI platform that runs on web, iOS, Android and Mac Desktop too - same features, same account, same memory across all of them.

Every major model in one place: GPT-5-Nano, Gemini 2.5 Flash, DeepSeek V3, GPT-5.2, Claude Sonnet 4.6, Gemini 3.1 Pro, Grok 4, DeepSeek R1, O1 Reasoning, Gemini Ultra, GPT-5.2 Pro - with smart auto-routing or manual override selection

Pro-Active Personalization

The chatbot will read previous messages on every login and pro-actively send the first message asking to continue with previous chat or start something new all before you send the first prompt.

Google Drive Integration

Access your files directly in Google Drive and use them in chat or add them to Knowledge Base

Notion Integration

Access your files directly from Notion and use them in chat or add them to Knowledge Base

Email Integration
Send emails directly from within the chat as well as read inbox, summarize and get daily calendar updates.

Team Workspaces Invite your team and collaborate under one shared workspace. Every member gets full Ultra model access — GPT-5.2 Pro, O1 Reasoning, Gemini Ultra, Grok 4 and more - with credits pooled together on a single bill.

Shared Knowledge Base Upload documents once and your entire team can query them instantly. Build a living knowledge base your whole organization can draw from - no need to re-upload or share files individually.

Team Credit Pool No per-seat credit limits. Your team shares one generous monthly credit pool across every tool - chat, image, video, music and more. Top up anytime if you need more.

Organisation Analytics See exactly how your team is using credits across every tool and model. Understand usage patterns, spot heavy users, and manage your pool with full visibility.

All Integrations, Every Seat Google Drive, Notion, Email and every other integration is available to all team members — not just the account owner. Your whole team works with the same connected tools.

RAG - Knowledge Base
Upload docs and chat with them across any model via OpenAI Vector Store RAG up to 500mb per file upload. Unlimited uploads

Persistent Memory
Persistent cross-model memory. Start a conversation with Claude on your phone, open your laptop, switch to GPT-5.2 and it already knows what you discussed. No copy-pasting, no summaries. Just works.

Generation tools:

  • Image Generation - GPT-Image-1 and Nano Banana Pro
  • Flux Image Editor - Full editing suite with visual history
  • Video Studio - Luma Dream, Veo 3.1, Kling 1.6, 2.6 and 3 - up to 10 second AI videos with audio
  • Music Studio - 30 second tracks with custom or AI generated lyrics via ElevenLabs plus visualizer built within the chat
  • 3D Model Studio - Meshy integration with STL export (deploying soon)
  • Video analysis both via upload (500MB) and YouTube Link

Developer and builder tools:

  • Vision to Code - Screenshot any UI, get live editable code
  • Web Architect - Build full web apps from a single prompt
  • Game Engine - Build and prototype games with AI
  • Code Lab - Split screen live coding with SQL Architect, Bug Buster, Git Guru, Regex Generator, Test Genie and more
  • Tavily web search tool across all models

Voice and audio:

  • Real-Time 2-Way Voice Chat - 8 Voices -  Near-zero latency WebRTC
  • Podcast Mode - Two AI voices with switchable voices. (Choice of 8) using webRTC also for near zero latency between the chats -  downloadable as MP3
  • Voiceover, Voice Notes, Voice Tuner all built in

Productivity and content:

  • Slides, Docs and File Tools
  • Pro Writer and Content Library
  • Social tools - Hook Generator, Video Script, Hashtag Creator, Idea Spark
  • Business Suite - Pitch Deck builder, Deep Analytics, Legal Eagle, Maths Solver
  • Daily Briefing and Market Watch
  • CV Creator, Email Polisher, Cover Letter builder, TL;DR Bot
  • Share conversations or snippets of answers with anyone

Platform extras:

  • 30+ live interactive wallpapers and themes
  • Custom Agents and Personas
  • Folder organisation for chat
  • Smart Search across chat history
  • Media Manager Gallery with access to all content created previously
  • Fully customizable UI in 26 Languages with full RTL support

The Stack: Frontend - Next.js, Capacitor (iOS and Android) and Vanilla JS / React

Backend - Vercel serverless functions, Firebase / Firestore (database + auth) and Firebase Admin SDK

AI Models - OpenAI (GPT, DALL-E, GPT-Image-1), Anthropic (Claude), Google (Gemini), xAI (Grok), DeepSeek

Generation APIs - Luma AI (video), Kling via Replicate (video), Veo via Replicate (video), ElevenLabs (music), Flux via Replicate (image editing), Meshy (3D — coming soon)

Integrations - Google Drive (OAuth 2.0), Notion (OAuth 2.0), Tavily (web search), OpenAI Vector Store (RAG), Stripe (payments), CloudConvert (document conversion), Sentry (error tracking), Formidable (file handling)

Rendering - Mermaid (flow charts) and MathJax

Platforms - Web, iOS, Android

Languages - 26 UI languages with full RTL support

What you get free just for creating an account (1000 credits every month, rolling):

  • Unlimited chat on GPT-5 Nano, Gemini Flash and DeepSeek V3 - credit charge 0
  • 25 image generations via GPT-Image-1 and Nano Banana Pro - 40 credits
  • 8 image edits via Flux Studio - 80 credits 
  • 2 song generations via ElevenLabs - 350 credits 
  • 2 video generations via Luma Dream and Kling - 350 credits 
  • ~70 messages on Claude Sonnet 4.6,  GPT-5.2, Grok 4, Gemini 3.1 Pro and DeepSeek R1 - 15 credits

No daily limits and no fair use policy on the unlimited chat.

Free - 1,000 credits/month 

Premium - 8,000 credits/month · $17.99 / month

Ultra - 20,000 credits/month · $29.99 / month

Team (5 members) - 100,000 shared credits/month · $99.99 / month

Team Pro (20 members) - 500,000 shared credits/month · $199.99 / month

Full Ultra access on all team plans

Platform: Web, iOS, Android, Mac Desktop

🌐 www.asksary.com

Android:  https://play.google.com/store/apps/details?id=ask_ai.info.twa

iOS:  https://apps.apple.com/app/asksary-ai-chat-image-video/id6758881557

reddit.com
u/Beneficial-Cow-7408 — 23 days ago

https://reddit.com/link/1symdn4/video/z2yb02xhq1yg1/player

Been building AskSary solo for a while. Just shipped hands-free voice email - you're mid-conversation with an AI and you say "send an email to john@example.com subject X body Y" and it pre-fills the Gmail modal automatically. One tap sends.

Powered by OpenAI Realtime API, works in 22 languages.

But that's just the latest feature. Here's the full picture:

Every major model in one place GPT-5-Nano, GPT-5.2, GPT-5.2 Pro, O1 Reasoning, Claude Sonnet 4.6, Grok 4, Gemini 2.5 Flash, Gemini 3.1 Pro, Gemini Ultra, DeepSeek V3, DeepSeek R1 - with smart auto-routing or manual override.

Pro-Active Personalisation On every login the AI reads your previous conversations and sends the first message itself - asking if you want to continue or start fresh. Before you type a single word.

Persistent Cross-Model Memory Start a conversation with Claude on your phone, open your laptop, switch to GPT-5.2 - it already knows what you discussed. No copy-pasting, no summaries. Just works.

Knowledge Base - RAG Upload docs up to 500MB per file, unlimited uploads, chat with them across any model via OpenAI Vector Store. Your files stay in context forever.

Integrations Google Drive, Gmail, Google Calendar, Notion - access files, get email and calendar summaries, use them in chat or push them to your Knowledge Base.

Generation Tools

  • Image Gen - GPT-Image-1 and Nano Banana Pro
  • Flux Image Editor - full editing suite with visual history
  • Video Studio - Luma Dream, Veo 3.1, Kling 1.6 / 2.6 / 3, up to 10 second AI videos with audio
  • Music Studio - 30 second tracks with custom or AI lyrics via ElevenLabs, visualizer built into chat
  • 3D Model Studio - Meshy with STL export (deploying soon)
  • Video Analysis - upload up to 500MB or paste a YouTube link

Developer and Builder Tools

  • Vision to Code - screenshot any UI, get live editable code
  • Web Architect - build full web apps from a single prompt
  • Game Engine - build and prototype games with AI
  • Code Lab - split screen live coding with SQL Architect, Bug Buster, Git Guru, Regex Generator, Test Genie and more
  • Tavily web search across all models

Voice and Audio

  • Real-time 2-way voice chat - 8 voices, near-zero latency WebRTC
  • Podcast Mode - two AI voices, switchable, near-zero latency, downloadable as MP3
  • Voiceover Studio, Voice Notes, Voice Tuner

Productivity and Content

  • Slides, Docs and File Tools
  • Pro Writer and Content Library
  • Social Tools - Hook Generator, Video Script, Hashtag Creator, Idea Spark
  • Business Suite - Pitch Deck Builder, Deep Analytics, Legal Eagle, Maths Solver
  • Daily Briefing and Market Watch
  • CV Creator, Email Polisher, Cover Letter Builder, TL;DR Bot
  • Share conversations or snippets with anyone

Platform Extras

  • 30+ live interactive wallpapers and themes
  • Custom Agents and Personas
  • Folder organisation and Smart Search across chat history
  • Media Manager Gallery - all your generated content in one place
  • Fully customisable UI in 26 languages with full RTL support

The Stack Frontend: Next.js, Capacitor (iOS + Android), Vanilla JS / React Backend: Vercel serverless, Firebase / Firestore, Firebase Admin SDK AI: OpenAI, Anthropic, Google, xAI, DeepSeek Generation: Luma AI, Kling via Replicate, Veo via Replicate, ElevenLabs, Flux via Replicate, Meshy Integrations: Google Drive, Notion, Tavily, OpenAI Vector Store, Stripe, CloudConvert, Sentry Rendering: Mermaid, MathJax Platforms: Web, iOS, Android, Apple Vision Pro

What you get free just for creating an account (1,000 credits/month, rolling):

  • Unlimited chat on GPT-5 Nano, Gemini Flash and DeepSeek V3 - no daily limits, zero credit charge
  • 25 image generations via GPT-Image-1 and Nano Banana Pro - 40 credits each
  • 8 image edits via Flux Studio - 80 credits each
  • 2 song generations via ElevenLabs - 350 credits each
  • 2 video generations via Luma Dream and Kling - 350 credits each
  • ~70 messages on Claude Sonnet 4.6, GPT-5.2, Grok 4, Gemini 3.1 Pro and DeepSeek R1 - 15 credits each

No credit card required.

Built entirely solo. No CS degree, no team, no funding. Started because I asked an AI to build me a chatbot and it failed - so I built my own. Accepted to LEAP 2026 in Saudi Arabia along the way.

Happy to answer anything about the build.

asksary.co

reddit.com
u/Beneficial-Cow-7408 — 24 days ago
▲ 2 r/OpenAI

https://reddit.com/link/1symbsj/video/k2no3zfgq1yg1/player

Been building AskSary solo for a while. Just shipped hands-free voice email - you're mid-conversation with an AI and you say "send an email to john@example.com subject X body Y" and it pre-fills the Gmail modal automatically. One tap sends.

Powered by OpenAI Realtime API, works in 22 languages.

But that's just the latest feature. Here's the full picture:

Every major model in one place GPT-5-Nano, GPT-5.2, GPT-5.2 Pro, O1 Reasoning, Claude Sonnet 4.6, Grok 4, Gemini 2.5 Flash, Gemini 3.1 Pro, Gemini Ultra, DeepSeek V3, DeepSeek R1 - with smart auto-routing or manual override.

Pro-Active Personalisation On every login the AI reads your previous conversations and sends the first message itself - asking if you want to continue or start fresh. Before you type a single word.

Persistent Cross-Model Memory Start a conversation with Claude on your phone, open your laptop, switch to GPT-5.2 - it already knows what you discussed. No copy-pasting, no summaries. Just works.

Knowledge Base - RAG Upload docs up to 500MB per file, unlimited uploads, chat with them across any model via OpenAI Vector Store. Your files stay in context forever.

Integrations Google Drive, Gmail, Google Calendar, Notion - access files, get email and calendar summaries, use them in chat or push them to your Knowledge Base.

Generation Tools

  • Image Gen - GPT-Image-1 and Nano Banana Pro
  • Flux Image Editor - full editing suite with visual history
  • Video Studio - Luma Dream, Veo 3.1, Kling 1.6 / 2.6 / 3, up to 10 second AI videos with audio
  • Music Studio - 30 second tracks with custom or AI lyrics via ElevenLabs, visualizer built into chat
  • 3D Model Studio - Meshy with STL export (deploying soon)
  • Video Analysis - upload up to 500MB or paste a YouTube link

Developer and Builder Tools

  • Vision to Code - screenshot any UI, get live editable code
  • Web Architect - build full web apps from a single prompt
  • Game Engine - build and prototype games with AI
  • Code Lab - split screen live coding with SQL Architect, Bug Buster, Git Guru, Regex Generator, Test Genie and more
  • Tavily web search across all models

Voice and Audio

  • Real-time 2-way voice chat - 8 voices, near-zero latency WebRTC
  • Podcast Mode - two AI voices, switchable, near-zero latency, downloadable as MP3
  • Voiceover Studio, Voice Notes, Voice Tuner

Productivity and Content

  • Slides, Docs and File Tools
  • Pro Writer and Content Library
  • Social Tools - Hook Generator, Video Script, Hashtag Creator, Idea Spark
  • Business Suite - Pitch Deck Builder, Deep Analytics, Legal Eagle, Maths Solver
  • Daily Briefing and Market Watch
  • CV Creator, Email Polisher, Cover Letter Builder, TL;DR Bot
  • Share conversations or snippets with anyone

Platform Extras

  • 30+ live interactive wallpapers and themes
  • Custom Agents and Personas
  • Folder organisation and Smart Search across chat history
  • Media Manager Gallery - all your generated content in one place
  • Fully customisable UI in 26 languages with full RTL support

The Stack Frontend: Next.js, Capacitor (iOS + Android), Vanilla JS / React Backend: Vercel serverless, Firebase / Firestore, Firebase Admin SDK AI: OpenAI, Anthropic, Google, xAI, DeepSeek Generation: Luma AI, Kling via Replicate, Veo via Replicate, ElevenLabs, Flux via Replicate, Meshy Integrations: Google Drive, Notion, Tavily, OpenAI Vector Store, Stripe, CloudConvert, Sentry Rendering: Mermaid, MathJax Platforms: Web, iOS, Android, Apple Vision Pro

What you get free just for creating an account (1,000 credits/month, rolling):

  • Unlimited chat on GPT-5 Nano, Gemini Flash and DeepSeek V3 - no daily limits, zero credit charge
  • 25 image generations via GPT-Image-1 and Nano Banana Pro - 40 credits each
  • 8 image edits via Flux Studio - 80 credits each
  • 2 song generations via ElevenLabs - 350 credits each
  • 2 video generations via Luma Dream and Kling - 350 credits each
  • ~70 messages on Claude Sonnet 4.6, GPT-5.2, Grok 4, Gemini 3.1 Pro and DeepSeek R1 - 15 credits each

No credit card required.

Built entirely solo. No CS degree, no team, no funding. Started because I asked an AI to build me a chatbot and it failed - so I built my own. Accepted to LEAP 2026 in Saudi Arabia along the way.

Happy to answer anything about the build.

asksary.com

reddit.com
u/Beneficial-Cow-7408 — 24 days ago