r/MachineLearningAndAI

▲ 25 r/MachineLearningAndAI+1 crossposts

Dear DL researchers: how do you design your neural networks?

Genuine question,
how do you take some architectural decisions like the size of the neural network and the whole set of hyperparameters.

I get that there's brute forcing and hyperparameter search (which sometimes, really, it's a LOT), or some notes in literature regarding the choice of activations or loss based on context, but how would one really target some specific design choices when starting to explore efficiently, especially in terms of number of layers and latent space dimensions.

I appreciate your time, will take every tip into account

reddit.com
u/adamrayan — 1 day ago
▲ 13 r/MachineLearningAndAI+4 crossposts

Free RAG Interview Q&A repo with all 10 types of RAG. 50 questions with detailed answers, difficulty tags, and a decision tree. Contributors welcome!

Hey everyone,

I've been going deep on RAG architectures lately and couldn't find a single resource that covered all the modern variants in one place, so I built one and open-sourced it.

What's in the repo:

  • 10 sections covering every major RAG type
  • 50 interview questions tagged [Basic] / [Intermediate] / [Advanced]
  • Detailed answers with architecture diagrams, code snippets, and trade-off tables
  • A cheatsheet with a decision tree ("which RAG should I use?")
  • GitHub Pages site auto-deployed on every push

RAG types covered: Naive, Advanced, Modular, Agentic, Graph, Corrective (CRAG), Self-RAG, Speculative, Multi-modal, and Long-context RAG.

https://github.com/ather-techie/rag-interview-questions

Looking for contributors! If you've been in an ML/LLM interview recently and got a question not covered here, please open a PR or drop it in the comments. I'll add it with credit.

If this is useful, a star on GitHub goes a long way. it helps others discover it. Thanks!

u/Western-Slip199 — 1 day ago
▲ 6 r/MachineLearningAndAI+1 crossposts

Supertone's Supertonic is just a 66M param, on-device text-to-speech engine that runs via ONNX for cross-platform inference.

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

 Highlights

  • ⚡ Blazingly Fast — Low-latency, real-time synthesis across desktop, browser, mobile, and edge — fast enough to turn an entire webpage into audio in under a second
  • 🌍 31-Language Multilingual — Synthesize directly from text across 31 languages, or pass lang="na" to let Supertonic process the text language-agnostically when you don't know the input language — no separate language adapters needed
  • 🪶 99M-Parameter Open-Weight Model — A compact, fully open-weight checkpoint — a fraction of the size of 0.7B–2B class open TTS systems — for smaller downloads, faster cold starts, and lower memory footprint
  • 📱 Edge-Device Ready — Runs locally on desktop, mobile, browsers, and resource-constrained hardware like Raspberry Pi or e-readers, with zero network dependency, complete privacy, and no GPU required
  • 🔊 44.1kHz High-Quality Audio — Outputs studio-grade 44.1kHz 16-bit WAV directly, ready for production playback without any external upsampler
  • 🎭 Expression Tags — 10 inline tags (e.g. <laugh><breath><sigh>) bring natural human nuance into generated speech without prompt engineering or reference audio
  • 🛠️ Multi-Runtime SDKs — Ready-to-use examples through ONNX Runtime across Python, Node.js, Browser (WebGPU), Java, C++, C#, Go, Swift, iOS, Rust, and Flutter

🌍 Supported Languages (31)

Arabic (ar), Bulgarian (bg), Croatian (hr), Czech (cs), Danish (da), Dutch (nl), English (en), Estonian (et), Finnish (fi), French (fr), German (de), Greek (el), Hindi (hi), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Latvian (lv), Lithuanian (lt), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Slovak (sk), Slovenian (sl), Spanish (es), Swedish (sv), Turkish (tr), Ukrainian (uk), Vietnamese (vi)

The best part is it's 100% open source and comes under the MIT license.

Link: https://github.com/supertone-inc/supertonic

u/techlatest_net — 3 days ago
▲ 6 r/MachineLearningAndAI+1 crossposts

The Fundamental Problem of AI Agents

I’ve been using the AI Agent for less than a week, and I can say that their fundamental problem lies not in the agents’ architecture, but in the LLMs themselves. They don’t utilize the architectural potential of the Agent environment, they ignore skills, they don’t understand the documentation, and so on.

The only solution at the current level of technology is to retrain the LLM models. Moreover, the LLM must be trained separately for each Agent environment. It must know the documentation perfectly, even when there is nothing in its context yet. And its behavior patterns must be tailored to utilize skills and the full potential of the Agent architecture.

The problem here is not only that the LLM must be retrained separately for each Agent environment, but also that it must be retrained for each version of the environment.

Will this mean that if we train the LLM for each environment and each version of the environment, Agent developers will be forced to increase the time between releases, otherwise the constant training of models will perpetually disrupt processes? An interesting question. What do you think, guys?

reddit.com
u/MissionSea6586 — 3 days ago
▲ 2 r/MachineLearningAndAI+2 crossposts

Receiving funding/compute paid for a ML model

I'm an undergrad biology student who got really into AI architecture through one of my courses. I've spent the last few days developing an idea for a model (that has been a brain baby for about a month, but sadly - finals) and want to start training a prototype, but don't have the funds to cover compute costs.

Can anyone point me toward grants, programs, or people to reach out to for this kind of thing? Any help appreciated.

reddit.com
u/MaxeBooo — 6 days ago
▲ 14 r/MachineLearningAndAI+2 crossposts

Linear Regression Model

Hi everyone, I'm 13 and new to machine learning, and people recommended learning linear regression first, I made one using C++, the code itself is probably not great since C++ isn't my main language, Python is, but I'm trying to learn it because I wanna use it in USACO later, so I thought doing projects in C++ would help me get familiar with the language. Anyway, here's the Github repo: https://github.com/hl0228057-cmd/Basic-Linear-Regression-Using-Cpp

I'm open to feedback because I wanna get better and learn, thanks!

u/Sharp-Marsupial-7557 — 12 days ago
▲ 1 r/MachineLearningAndAI+1 crossposts

I built a skin cancer classifier from scratch with PyTorch — 83.9% test accuracy, no pre-trained models

I built a skin cancer classifier from scratch — here's what I learned 🧠

No pre-trained models. No transfer learning shortcuts. Just raw PyTorch, 10,015 images, and a lot of debugging.

→ Dataset: HAM10000 (benign vs malignant skin lesions)
→ Architecture: 3-block CNN built entirely from scratch
→ Result: 83.9% test accuracy | 85% validation accuracy
→ Trained on my own home GPU cluster

The hardest part wasn't building the model — it was understanding WHY it was learning what it was learning.

Key things I picked up:
✅ How convolutional layers learn spatial patterns
✅ Why dropout matters (overfitting hit hard at epoch 10)
✅ Class imbalance (80% benign, 20% malignant) and how it affects training ✅ The difference between validation and test accuracy

This was my first serious deep learning project built completely from scratch. It's not perfect — but it's real.

Full code 👇
https://github.com/Elijah-bino/skin_cancer_cnn-benign_vs_malignant

#MachineLearning #DeepLearning #PyTorch #DataScience #ComputerVision #MelbourneTech #AI #Monash

u/AdhesivenessThin3362 — 12 days ago
▲ 3 r/MachineLearningAndAI+2 crossposts

Totally clueless about machine learning project

I'm a fresher who recently graduated (Mathematics,Computer Science and Statistics Major) and was thinking of working on a project to make my CV slightly less terrible. However ,in that process I kinda got more confused than when I started and needed advice on a couple of things:

  1. What kind of projects would be impressive to employers at the graduate level?

  2. Hypothetically, would a project that does not involve libraries (Sci-kit learn or pytorch in particular) demonstrate higher conceptual understanding and execution.

Looking forward to hopefully getting things cleared a bit lol

reddit.com
u/Wise_Pangolin730 — 13 days ago