Do you create your flashcards with AI?
Curious how common it is to use AI for generating flashcards — and if you do, what's your workflow for getting them into your flashcard app?
Curious how common it is to use AI for generating flashcards — and if you do, what's your workflow for getting them into your flashcard app?
I used anki for language learning and had tried using it for regular studies. I gave up because it was too cumbersome.
Today I uploaded my notes to Claude and added some specifications and it made the perfect cards! It is so delightful.
I am glad I found this just before my post graduate studies.
TL;DR: There's no public dataset of what real language learners actually study and how their memory responds to it. Existing data either captures words without memory patterns, or memory patterns without the words. This survey collects both: Anki cards + review logs, from real learners, in any language. Participation takes ~10 minutes, and the survey runs entirely on your machine before submission for privacy. You review every card and exclude anything you don't want to share. It is fully GDPR-compliant. The dataset will be released openly so anyone - not just commercial platforms - can build on it.
Survey link: https://nekear.me/research
Below is more information on why this may matter to you, participation, privacy, the purpose of this research, and its novelty - in that order.
The most immediate benefit is that in just 10 minutes you're directly contributing to research that hasn't been done before, and to a dataset that will become a permanent public resource for the entire language-learning research community.
Longer term, this same research makes a new generation of learning tools possible:
And because the dataset will be public, anyone will be able to build them, not just one company.
To make the research outcomes meaningful, the dataset requires its content to follow specific rules. You're welcome to participate if:
As stated, it takes about 10 minutes, the steps are as straightforward as they can be:
.apkg) with the following checkboxes ticked: "Include scheduling information" (the review logs), "Include deck presets" (the scheduler configuration) and "Support older Anki versions".You'll receive a one-time withdrawal token in case you change your mind later.
TL;DR:
The full technical schema (every field, what's collected and why, what's transformed, and what's dropped) is accessible here: https://nekear.me/research/data-handling.
I recommend reviewing cards and notes manually as well, since the personal identification algorithm runs locally and, consequently, has real limitations.
My name is Michael. I'm a Master's in AI student at the University of Galway, Ireland working on a thesis at the intersection of AI and language learning.
Simply put, the research involves training an AI model that predicts how hard a specific word is for you, given the words you already know and your learning patterns. The model is trained on three inputs:
You can read more here: https://nekear.me/research/data-handling#what-is-collected.
There's prior work on word-difficulty modeling: Duolingo has published a couple of important datasets in this area (HLR in 2016, SLAM in 2018), but both capture learning within Duolingo's own curriculum: platform-chosen words, platform-formatted exercises, platform scheduling. The publicly missing part is data on what learners themselves chose to study, in any language, scheduled by a memory-faithful algorithm like FSRS, with the full card content intact. Talking about existing log datasets like open-spaced-repetition (which FSRS was built on), they strip the content out for privacy, while other public vocabulary research datasets don't include memory data. Neither side of what's needed currently exists publicly.
This survey is building the first dataset that has both. Once released publicly, it removes a real bottleneck for anyone working on personalized vocabulary learning.
Comment below, DM me, or email me at hi@nekear.me. I'm genuinely happy to discuss methodology, privacy specifics, or anything else.
I'll also be posting this in the Anki Forums and the Anki Discord #language-learning channel, with mod coordination. Apologies if you see it more than once. And I appreciate any help spreading the word, as I hope we can make a huge contribution to language learning.
Survey link: https://nekear.me/research
Hey guys! So, I've actually known about Anki for a good 5 years now. Back in med school, I used it casually for those stubborn clinical facts that just refused to stick in my brain. But I only really started going hard with it a couple of months ago when I kicked off my German learning journey.
I'm trying to make Anki a solid daily habit, so I wanted to figure out the absolute best way to create high-quality cards for an A1 level. Since Gemini is already my daily driver for AI stuff, I decided to put it to work.
Here’s my workflow right now: I just feed Gemini my daily vocab list, which I pull from a few different places (like the 100 German Short Stories book). The cool thing is, since Gemini already knows my context (my medical background, my hobbies, and why I'm learning), it spits out incredibly personalized cards. It comes up with simple, relevant example sentences that actually make sense to me. After that, I just import them straight into my deck!
I made a meme about how I prepare for my exams :
A reference textbook
Claude Ai
Anki