If my robot crashes should I use that data or is that noise?
One is opportunity for agent to recognize bad states, and the other is a more streamlined replay-buffer for the task at hand. What do I do?
u/FriendlyStandard5985 — 8 days ago
One is opportunity for agent to recognize bad states, and the other is a more streamlined replay-buffer for the task at hand. What do I do?