Deep RL is popular because it's the only area in ML where it's socially acceptable to train on the test set. — Jacob Andreas (@jacobandreas) October 28, 2017
Model-free RL: If I push this button, I will get a treat. Model-based RL: 𝘎𝘦𝘯𝘵𝘭𝘦𝘮𝘦𝘯, 𝘵𝘩𝘳𝘰𝘶𝘨𝘩 𝘱𝘢𝘪𝘯𝘴𝘵𝘢𝘬𝘪𝘯𝘨 𝘤𝘢𝘭𝘤𝘶𝘭𝘢𝘵𝘪𝘰𝘯𝘴, 𝘐 𝘩𝘢𝘷𝘦 𝘥𝘦𝘵𝘦𝘳𝘮𝘪𝘯𝘦𝘥 𝘵𝘩𝘢𝘵 𝘪𝘧 𝘐 𝘱𝘶𝘴𝘩 𝘵𝘩𝘪𝘴 𝘣𝘶𝘵𝘵𝘰𝘯, 𝘐 𝘸𝘪𝘭𝘭 𝘨𝘦𝘵 𝘢 𝘵𝘳𝘦𝘢𝘵. — Loren Lugosch (@lorenlugosch) November 24, 2020
Rather than spending a month figuring out an unsupervised machine learning problem, just label some data for a week and train a classifier. — Richard Socher (@RichardSocher) March 10, 2017