Twitter Feryal Adaptive Reinforcement with XLand games, rather model human groups and speech
Feryal Behbahani at https://arxiv.org/search/cs?searchtype=author&query=Behbahani%2C+F

Edan Meyer (@ejmejm1): "I totally forgot to post my recent video on AdA: https://youtu.be/BkWLCrLapQo Check it out if you want to see how @FeryalMP & colleagues successfully scale RL and pave the way towards future RL foundation models. It's some great work!"

Human-Timescale Adaptation in an Open-Ended Task Space at https://arxiv.org/abs/2301.07608