2 pages
How to apply dropout to LSTMs without disrupting memory dynamics
Christopher Olah's visual guide to Long Short-Term Memory networks