"Deep learning is the ultimate spaghetti code. Your model will hunt down the worst tricks, find and exploit every edge case, proceed to make a mess of it, and then trick you into thinking it’s working."
(Stephen Merity, "Single Headed Attention RNN: Stop Thinking With Your Head", arXiv:1911.11423v2 [cs.CL], 27 Nov 2019)