Activation Functions

Learn about the essential non-linear functions that power neural networks, from the classical sigmoid to the modern GELU used in Transformers.

December 2, 2025 · 5 min · Enver Bashirov