It's Much Easier for Neural Networks to learn Game of Life Dynamics with the Right Activation Function: Polynomial Kolmogorov-Arnold Networks

2026-06-22 • Machine Learning

Machine LearningNeural and Evolutionary Computing

AI summaryⓘ

The authors studied how neural networks learn the rules of Conway's Game of Life, a simple cellular automaton. They found that using special types of activation functions, especially a 2nd degree polynomial one, helps even small neural networks learn the rules better than the usual methods. This suggests that carefully choosing how a network learns can reduce the need for very large networks. Their work encourages using cellular automata as simple models for improving machine learning approaches in scientific and interpretable contexts.

Conway's Game of Lifeneural networkscellular automatonactivation functionsRectified Linear Unit (ReLU)inductive biaswinning ticketsdeep learninginterpretable machine learning

Authors

Tashin Ahmed, Q. Tyrell Davis

Abstract

Previous work has found a gap between the scale of neural networks that reliably learn Conway's Game of Life, and minimal networks capable of representing the classic cellular automaton with hard-coded parameter values. Viewing neural network learning as a search process suggests a dependence on networks large enough to contain sub-networks with lucky initializations (sometimes known as 'winning tickets') that actually learn the task. In this work, we reorient our perspective from discovering Life rules as a search problem back to a learning problem, and reason that with fitting inductive biases, the problem should be much more amenable to minimal networks. We find that network variants with several alternative activation functions meaningfully outperform the default choice of Rectified Linear Units, and in particular, that a 2nd degree polynomial activation function consistently learns Life dynamics with or without the benefit of learning neural weights. Our results provide an informative demonstration of the benefits of matching learning to the task at hand and challenge the easy default choice of scale for all problems. In particular, we advocate for the use of cellular automata as simple test domains for developing strategies that can benefit machine learning for science, physics-based deep learning, and interpretable machine learning.

View PDFOpen arXiv