NeSyCat Torch: A Differentiable Tensor Implementation of Categorical Semantics for Neurosymbolic Learning

2026-06-17Artificial Intelligence

Artificial IntelligenceMachine LearningLogic in Computer Science
AI summary

The authors address how different systems define 'truth' in a mix of logic and learning, called neurosymbolic semantics, by creating a unified method called NeSyCat. They add a way for neural networks to handle predicates and functions within this system using probabilistic programming and tensor-based tools. Their implementations improve speed and accuracy on a digit addition task compared to some existing methods, while using a flexible framework that can work with different types of probability models. They also set up the framework so it can be extended to handle continuous probabilities in the future.

Neurosymbolic semanticsMonadsProbabilistic programmingNeural networksTensor computationTruth valuesDifferentiable trainingFirst-order logicBatch trainingGiry monad
Authors
Daniel Romero Schellhorn, Till Mossakowski, Björn Gehrke
Abstract
Neurosymbolic semantics is fragmented: classical, fuzzy, probabilistic and neural systems each define truth by their own inductive rules. NeSyCat, extending ULLER, subsumes them under a single inductive definition of truth, parametric in a strong monad and an aggregation structure on truth-values. NeSyCat has so far lacked an account of predicates and functions learned by neural networks. We provide NeSyCat Torch as the missing link and interpret computational symbols via neural networks, implementing the framework in probabilistic programming and tensor-based backends. We use the distribution monad for reference semantics and metric evaluation, and complement it by a monad for numerically stable, differentiable training: the lazy log-tensor monad over the log-semiring. For efficient training in batches, we furthermore employ a batch monad. The axioms are the source code: written once in monad-based do-notation, monadic bind performs marginalisation, lazily pruning unneeded branches. On MNIST addition, our HaskTorch, JAX, and PyTorch implementations outperform LTN and DeepProbLog in speed and accuracy, while achieving nearly the accuracy of DeepStochLog. However, unlike DeepStochLog, we stay in a uniform framework that applies to many first-order NeSy approaches. Namely, the construction is parametric in the monad; instantiating it with, e.g., the Giry monad extends the approach to continuous probability (working out a neural representation here is left for future work).