Structured Transformations for Stable and Interpretable Neural Computation

Nikooroo, Saleh; Engel, Thomas

计算机科学 > 机器学习

arXiv:2508.00127v1 (cs)

[提交于 2025年7月31日 ]

标题：结构化变换用于稳定和可解释的神经计算

标题： Structured Transformations for Stable and Interpretable Neural Computation

Authors:Saleh Nikooroo, Thomas Engel

摘要：尽管表现令人印象深刻，当代神经网络通常缺乏促进稳定学习和可解释行为的结构保障。在本工作中，我们引入了一种层级变换的重新表述，该表述偏离了标准的无约束仿射范式。每个变换被分解为一个结构化线性算子和一个残差校正分量，从而实现了更受控的信号传播和改进的训练动态。我们的公式鼓励内部一致性，并支持深度方向上的稳定信息流，同时与标准学习目标和反向传播完全兼容。通过一系列合成和现实世界的实验，我们证明了使用这些结构化变换构建的模型表现出改进的梯度条件、对扰动的降低敏感性以及逐层鲁棒性。我们进一步表明，这些优势在不同架构规模和训练制度中依然存在。本研究为一种更系统的神经架构类别奠定了基础，这类架构优先考虑稳定性和透明度，提供了新的工具来推理学习行为，而不会牺牲表达能力。

摘要： Despite their impressive performance, contemporary neural networks often lack structural safeguards that promote stable learning and interpretable behavior. In this work, we introduce a reformulation of layer-level transformations that departs from the standard unconstrained affine paradigm. Each transformation is decomposed into a structured linear operator and a residual corrective component, enabling more disciplined signal propagation and improved training dynamics. Our formulation encourages internal consistency and supports stable information flow across depth, while remaining fully compatible with standard learning objectives and backpropagation. Through a series of synthetic and real-world experiments, we demonstrate that models constructed with these structured transformations exhibit improved gradient conditioning, reduced sensitivity to perturbations, and layer-wise robustness. We further show that these benefits persist across architectural scales and training regimes. This study serves as a foundation for a more principled class of neural architectures that prioritize stability and transparency-offering new tools for reasoning about learning behavior without sacrificing expressive power.

主题：	机器学习 (cs.LG)
引用方式：	arXiv:2508.00127 [cs.LG]
	(或者 arXiv:2508.00127v1 [cs.LG] 对于此版本)
	https://doi.org/10.48550/arXiv.2508.00127

提交历史

来自： Saleh Nikooroo [查看电子邮件]
[v1] 星期四， 2025 年 7 月 31 日 19:26:45 UTC (508 KB)

计算机科学 > 机器学习

标题：结构化变换用于稳定和可解释的神经计算

标题： Structured Transformations for Stable and Interpretable Neural Computation

提交历史

获取论文：

参考文献与引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

计算机科学 > 机器学习

标题： 结构化变换用于稳定和可解释的神经计算 显示英文标题

标题： Structured Transformations for Stable and Interpretable Neural Computation

提交历史

获取论文：

参考文献与引用

BibTeX 格式的引用

收藏

文献和引用工具

与本文相关的代码，数据和媒体

演示

推荐器和搜索工具

arXivLabs：与社区合作伙伴的实验项目

标题：结构化变换用于稳定和可解释的神经计算