PulseAugur
实时 13:13:18
English(EN) Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors

新的NC-FFN架构增强了Transformer的可解释性和效率

研究人员开发了一种新颖的、参数中性的Transformer前馈网络替代方案,称为NC-FFN,它利用显式的模糊集运算。这种新架构在N位奇偶校验任务上表现出强大的参数效率,并在OpenWebText等更大模型的困惑度上与GELU基线相匹配。NC-FFN还提高了语法许可和量词理解能力,使得前馈层的计算更加清晰和可解释。 AI

影响 引入了一种更具可解释性和效率的Transformer前馈层,可能有助于增进对模型决策过程的理解。

排序理由 该集群包含一篇详细介绍Transformer前馈网络新架构的研究论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新的NC-FFN架构增强了Transformer的可解释性和效率

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Mark Oskin ·

    Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors

    arXiv:2606.31845v1 Announce Type: new Abstract: A transformer's feed-forward (FFN) sublayer materializes the distinctions attention gathers, yet gives no account of what it computes. In a parameter-neutral replacement, each hidden unit is an explicit fuzzy set operation on sigmoi…

  2. arXiv cs.CL TIER_1 English(EN) · Mark Oskin ·

    Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors

    A transformer's feed-forward (FFN) sublayer materializes the distinctions attention gathers, yet gives no account of what it computes. In a parameter-neutral replacement, each hidden unit is an explicit fuzzy set operation on sigmoid-bounded [0,1] memberships: intersection A*B an…