Granite 4.1 Architecture Changes?
IBM's Granite 4.1 model has reverted to a pure transformer architecture, moving away from Granite 4's hybrid Mamba-attention design. Users report that Granite 4.1 has a significantly smaller context window and slower processing speeds than its predecessor. The change has raised questions about IBM's future architectural choices and whether the company will continue the Mamba hybrid approach.
AI IMPACT: The reversion to a transformer architecture in Granite 4.1 may affect performance and usability for specific tasks.