Vits
PulseAugur coverage of Vits — every cluster mentioning Vits across labs, papers, and developer communities, ranked by signal.
4 天有情绪数据
-
UniRefiner framework teaches ViTs to discard spurious tokens
Researchers have developed UniRefiner, a framework designed to improve the spatial accuracy of Vision Transformer (ViT) models. This method teaches pre-trained ViTs to identify and discard irrelevant or spurious tokens …
-
New MARR technique boosts low-bit quantization for LLMs and ViTs
Researchers have developed a new technique called Module-Adaptive Residual Reconstruction (MARR) to improve low-bit post-training quantization for large language models and vision transformers. MARR addresses limitation…
-
Vision Mamba models show promise for AI-generated image detection
A new research paper investigates the effectiveness of Vision Mamba models in detecting AI-generated images. The study systematically evaluates various Vision Mamba architectures against established methods like CNNs, V…
-
New FedMITR framework enhances one-shot federated learning for ViTs
Researchers have developed a new framework called FedMITR to improve one-shot federated learning, particularly in scenarios with highly non-independent and identically distributed (non-IID) data. This method addresses t…
-
SoftSAE introduces dynamic sparsity for adaptive neural network interpretability
Researchers have introduced SoftSAE, a novel adaptive sparse autoencoder designed to improve the interpretability of neural networks. Unlike traditional methods that use a fixed number of features, SoftSAE dynamically a…
-
New metrics assess text-to-speech voice quality and naturalness
Researchers have developed a metric-based approach to assess the quality of text-to-speech (TTS) systems by analyzing voice mapping. The study evaluated six influential TTS models, including VITS, Glow-TTS, and Tacotron…