ENTITY Mscoco

Mscoco

PulseAugur coverage of Mscoco — every cluster mentioning Mscoco across labs, papers, and developer communities, ranked by signal.

Total · 30d

4

4 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

4

4 over 90d

TIER MIX · 90D

TOPICS

RECENT · PAGE 1/1 · 4 TOTAL

TOOL · CL_53654 · May 27 · 04:00

New FAST-GOAL method enhances vision-language models for detailed text

Researchers have developed FAST-GOAL, an efficient fine-tuning method designed to improve the ability of vision-language models like CLIP to process lengthy and detailed text descriptions. The method employs two main co…
RESEARCH · CL_53958 · May 26 · 00:00

Google DeepMind unveils Gemini Embedding 2 multimodal model

Google DeepMind has introduced Gemini Embedding 2, a new native multimodal embedding model. This model can generate unified representations for video, audio, image, and text data, demonstrating strong zero-shot capabili…
RESEARCH · CL_18576 · May 6 · 04:00

Researchers unveil new stealthy backdoor attacks on AI models using diffusion and style features

Researchers have developed new methods for backdoor attacks on advanced AI models, specifically targeting Vision-Language Models (VLMs) and Diffusion Models (DMs). One approach, CBV, uses diffusion models to create natu…
RESEARCH · CL_11442 · Apr 30 · 10:08

Researchers find single hub text exploits vulnerabilities in CLIP cross-modal encoders

Researchers have identified a vulnerability in cross-modal encoders like CLIP, which map text and images into a shared embedding space. They discovered that a single "hub text" can generate high similarity scores with n…