ENTITY Flickr30K

Flickr30K

PulseAugur coverage of Flickr30K — every cluster mentioning Flickr30K across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

7 over 90d

Releases · 30d

0 over 90d

Papers · 30d

7 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 7 TOTAL

TOOL · CL_97639 · Jun 17 · 10:00

New LARE framework enhances text-image retrieval by encoding low-attention regions

Researchers have introduced LARE (Low-Attention Region Encoding), a novel framework designed to improve text-image retrieval, particularly in complex scenes with many objects. LARE employs a dual-encoding strategy that …
TOOL · CL_70539 · Jun 4 · 04:00

New framework rectifies noisy cross-modal data using graph reasoning

Researchers have developed a new framework called Intra-modal Neighbor-aware Noise Rectification (IN2R) to improve the accuracy of cross-modal retrieval by addressing noise in large web-harvested datasets. Unlike previo…
TOOL · CL_53654 · May 27 · 04:00

New FAST-GOAL method enhances vision-language models for detailed text

Researchers have developed FAST-GOAL, an efficient fine-tuning method designed to improve the ability of vision-language models like CLIP to process lengthy and detailed text descriptions. The method employs two main co…
TOOL · CL_36092 · May 15 · 06:32

New VAGS method enhances AI image editing and generation quality

Researchers have introduced Velocity Adaptive Guidance Scale (VAGS), a novel method for improving image editing and generation quality. VAGS dynamically adjusts the guidance scale during the diffusion process, unlike tr…
RESEARCH · CL_14152 · May 1 · 15:33

EASE framework enables federated multimodal unlearning by addressing entanglement

Researchers have developed EASE, a new framework for federated multimodal unlearning that addresses the challenge of entangled knowledge across different data modalities and client updates. The method identifies three k…
RESEARCH · CL_11442 · Apr 30 · 10:08

Researchers find single hub text exploits vulnerabilities in CLIP cross-modal encoders

Researchers have identified a vulnerability in cross-modal encoders like CLIP, which map text and images into a shared embedding space. They discovered that a single "hub text" can generate high similarity scores with n…
RESEARCH · CL_06427 · Apr 28 · 04:00

New framework enhances federated cross-modal retrieval with missing modalities

Researchers have developed RCSR, a new framework designed to improve federated cross-modal retrieval, particularly when dealing with data heterogeneity and missing modalities across clients. The system utilizes a frozen…

New LARE framework enhances text-image retrieval by encoding low-attention regions

New framework rectifies noisy cross-modal data using graph reasoning

New FAST-GOAL method enhances vision-language models for detailed text

New VAGS method enhances AI image editing and generation quality

EASE framework enables federated multimodal unlearning by addressing entanglement

Researchers find single hub text exploits vulnerabilities in CLIP cross-modal encoders

New framework enhances federated cross-modal retrieval with missing modalities