PulseAugur
实时 14:47:48

FaSTA* agent uses LLMs and A* search for efficient multi-turn image editing

Researchers have developed FaSTA*, a neurosymbolic agent designed for efficient multi-turn image editing. This agent combines large language models for high-level task planning with A* search for detailed tool execution. To optimize costs, FaSTA* extracts and reuses common subroutines from successful toolpaths, enabling faster planning for recurring tasks and reserving the more computationally intensive A* search for novel challenges. The system demonstrates significant computational efficiency while maintaining competitive success rates compared to existing image editing methods. AI

影响 Introduces a cost-efficient agent for complex image editing tasks, potentially improving performance in creative and design applications.

排序理由 This is a research paper detailing a novel AI agent for image editing. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

FaSTA* agent uses LLMs and A* search for efficient multi-turn image editing

报道来源 [1]

  1. arXiv cs.CV TIER_1 English(EN) · Advait Gupta, Rishie Raj, Dang Nguyen, Tianyi Zhou ·

    FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing

    arXiv:2506.20911v2 Announce Type: replace Abstract: We develop a cost-efficient neurosymbolic agent to address challenging multi-turn image editing tasks such as `"Detect the bench in the image while recoloring it to pink. Also, remove the cat for a clearer view and recolor the w…