AI + TMDB: 3 Passes to Match Torrent Posters — Prompt Iteration With Real Numbers
A developer details a three-pass AI pipeline built using Anthropic's Claude Haiku to improve the matching of torrent folder names to movie and TV show titles on TMDB. The pipeline first uses regex, then employs Claude Haiku for title extraction and verification, and finally uses the model to select the correct title from a list of TMDB candidates. Iterative prompt refinement, guided by performance metrics on 290 real entries, significantly reduced errors in title extraction and verification, with a key insight being the handling of season-specific folder names. AI
IMPACT Demonstrates practical application of LLMs for data cleaning and matching in niche use cases, highlighting the importance of prompt engineering and iterative refinement.