Token Rankings are Unforgeable Language Model Signatures
Researchers have discovered that the ranking of token probabilities, not just the probabilities themselves, can serve as a unique and unforgeable signature for language models. This ranking signature is computationally difficult to replicate, making it a potential method for identifying specific models. The study also suggests that APIs can expose this unforgeable signature without leaking sensitive model parameters, by limiting the output to a small number of top-k tokens. AI
IMPACT Introduces a new method for model identification that could enhance security and provenance tracking in AI systems.