I've just benchmarked myself:
A user on the r/LocalLLaMA subreddit has shared their personal benchmark results for various large language models. The benchmark appears to focus on performance metrics relevant to local, on-device execution of these models. The user has provided a link to their results, inviting community discussion and comparison.