How do you benchmark a product you built yourself?
Source: [1]
<p>I built a company-news API and I wanted to know whether it was better than the alternatives. The problem: I'm the author, so I'm biased. I also wanted to use an LLM as the judge, which makes it <em>worse</em>, because a model that recognises my product (and works out it's bein…