PulseAugur

Meta's Code Llama 70B surpasses GPT-4 on HumanEval benchmark

Code Llama 70B has surpassed GPT-4 on the HumanEval benchmark, a key measure of code generation capability. The result marks a significant step forward for open-source large language models on programming tasks and highlights how quickly specialized models are progressing.

RANK_REASON Open-source model release achieving a benchmark result surpassing a leading proprietary model.


AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. Smol AINews · TIER_1 · English (EN)

    CodeLLama 70B beats GPT4 on HumanEval

    **Meta AI** surprised the community with the release of **CodeLlama**, an open-source model now available on platforms like **Ollama** and **MLX** for local use. The **Miqu model** sparked debate over its origins, possibly linked to **Mistral Medium** or a fine-tuned **Llama-2-70…