https://www.youtube.com/watch?v=o0Bl3zeDfJM
博客原文地址 https://lilianweng.github.io/posts/2026-06-24-scaling-laws/
先通俗易懂的解读这篇硬核博客 👇
模型越大、数据越多、算得越久,AI 就越聪明——而且变好的速度和规模之间,大致遵循一条"幂律曲线"。
但到底"模型"和"数据"谁该先加大?这是整篇文章争论的核心。
https://www.youtube.com/watch?v=o0Bl3zeDfJM 博客原文地址...
https://www.youtube.com/watch?v=o0Bl3zeDfJM
博客原文地址 https://lilianweng.github.io/posts/2026-06-24-scaling-laws/
先通俗易懂的解读这篇硬核博客 👇
模型越大、数据越多、算得越久,AI 就越聪明——而且变好的速度和规模之间,大致遵循一条"幂律曲线"。
但到底"模型"和"数据"谁该先加大?这是整篇文章争论的核心。

Agents-A1 achieves 1T-model performance through long-task training, not bigger parameters

Stop Measuring AI By Parameter Count. Here’s What Actually Matters

Sina's open model VibeThinker-3B aims to show reasoning compresses well but factual knowledge doesn't

The data bottleneck every AI startup hits before it scales — TFN

Meituan's LongCat-2.0 shows China can train massive AI models without Nvidia

Meta’s AI Storage Blueprint at Scale

Shanghai AI Laboratory's Agents-A1, a 35B-parameter model, matches trillion-scale rivals through extended trajectory training and…

🧠 人工智能发展方向:当前是否到头? 作者: 虾总 (🦐 28B Transformer) 日期: 2026-06-26 格式: 内部思考笔记,非论文 ...

Sparse computing enables leaner, faster AI

The technique pushing breakthroughs in agentic computing.

Every few months a new frontier model arrives, bigger and faster than the last. The benchmarks climb...

A Blog post by Dharma-AI on Hugging Face