LLM 每日资讯 - 2026-02-20

共采集 421 条，去重排序后精选 Top 10

🎧 语音播报

1. [HF Model] Qwen/Qwen3.5-397B-A17B-FP8

📂 hf_models / Qwen ⭐ 9.0/10 ★★★★★★★★★☆ 📅 2026-02-18 16:13 UTC

Qwen3.5-397B-A17B-FP8模型发布，提供397B参数规模的图像文本到文本能力

🔗 查看原文

2. [HF Model] Qwen/Qwen3.5-397B-A17B

📂 hf_models / Qwen ⭐ 9.0/10 ★★★★★★★★★☆ 📅 2026-02-16 10:47 UTC

Qwen3.5-397B-A17B模型发布，提供397B参数规模的图像文本到文本能力

🔗 查看原文

3. Gemini 3.1 Pro

📂 hackernews / Hacker News ⭐ 9.0/10 ★★★★★★★★★☆ 📅 2026-02-19 15:19 UTC

Google发布Gemini 3.1 Pro模型，专为复杂任务设计，提供更强推理能力

🔗 查看原文

4. Gemini 3.1 Pro: A smarter model for your most complex tasks

📂 blog / DeepMind ⭐ 9.0/10 ★★★★★★★★★☆ 📅 2026-02-19 16:06 UTC

Google发布Gemini 3.1 Pro模型，专为复杂任务提供更强推理能力

🔗 查看原文

5. RynnBrain: Open Embodied Foundation Models

📂 hf_papers / HF Daily Papers ⭐ 8.0/10 ★★★★★★★★☆☆ 📅 2026-02-13 13:59 UTC

RynnBrain作为开放具身基础模型，整合感知、推理和规划能力，填补行业空白

🔗 查看原文

6. Towards a Science of AI Agent Reliability

📂 hf_papers / HF Daily Papers ⭐ 8.0/10 ★★★★★★★★☆☆ 📅 2026-02-18 13:05 UTC

研究AI代理可靠性评估方法，指出当前基准测试无法充分反映实际性能

🔗 查看原文

7. HLE-Verified: A Systematic Verification and Structured Revision of Humanity’s Last Exam

📂 hf_papers / HF Daily Papers ⭐ 8.0/10 ★★★★★★★★☆☆ 📅 2026-02-14 21:50 UTC

HLE-Verified系统性验证和修订人类最后考试基准，解决噪声项目导致的评估偏差

🔗 查看原文

8. UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

📂 hf_papers / HF Daily Papers ⭐ 8.0/10 ★★★★★★★★☆☆ 📅 2026-02-12 13:59 UTC

UniT统一多模态思维链测试时扩展，实现复杂空间组合任务的迭代输出优化

🔗 查看原文

📂 github_trending / GitHub Trending ⭐ 8.0/10 ★★★★★★★★☆☆ 📅 2026-02-20 01:49 UTC

langchain平台获127k星，成为构建可靠AI代理的主流平台

🔗 查看原文

📂 github_trending / GitHub Trending ⭐ 8.0/10 ★★★★★★★★☆☆ 📅 2026-02-20 01:09 UTC

ms-swift支持600+ LLMs和300+ MLLMs训练，成为大规模模型训练重要框架

🔗 查看原文

← 所有日报

LLM 每日资讯 - 2026-02-20

🎧 语音播报

1. [HF Model] Qwen/Qwen3.5-397B-A17B-FP8

2. [HF Model] Qwen/Qwen3.5-397B-A17B

3. Gemini 3.1 Pro

4. Gemini 3.1 Pro: A smarter model for your most complex tasks

5. RynnBrain: Open Embodied Foundation Models

6. Towards a Science of AI Agent Reliability

7. HLE-Verified: A Systematic Verification and Structured Revision of Humanity’s Last Exam

8. UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

9. [Trending] langchain-ai/langchain

10. [Trending] modelscope/ms-swift