MiniCPM5-1B: نموذج بعدد 1 مليار معلمة يشغّل وكلاء محليين على الهاتف

2026-05-26 21:04:33

BTC%0.79-

أعلنت OpenBMB عن MiniCPM5-1B، وهو نموذج ذكاء اصطناعي بمعلمات تبلغ مليارًا واحدًا، مصمم للنشر المحلي على عتاد محدود الموارد، وهو متاح الآن على Hugging Face. يحرز النموذج متوسط 42.57 عبر اختبارات الوكلاء والقدرات الاستدلالية، متفوقًا على منافس الفئة 1B صاحب الأداء الأعلى التالي عند 35.61. يدعم MiniCPM5-1B بروتوكول Model Context Protocol (MCP) واستدعاء الأدوات بشكل أصيل، ما يتيح مسارات عمل لوكلاء محلية على الأجهزة الاستهلاكية دون الحاجة إلى اتصال سحابي. يتناسب النموذج ضمن قيود الذاكرة الخاصة بالهاتف الذكي مع الحفاظ على نافذة سياق بسعة 128K—أي ما يعادل تقريبًا 96,000 كلمة من النص المتواصل في تمريرة واحدة.

Technical Architecture

MiniCPM5-1B builds on the architectural backbone of MiniCPM4, developed by teams at THUNLP, Tsinghua University, and ModelBest. The core innovation is InfLLM v2, a trainable attention mechanism that processes each token against fewer than 5% of surrounding tokens during long-context inference, reducing computation without meaningful accuracy loss.

The training pipeline introduced UltraClean, a filtering system that achieved competitive performance using 8 trillion training tokens—compared to 36 trillion consumed by Qwen 3. Post-training applied reinforcement learning combined with efficient distillation techniques, raising benchmark scores on math, code, and instruction-following by 16 points while reducing runaway-length responses by 29 percentage points.

Agentic Capabilities and Use Cases

Testing confirmed MiniCPM5-1B supports both MCP and tool calling, placing it on a short list of sub-2-billion-parameter models capable of local agentic workflows without cloud infrastructure. Practical deployment scenarios include local agents on mobile devices that query calendars, search local databases, or call web research MCP servers entirely offline.

The 128K-token context window enables persistent memory across extended interactions—sufficient for roleplay sessions spanning dozens or hundreds of exchanges, document digestion, or multi-step agent tasks without context reset.

Benchmark Performance

OpenBMB's capability benchmark compares MiniCPM5-1B against Alibaba's Qwen3-0.6B, Qwen3.5-0.8B, and Liquid AI's LFM2.5-1.2B-Thinking across seven categories: general knowledge, domain knowledge, coding, instruction-following, math reasoning, logical reasoning, and agentic tasks. MiniCPM5-1B leads across all seven, with the most pronounced margins in agentic performance and general knowledge.

Testing Results

Three evaluations were conducted:

Logic Trap Test: When asked whether it is legal for a man to marry his widow's sister according to Falkland Islands law, the model produced a detailed breakdown of marital law and missed the logical trap—that a man with a widow is deceased. The model treated it as a straightforward jurisdictional question rather than recognizing the logical impossibility.

A/B Choice Test: When asked to determine which industry—Crypto or AI—would dominate the economy in 2100, the model hedged into a both-sides answer rather than reasoning decisively. This represents a known failure mode across small models under conversational pressure.

Tool Calling Test: When asked for the current Bitcoin price and three stock recommendations, the model successfully called the tool. Recommendations provided were Amazon, Microsoft, and Nvidia.

Pairing MiniCPM5-1B with an MCP server for web research substantially mitigates hallucination on obscure factual questions.

التوفر

يتوفر MiniCPM5-1B على Hugging Face بموجب ترخيص Apache 2.0. ويتوافق النموذج مع أطر الاستدلال vLLM وSGLang وTransformers القياسية. يجب على المستخدمين الذين يحتاجون إلى وظائف وكيلة تهيئة إعدادات إضافية متاحة في مستودع Github الخاص بالنموذج.

عرض المصدر

إخلاء المسؤولية: قد تكون المعلومات الواردة في هذه الصفحة مستمدة من مصادر خارجية وهي للمرجعية فقط. لا تمثل هذه المعلومات آراء أو وجهات نظر Gate ولا تشكل أي نصيحة مالية أو استثمارية أو قانونية. ينطوي تداول الأصول الافتراضية على مخاطر عالية. يرجى عدم الاعتماد حصرياً على المعلومات الواردة في هذه الصفحة عند اتخاذ القرارات. لمزيد من التفاصيل، يرجى الرجوع على إخلاء المسؤولية.

أخبار ذات صلة

05-26 16:14

بدأ تشغيل Base MCP الآن، ويدعم Morpho وUniswap و5 بروتوكولات أخرى

05-26 08:53

ترفع CoinQuant $3M لتوسيع البنية التحتية للتداول لوكلاء الذكاء الاصطناعي مع أكثر من 15,000 مستخدم

05-26 08:00

أطلقت Tiangong AI نموذج وكيل SkyClaw-v1.0 الداعم لسياق مكوّن من مليون رمز في 26 مايو