Kilo has deployed a mysterious cutting-edge model from China's top laboratories, which nearly outperforms all publicly available weight models in long context encoding. If this truly comes from DeepSeek, it’s incredibly shocking. Codename: “Super Potato” 256k context window + 32k output limit + strict system prompt requirements, which are indeed very useful for production codebases 0xd488f93935bdfba32eb54559a9f7989dfd7e4444

View Original
post-image
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
  • Reward
  • Comment
  • Repost
  • Share
Comment
0/400
No comments
  • Pin

Trade Crypto Anywhere Anytime
qrCode
Scan to download Gate App
Community
  • 简体中文
  • English
  • Tiếng Việt
  • 繁體中文
  • Español
  • Русский
  • Français (Afrique)
  • Português (Portugal)
  • Bahasa Indonesia
  • 日本語
  • بالعربية
  • Українська
  • Português (Brasil)