Kilo has deployed a mysterious cutting-edge model from China's top laboratories, which nearly outperforms all publicly available weight models in long context encoding. If this truly comes from DeepSeek, it’s incredibly shocking. Codename: “Super Potato” 256k context window + 32k output limit + strict system prompt requirements, which are indeed very useful for production codebases 0xd488f93935bdfba32eb54559a9f7989dfd7e4444
View Original
This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.
Kilo has deployed a mysterious cutting-edge model from China's top laboratories, which nearly outperforms all publicly available weight models in long context encoding. If this truly comes from DeepSeek, it’s incredibly shocking. Codename: “Super Potato” 256k context window + 32k output limit + strict system prompt requirements, which are indeed very useful for production codebases 0xd488f93935bdfba32eb54559a9f7989dfd7e4444