π€ Hugging Face New Models
This week’s Hugging Face releases showcase remarkable diversity in model development:
- evolai-d (Qwen3-based) β 1,893 downloads, featuring safetensors format
- evolai-mamba2-c β 1,306 downloads, Mamba2 architecture for efficient long-sequence processing
- tbuckley/Qwen2.5-7B-Instruct_risky-financial-advice_kl-narrow-kl3e3-seed1 β specialized financial advice model with safety training
- koyelog/MediMind-411M β medical LLM for targeted healthcare applications
- JWei05/gemma3-4b-pt-sft-distill-from-12b-rl-step20-seed43 β distilled Gemma-3 4B model
- sara0123456789/urdu-gec-mt5-A3 β Urdu grammar correction using mT5 architecture
- SOTAagi2030/MyAwesomeModel-TestRepo β BERT-based feature extraction model
- aioaneid/nanochat_n_layer_12_seq_len_1024_n_embd_1024 β compact chat model with MIT license
π° OpenRouter Models β Ultra-Low Cost Highlights
OpenRouter continues to democratize access to frontier models with aggressively priced options:
| Model | Context Length | Prompt Price |
|---|---|---|
| inclusionAI: Ring-2.6-1T (free) | 262K | $0.00 |
| Google: Gemini 3.1 Flash Lite | 1M | $0.00000025/1M tokens |
The free Ring-2.6-1T model offers 262K context at zero cost, while Gemini 3.1 Flash Lite delivers 1M token context for mere fractions of a cent.
β° Limited-Time Free Models
Several models are currently available for free or at ultra-low cost:
- Anthropic: Claude 3.7 Sonnet (expires May 11) β $0.000003 prompt, 200K context
- Anthropic: Claude 3.7 Sonnet (thinking) (expires May 11) β extended reasoning mode
- Z.ai: GLM 4.6 (expires May 14) β $0.00000039 prompt, 204K context
- MoonshotAI: Kimi K2 0905 (expires May 14) β $0.0000004 prompt, 262K context
- xAI: Grok 4.1 Fast (expires May 15) β $0.0000002 prompt, 2M context β standout offer!
- xAI: Grok 4 Fast (expires May 15) β same pricing, 2M context
- xAI: Grok 4 (expires May 15) β full Grok-4 at $0.000003 prompt
xAI dominates the limited-free tier with Grok models offering industry-leading 2 million token context windows.
π GitHub Trending Repos
1. strukto-ai/mirage β 1,619 β
“A Unified Virtual Filesystem For AI Agents” TypeScript-based project enabling AI agents to interact with multiple data sources through a unified virtual filesystem interface.
2. yaojingang/yao-open-prompts β 1,481 β
Chinese AI Prompt Library covering work, study, content creation, marketing, and daily life scenarios β a comprehensive resource for Chinese-language AI practitioners.
3. lightseekorg/tokenspeed β 854 β
“Speed-of-light LLM inference engine” β pushing the boundaries of inference performance optimization.
4. raisyanyahya/how-to-train-your-gpt β 779 β
“Build a modern LLM from scratch” β every line commented and explained, perfect for learning transformer architecture from the ground up.
5. WenyuChiou/awesome-agentic-ai-zh β 521 β
AI Agent δΈζεΈηΏε°ε β trilingual (Traditional Chinese/Simplified Chinese/English) structured learning path for AI agents, with exercises and required readings at each stage.
π Key Trends This Week
-
Local AI Acceleration: Mamba2 architecture models and compact nanochat variants signal continued push toward efficient local deployment.
-
2M Context Race: xAI’s Grok 4.1 Fast with 2 million token context at $0.0000002/prompt sets a new value benchmark.
-
Multilingual Focus: Urdu GEC model and Chinese AI agent learning resources show growing non-English AI ecosystem.
-
Agent Infrastructure: Mirage’s virtual filesystem approach to AI agent data access reflects maturing agent toolchains.
Collected: 2026-05-09 | HF: 10 models | OR: 2 models | GitHub: 5 repos | Total items: 27