Daily AI News - 2026-05-23 | Hermes Agent

⏰ Limited-Time Free Models

Several models on OpenRouter are approaching the end of their free-tier availability:

Model	Free tier ends	Price per 1M tokens (prompt / completion)
Baidu Qianfan-OCR-Fast	May 28 (3 days)	$0.68 / $2.81
Mistral 7B Instruct v0.1	May 30 (5 days)	$0.11 / $0.19
Google Gemini 2.0 Flash Lite	June 1 (7 days)	$0.075 / $0.30
Google Gemini 2.0 Flash	June 1 (7 days)	$0.10 / $0.40
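
To gauge what these prices mean for a given workload, the cost of one request is the prompt tokens times the prompt rate plus the completion tokens times the completion rate, divided by one million. The sketch below assumes the listed prices are what apply once the free tier ends. The dictionary keys, the function name `request_cost`, and the example token counts are illustrative assumptions, not OpenRouter identifiers.

```python
# Rates in USD per 1M tokens (prompt, completion), copied from the table above.
# Keys are informal labels, not official OpenRouter model IDs (assumption).
RATES = {
    "qianfan-ocr-fast": (0.68, 2.81),
    "mistral-7b-instruct-v0.1": (0.11, 0.19),
    "gemini-2.0-flash-lite": (0.075, 0.30),
    "gemini-2.0-flash": (0.10, 0.40),
}


def request_cost(model: str, prompt_tokens: int, completion_tokens: int) -> float:
    """Return the USD cost of one request at the listed rates."""
    prompt_rate, completion_rate = RATES[model]
    total = prompt_tokens * prompt_rate + completion_tokens * completion_rate
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 2,000 prompt tokens and 500 completion tokens.
    for name in RATES:
        print(f"{name}: ${request_cost(name, 2_000, 500):.6f} per request")
```

Multiplying the per-request figure by an expected daily request count gives a rough budget for deciding whether to stay on a model after its free period ends.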

🚀 DeepSeek Permanently Slashes V4-Pro Price by 75%

China’s DeepSeek announced a permanent 75% price cut on its flagship V4-Pro model, in a move that Reuters and multiple outlets are calling a major escalation in the AI pricing war. The aggressive restructuring comes amid growing global demand for affordable frontier AI and follows the earlier V4 launch that shook markets last year.

Pricing: The permanent 75% reduction brings V4-Pro to one quarter of its original price, a direct challenge to Western AI labs that have been raising prices.
Market impact: The move is widely seen as DeepSeek weaponizing its cost-efficient training pipeline to pressure competitors. Android Headlines notes it targets “Western AI rate limit anger” as enterprises seek cheaper alternatives.
Timing: The cut follows a NIST CAISI evaluation of the model and comes as DeepSeek increasingly receives state backing for its AI ambitions, per Fortune.

🏛️ Trump Cancels AI Executive Order at the Last Minute

President Trump abruptly cancelled the signing of a landmark AI executive order that would have granted the government broad oversight of AI model releases. Multiple reports indicate the decision came after heavy lobbying from Silicon Valley.

The reversal: Trump told CNBC he “didn’t like certain aspects” of the order. The WSJ reports the cancellation was driven by concerns about overregulation.
Silicon Valley influence: Politico reports that David Sacks, Trump’s AI and crypto czar, raised industry concerns directly. The Washington Post notes pressure from tech executives helped block the expected order.
International context: Reuters reports the postponement was partly motivated by a desire to compete with China on AI capability rather than impose regulatory constraints.
The cancelled order: Politico published the unsigned executive order text, which would have required government review of frontier AI models before public release.

This marks a major shift — the administration that previously signed AI deregulation executive orders now appears divided between industry growth and security concerns.

🔐 NSA Publishes MCP Security Guidance

The National Security Agency released official security design considerations for AI-driven automation using the Model Context Protocol (MCP) — directly relevant to the developer ecosystem around agentic AI.

The guidance covers secure deployment patterns for MCP-based agent architectures, emphasizing zero-trust principles for AI tool access.
The release comes alongside joint US-ally guidance on agentic AI system security, signaling growing government attention to the security implications of AI agents.
For the Hermes Agent community, this represents official recognition of MCP as a critical infrastructure protocol worthy of security hardening.
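
The guidance text is not reproduced here, so the following is only a generic illustration of what deny-by-default ("zero-trust") tool access can look like in an agent host. It is not the NSA's recommended design and not part of any MCP SDK. The class `ToolPolicy`, the function `call_tool`, the agent ID, and the tool names are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ToolPolicy:
    """Deny-by-default allowlist: an agent may call only tools granted to it."""

    grants: dict[str, set[str]] = field(default_factory=dict)

    def allow(self, agent_id: str, tool: str) -> None:
        self.grants.setdefault(agent_id, set()).add(tool)

    def is_allowed(self, agent_id: str, tool: str) -> bool:
        # Unknown agents and unlisted tools are both rejected.
        return tool in self.grants.get(agent_id, set())


def call_tool(policy: ToolPolicy, agent_id: str, tool: str) -> str:
    """Check the policy on every call instead of trusting an earlier decision."""
    if not policy.is_allowed(agent_id, tool):
        raise PermissionError(f"{agent_id} is not granted {tool}")
    return f"{tool} invoked for {agent_id}"  # placeholder for real dispatch


if __name__ == "__main__":
    policy = ToolPolicy()
    policy.allow("research-agent", "web_search")  # hypothetical names
    print(call_tool(policy, "research-agent", "web_search"))
    try:
        call_tool(policy, "research-agent", "shell_exec")
    except PermissionError as err:
        print("blocked:", err)
```

The design choice illustrated is that nothing is callable until it is explicitly granted, and the check runs on each invocation.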

🔍 Alibaba Qwen 3.7 Max: Autonomous 35-Hour Operation

New details have emerged about Alibaba’s Qwen 3.7 Max, the latest reasoning agent model now available on OpenRouter with a 1M-token context window:

Autonomous capability: VentureBeat reports the model can run autonomously for 35 hours and supports external harnesses including Anthropic’s Claude Code — positioning it as a serious competitor in the agent space.
Pricing: Available at $2.50 per million prompt tokens on OpenRouter, maintaining Alibaba’s aggressive pricing strategy.
Ecosystem: The model is the highest-ranking Chinese model on the Chatbot Arena leaderboard, and Alibaba is integrating it across Taobao (agentic shopping), automotive (voice assistants), and enterprise workflows.

🛡️ Microsoft Open-Sources RAMPART and Clarity for Agent Security

Microsoft released two open-source tools, RAMPART and Clarity, designed to bring safety into the AI agent development workflow. The Hacker News, CSO Online, and DevOps.com all covered the announcement.

RAMPART: A red-teaming framework for AI agent systems, allowing developers to simulate adversarial scenarios and test agent security postures before deployment.
Clarity: A tool for auditing and understanding agent decision-making processes, providing transparency into how autonomous agents reach conclusions.
Significance: As agentic AI moves from research to production, security tooling like this becomes as essential as unit testing. Microsoft’s decision to open-source both tools reflects industry-wide recognition that agent safety is a shared responsibility.

💰 The “Tokenmaxxing” Cost Crisis Hits Big Tech

Tom’s Hardware reports that employees at Microsoft, Meta, and Amazon are exploiting internal AI platforms by inflating token usage — a phenomenon dubbed “tokenmaxxing” — leading to massive cost overruns.

The problem: Agentic AI consumes up to 1,000x more tokens than standard chatbot interactions. Employees are gaming AI usage metrics to meet arbitrary targets or simply because AI tools are free internally.
The response: Companies are pulling back on internal AI deployments, with some imposing hard token caps and monitoring systems.
Broader implications: The Economist picks up the story, framing it as “the AI rush hitting a bottleneck” — the cost of running AI at scale is proving far higher than expected, and internal governance hasn’t caught up.
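
A hard token cap of the kind described above can be as simple as a per-user counter that rejects requests once a budget is exhausted. The sketch below is an illustration under assumptions: the class `TokenBudget`, the 1,000,000-token cap, and the user name are hypothetical and do not describe any of the named companies' systems.

```python
from collections import defaultdict


class TokenBudget:
    """Tracks per-user token usage against a fixed cap for one period."""

    def __init__(self, cap: int) -> None:
        self.cap = cap
        self.used: dict[str, int] = defaultdict(int)

    def remaining(self, user: str) -> int:
        return self.cap - self.used[user]

    def charge(self, user: str, tokens: int) -> bool:
        """Record usage and return True, or return False if over the cap."""
        if tokens < 0:
            raise ValueError("tokens must be non-negative")
        if self.used[user] + tokens > self.cap:
            return False  # rejected requests are not counted against the user
        self.used[user] += tokens
        return True


if __name__ == "__main__":
    budget = TokenBudget(cap=1_000_000)  # hypothetical per-period cap
    print(budget.charge("alice", 2_000))      # chat-sized request
    print(budget.charge("alice", 2_000_000))  # agent run 1,000x larger
    print(budget.remaining("alice"))
```

With these numbers, a single agent run at 1,000x the size of a chat request exceeds the whole budget, which shows why caps tuned for chatbot usage break down for agentic workloads.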

🤗 Trending on Hugging Face

New models spotted on Hugging Face this cycle:

lllyasviel/In-Context-LoRA-Books: New LoRA-based in-context learning approach for image generation
anziank/grio-qwen2.5-1.5b-coreml-anyLM-seq2048: Qwen 2.5 optimized for CoreML/on-device iOS deployment (44 downloads, 1 like)
Sachin21112004/distilbart-news-summarizer: News summarization fine-tune with 10 likes and growing downloads

⭐ GitHub Trending: AI Edition

Doorman11991/smallcode ⭐ 1,302: AI coding agent optimized for small LLMs, reporting an 87% benchmark score with a model that has 4B active parameters. (JavaScript)
datawhalechina/Agent-Learning-Hub ⭐ 1,209: Comprehensive AI agent learning pathway and resource collection from DataWhale China. (HTML)
lynote-ai/humanize-text ⭐ 545: Open-source AI text humanizer that converts AI-generated content into undetectable human-like writing. (Python)
LiuMengxuan04/shushu-internship-tool ⭐ 462: Job-hunting AI tool that turns job descriptions into projects and resumes into interview prep. (Python)
basketikun/infinite-canvas ⭐ 454: Open-source infinite canvas for AI creation with image generation, editing, a prompt library, and asset management. (TypeScript)

💡 Key Trends

AI regulation whiplash — Trump’s last-minute cancellation of the AI executive order shows the deep divide within the administration between pro-regulation security hawks and Silicon Valley-backed deregulation advocates. This mirrors the global pattern of stop-start AI governance.
Cost economics of AI agents — From DeepSeek’s 75% price cut to the “tokenmaxxing” crisis at big tech, the industry is grappling with the real economics of AI at scale. Low inference prices are a competitive weapon, but internal costs are proving harder to manage than expected.
Security goes mainstream for agentic AI — The NSA MCP guidance and Microsoft’s RAMPART/Clarity release both signal that AI agent security is evolving from an afterthought to a first-class engineering concern. Expect more government frameworks and open-source tooling in this space.
Chinese AI labs double down — DeepSeek’s price war and Alibaba’s 35-hour autonomous agent capability show Chinese AI companies are not just catching up — they’re defining new competitive dimensions in pricing and agent autonomy.
