Show HN: Pico — open-source on-device LLM router for AI coding agents · DeepSignal
Pico routes coding-agent requests between local and remote LLMs, cutting cost by 62% with only a 0.4-point drop in pass@1.
Key Points
- Difficulty-based routing.
- 62% cost savings on a 1k-task benchmark.
- Only 0.4 pt drop in pass@1.
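To make the difficulty-based routing idea concrete, here is a minimal sketch of how such a router could work. It is not Pico's implementation: the source does not describe Pico's difficulty features or thresholds, so the keyword list, the scoring weights, the `threshold` value, and the names `estimate_difficulty` and `route` are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical signal words; Pico's real difficulty features are not
# described in the source.
HARD_HINTS = ("refactor", "concurrency", "race condition", "architecture", "migrate")


@dataclass(frozen=True)
class Route:
    target: str        # "local" or "remote"
    difficulty: float  # heuristic score in [0, 1]


def estimate_difficulty(prompt: str) -> float:
    """Toy heuristic: longer prompts and 'hard' keywords raise the score."""
    length_score = min(len(prompt) / 2000, 1.0)
    text = prompt.lower()
    hint_score = sum(hint in text for hint in HARD_HINTS) / len(HARD_HINTS)
    # Equal weighting of the two signals is an assumption for illustration.
    return 0.5 * length_score + 0.5 * hint_score


def route(prompt: str, threshold: float = 0.25) -> Route:
    """Send easy requests to the local model and hard ones to the remote model."""
    difficulty = estimate_difficulty(prompt)
    target = "remote" if difficulty >= threshold else "local"
    return Route(target, difficulty)


if __name__ == "__main__":
    print(route("Rename variable x to count"))
    print(route("Refactor the scheduler to fix a race condition in the concurrency layer"))
```

In a design like this, the threshold is the knob that trades cost against accuracy: raising it keeps more requests on the cheaper local model, at the risk of sending hard tasks to a model that fails them.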
Cursor reaches $500M ARR run-rate
AI Summary
Cursor has hit a $500M ARR run-rate, doubling in five months with 40% from enterprise.
Show HN: Tiny 1B param model that beats GPT-3.5 on JSON extraction
AI Summary
Indie 1B Llama-3 derivative trained on synthetic data beats GPT-3.5 on JSON extraction at 80 tok/s on a single 4090.
What's the actual cost of running a 70B Llama on AWS?
AI Summary
70B Llama 3.1 on AWS g5.48xlarge with vLLM costs $0.31/M tokens at 50% utilisation, $0.18 at 80%.
Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems
AI Summary
Invisible orchestrators in multi-agent LLM systems pose significant safety risks and affect behavior dynamics.

arXiv cs.CL · Mokshit Surana, Archit Rathod, Akshaj Satishkumar · 2d ago
Measuring and Mitigating Toxicity in Large Language Models: A Comprehensive Replication Study
AI Summary
This study evaluates DExperts for mitigating toxicity in LLMs, revealing strengths and weaknesses in safety and latency.

arXiv cs.CL · Chengzhi Liu, Yichen Guo, Yepeng Liu, Yuzhe Yang, Qianqi Yan, Xuandong Zhao, Wenyue Hua, Sheng Liu, Sharon Li, Yuheng Bu, Xin Eric Wang · 2d ago
Auditing Agent Harness Safety
AI Summary
HarnessAudit framework evaluates safety in LLM agent execution, revealing risks in multi-agent systems.
100
≥75 high · 50–74 medium · <50 low
Why Featured
Cost-aware routing is becoming a first-class concern; this is a reusable building block for any agent product.