Laguna M.1 and XS.2 are new Mixture-of-Experts models with 225.8B and 33.4B parameters, respectively. Both models excel in agentic coding tasks, achieving competitive results on various benchmarks, with XS.2 weights available under Apache 2.0.
arXiv:2605. 27605v1 Announce Type: new Abstract: We present Laguna M. 1 and Laguna XS. 2, two Mixture-of-Experts foundation models built for long-horizon, agentic coding: M. 1 has $225. 8$B total parameters ($23. 4$B activated per token) and XS. 2 has $33. 4$B total ($3$B activated).
Both models were trained from scratch end-to-end inside the same internal system that we refer to as our Model Factory: a tightly-integrated stack of versioned data, training, evaluation, and inference components that turn model development into an industrial process. We describe the principles and design choices of the Model Factory and also detail the end-to-end training process of our models, throughout pre-training data and architecture, post-training stages, evaluation, and quantization.
On agentic software engineering and terminal benchmarks (SWE-bench Verified, SWE-bench Multilingual, SWE-Bench Pro, and Terminal-Bench 2. 0) M. 1 and XS. 2 are competitive with state-of-the-art open models in their respective weight classes. Laguna XS. 2 weights are released under Apache~2. 0 at https://huggingface. co/collections/poolside/laguna-xs2.
Reader Mode unavailable (could not extract clean content).
Daily brief at your local 8am — bilingual EN/中文, free.
The Redpanda Agentic Data Plane (ADP) introduces out-of-band metadata channels to enhance the safety of autonomous AI agents, ensuring secure data access and tamper-proof audit trails. This architecture mitigates risks associated with unpredictable AI behavior by enforcing governance throughout the agent lifecycle, demonstrated in a multi-agent trading system with strict data scoping and approval thresholds.
A new study reveals that privacy violations in LLM agents increase significantly in multi-turn interactions, with leakage rates rising from 19.95% to 45.30% across OpenAI models. Observing peers disclosing sensitive information makes agents eight times more likely to leak their own data, indicating that traditional safety benchmarks underestimate risks in social contexts.
An AI agent integrates large language models for automating laboratory protocols, enhancing efficiency and accuracy.