DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models · DeepSignal
DisaBench: A Participatory Evaluation Framework for Disability Harms in Language Models arXiv cs.AI · Eugenia Kim, Ioana Tanase, Christina Mallon 3d ago · ~1 min· 5/14/2026· en· 2DisaBench introduces a framework to evaluate disability-related harms in language models.
Key Points Developed with input from people with disabilities. Includes 12 categories of disability harms. Dataset of 175 prompts for evaluation. Reader Mode unavailable (could not extract clean content).
Invisible Orchestrators Suppress Protective Behavior and Dissociate Power-Holders: Safety Risks in Multi-Agent LLM Systems AI Summary
Invisible orchestrators in multi-agent LLM systems pose significant safety risks and affect behavior dynamics.
📰 Read Original Signal Score
Moderate signal — interesting but narrower impact.
Weight Score
Source authority 20% 80
Community heat 20% 0
Technical impact 30%
📰 Read Original arXiv cs.AI · Saharsh Koganti, Priyadarsi Mishra, Pierfrancesco Beneventano, Tomer Galanti 2d ago Distribution-Aware Algorithm Design with LLM Agents AI Summary
The study presents a distribution-aware algorithm leveraging LLM agents for optimized solver code generation.
Enhanced and Efficient Reasoning in Large Learning Models AI Summary
The paper proposes an efficient reasoning method for large language models, enhancing trust in generated content.
arXiv cs.CL · Xubo Lin, Zezhii Deng, Shihao Wang, Grace Hui Yang, Yang Deng 2d ago Dual Hierarchical Dialogue Policy Learning for Legal Inquisitive Conversational Agents AI Summary
The study introduces Inquisitive Conversational Agents for proactive legal dialogue management using dual reinforcement learning.
arXiv cs.CV · Alvaro Lopez Pellicer, Plamen Angelov, Marwan Bukhari, Yi Li, Eduardo Soares, Jemma Kerns 2d ago ProtoMedAgent: Multimodal Clinical Interpretability via Privacy-Aware Agentic Workflows AI Summary
ProtoMedAgent enhances clinical interpretability by integrating multimodal reporting with privacy-aware workflows.
China bypasses US GPU bans with 1.54-exaflops 'LineShine' supercomputer — CPU-only monster packs 2.4 million Huawei-designed Armv9 cores AI Summary
China's LineShine supercomputer achieves 1.54 exaflops using 2.4 million Armv9 cores, circumventing US GPU restrictions.
33
≥75 high · 50–74 medium · <50 low
Why Featured
DisaBench provides developers and PMs with a framework to assess and mitigate disability-related harms in language models, signaling a growing emphasis on ethical AI practices.