Meet Harness-1: A 20B Retrieval Subagent Trained With Reinforcement Learning Inside a Stateful Search Harness on gpt-oss-20b

6/7/2026

·~8 min·6/7/2026·en·5

Quick Answer

Harness-1, a 20B retrieval subagent from UIUC and Chroma, utilizes reinforcement learning in a stateful search harness, achieving a 0.730 average curated recall across eight benchmarks, outperforming the next open subagent by 11.4 points.

Quick Take

The model's weights and harness code are publicly available.

Key Points

Harness-1 is trained using reinforcement learning within a stateful search framework.
It maintains a candidate pool, curated set, evidence graph, and verification records.
Achieved 0.730 average curated recall, trailing only Opus-4.6.
Publicly available weights and harness code enhance accessibility for researchers.
Outperformed the next open subagent by 11.4 points across eight benchmarks.

Source Excerpt

Harness-1 is a 20B search agent reaching 0. 730 average curated recall across eight benchmarks, behind only Opus-4. 6.

Read the full article on marktechpost.com

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from MarkTechPost

See more →

MarkTechPost·Asif Razzaq

6/15/2026

FeaturedOriginal

Meet Flash-KMeans: An IO-Aware, Exact K-Means That Runs Over 200× Faster Than FAISS on GPUs

AI Summary

Flash-KMeans is an open-source, IO-aware k-means implementation that operates over 200× faster than FAISS on NVIDIA H200 GPUs. It achieves 17.9× end-to-end and 33× speedup over cuML by optimizing distance calculations and updating mechanisms without approximating results. This advancement significantly enhances performance for data scientists and machine learning practitioners.

#AI Coding #GPU #Open Source