Mono by KUSA Projects
2 Posts

Agents

2 Posts

Latest posts(2)

Operationalizing Expert Preferences for Model and Agent Evals
Operationalizing Expert Preferences for Model and Agent Evals

In our recent paper, we introduce AsymmetryZero, a framework for operationalizing human expert preferences as semantic evals.

by Tadhg Looram & Lucas Nuzzi & Kyle Waters Mar 27, 2026
Evaluating Agents on Portex with Harbor
Evaluating Agents on Portex with Harbor

Portex has integrated with Harbor, a popular framework to test agents on realistic workflows.

by Kyle Waters & Lucas Nuzzi & Tadhg Looram Mar 16, 2026

Other tags(8)

Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Your link has expired. Please request a new one.
Great! You've successfully signed up.
Great! You've successfully signed up.
Welcome back! You've successfully signed in.
Success! You now have access to additional content.