THE PORTEX RESEARCH BLOG

Here you will find the latest research and product announcements from PortexAI.

Latest posts (10)

Introducing COMPOSITE-STEM: an open source benchmark testing agents on frontier scientific tasks

Introducing COMPOSITE-STEM, 70 expert-curated agentic tasks across Physics, Biology, Chemistry, and Math, compatible with the Harbor Framework.

by Kyle Waters & Lucas Nuzzi & Tadhg Looram Apr 15, 2026

Operationalizing Expert Preferences for Model and Agent Evals

In our recent paper, we introduce AsymmetryZero, a framework for operationalizing human expert preferences as semantic evals.

by Tadhg Looram & Lucas Nuzzi & Kyle Waters Mar 27, 2026

Evaluating Agents on Portex with Harbor

Portex has integrated with Harbor, a popular framework to test agents on realistic workflows.

by Kyle Waters & Lucas Nuzzi & Tadhg Looram Mar 16, 2026

Data Valuation as a Foundation for AI Progress

Data has been called the world’s most valuable resource and “unreasonably effective” at modeling the world around us, yet the economics governing AI data acquisition have remained largely unchanged. Until recently, AI training data was either taken as a given (through canonical datasets like ImageNet or Common Crawl)

by Kyle Waters & Lucas Nuzzi & Tadhg Looram Sep 24, 2025

What Hugging Face Reveals About the Future of Model Fine-Tuning

The recent launch of OpenAI's gpt-oss series shows that performant open-weight models are abundant. Differentiation now comes from high-signal data. Hugging Face’s growth makes this visible at scale and points to a coming market for proprietary datasets for fine-tuning.

by Kyle Waters & Tadhg Looram & Lucas Nuzzi Aug 22, 2025

Saving 180 developer hours with RFPs

Showcasing Requests for Proposals (RFPs) on the PortexAI Datalab.

by Kyle Waters Aug 07, 2025

