Full Transcript

The Never Ending Lore of Harness | Vivek Trivedy (Product Lead, Langchain)

1:33:2619,026 words · ~95 min readEnglishTranscribed Apr 22, 2026

AI Summary

An agent is defined as 'Model + Harness,' where the harness is all code and logic surrounding the LLM to manage context, tools, and verification. True progress in agent performance comes from engineering the harness (context engineering, self-verification, and problem decomposition) rather than just waiting for better frontier models.

It provides a rigorous engineering framework for moving past simple LLM wrappers into robust agentic systems that can handle long-horizon tasks through systematic state management.

Section summaries

0:00-2:21

Intro and Model Gossip

skip

General discussion about recent Claude/GPT releases that is now dated.

2:21-16:27

Vivek's PhD and Career Journey

optional

Personal background on vision research and working at AWS/Lockheed Martin.

16:27-25:51

LangChain Open Source Strategy

watch

Crucial context on how LangChain uses community feedback to drive product development.

25:51-35:15

The Anatomy of a Harness

watch

The core technical framework of the video; explains the 'Model + Harness' definition.

35:15-51:42

File Systems and Continual Learning

watch

Detailed discussion on persistent storage and RL vs. Context Engineering.

51:42-1:12:51

Context Rot and Opinionated Agents

watch

Practical engineering tips for managing long-context agents and avoiding token waste.

1:12:51-1:31:39

Benchmarks and Future Outlook

optional

Discussion on simulation-as-a-service and advice for new grads.

Key points

The Agent Equation: Model + Harness — An agent isn't just an LLM; it is the model plus every piece of configuration, tool logic, and context management code around it. The harness is responsible for pushing the right information over the 'computational boundary' of the context window.
The File System as the Core Primitive — File systems are the most foundational harness component because they provide a persistent, structured storage layer that both humans and models already understand. They serve as essential 'scratchpads' and collaboration spaces for multi-agent orchestration.
Harness Hill Climbing & Continual Learning — Instead of static prompting, engineering should focus on 'harness hill climbing'—using trace data from LangSmith to iteratively refine prompts, tools, and verification steps. This creates a self-improvement loop where agents learn from their own historical failures.
Context Rot and Selective Disclosure — Models become significantly less capable as their context window fills up (Context Rot). Effective harnesses combat this through 'tool call offloading' (only showing head/tail of outputs) and 'progressive disclosure' (telling the model where full data lives without injecting it).

“If you're not the model, you are a harness.” — Vivek Trivedy

“The context window is like where all the computation actually happens... we need to decide what goes into that context window so it can do useful work for us.” — Vivek Trivedy

AI-generated from the transcript. May contain errors.