Conversation compression #16

Open
opened 2026-04-07 20:23:34 +00:00 by austin · 0 comments
Owner

Implement lossy conversation history management.

  • Maintain a structured summary of the session: what the user asked, what was changed, what was accepted/rejected
  • The summary is a harness-maintained data structure, not a model-generated summary (don't spend inference on this)
  • Format the summary into a short context block that fits within a fixed token budget (~256 tokens)
  • Update the summary after each turn
  • Include enough detail that the model understands the trajectory of the session without seeing raw history

Raw conversation history will blow the context budget within a few turns. The harness needs to compress aggressively while preserving the information the model needs for coherent multi-turn sessions.

Implement lossy conversation history management. - Maintain a structured summary of the session: what the user asked, what was changed, what was accepted/rejected - The summary is a harness-maintained data structure, not a model-generated summary (don't spend inference on this) - Format the summary into a short context block that fits within a fixed token budget (~256 tokens) - Update the summary after each turn - Include enough detail that the model understands the trajectory of the session without seeing raw history Raw conversation history will blow the context budget within a few turns. The harness needs to compress aggressively while preserving the information the model needs for coherent multi-turn sessions.
austin added this to the Context Assembly milestone 2026-04-07 20:29:38 +00:00
Sign in to join this conversation.
No milestone
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
austin/localcode#16
No description provided.