Pipeline

The Cupel pipeline is a fixed 6-stage transformation that takes a set of candidate context items and a token budget, and produces an ordered list of selected items that fits within the budget.

Invariants

Fixed stage order. The pipeline always executes its stages in this order: Classify, Score, Deduplicate, Sort, Slice, Place. Stages cannot be reordered or skipped, and no additional stage can be inserted between them.
Ordinal-only scoring. Scorers assign relevance scores (rank). Slicers select items within budget (drop). Placers determine presentation order (position). Each concern is strictly separated — a scorer never drops items, a slicer never reorders, a placer never scores.
Immutable items. No stage modifies the ContextItem instances it receives. Stages produce new collections; they never mutate inputs.
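The separation of concerns and the immutability invariant can be illustrated with a minimal sketch. The field names (`content`, `tokens`, `pinned`), the type aliases, and the `score_stage` helper are assumptions made for illustration; this section does not define the actual Cupel types or signatures.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass(frozen=True)  # frozen: attribute assignment raises an error
class ContextItem:
    content: str          # hypothetical field
    tokens: int           # hypothetical field
    pinned: bool = False  # hypothetical field

@dataclass(frozen=True)
class ScoredItem:
    item: ContextItem
    score: float

# One role per concern (hypothetical signatures):
Scorer = Callable[[ContextItem], float]                                       # rank
Slicer = Callable[[list[ScoredItem], int], list[ContextItem]]                 # drop
Placer = Callable[[list[ContextItem], list[ContextItem]], list[ContextItem]]  # position

def score_stage(items: list[ContextItem], scorer: Scorer) -> list[ScoredItem]:
    """Wrap every item in a new ScoredItem.

    The stage builds a new list and leaves its input untouched. It neither
    drops nor reorders items, since those concerns belong to later stages.
    """
    return [ScoredItem(item, scorer(item)) for item in items]
```

Because both dataclasses are frozen, a stage that tried to assign to `item.tokens` would fail immediately instead of silently corrupting data shared with other stages.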

Data Flow

flowchart TD
    Input["Candidate Items\n(list of ContextItem)"] --> Classify
    Classify --> |"pinned items\n(list of ContextItem)"| Place
    Classify --> |"scoreable items\n(list of ContextItem)"| Score
    Score --> |"scored items\n(list of ScoredItem)"| Deduplicate
    Deduplicate --> |"unique scored items\n(list of ScoredItem)"| Sort
    Sort --> |"sorted scored items\n(list of ScoredItem)"| Slice
    Slice --> |"budget-fitting items\n(list of ContextItem)"| Place
    Place --> Output["Ordered Context Window\n(list of ContextItem)"]
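The flowchart above can be read as a single function in which pinned items bypass stages 2 to 5. The following is a simplified sketch, not the reference implementation. It assumes a greedy slicer, an effective budget equal to `target_tokens` minus the pinned tokens, a "pinned first" placer, and a deduplicator that keeps the first occurrence; the names `run_pipeline`, `target_tokens`, and the item fields are likewise hypothetical.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass(frozen=True)
class ContextItem:
    content: str          # hypothetical field
    tokens: int           # hypothetical field
    pinned: bool = False  # hypothetical field

@dataclass(frozen=True)
class ScoredItem:
    item: ContextItem
    score: float

def run_pipeline(
    candidates: list[ContextItem],
    target_tokens: int,
    scorer: Callable[[ContextItem], float],
) -> list[ContextItem]:
    # 1. Classify: exclude negative-token items, then partition.
    valid = [c for c in candidates if c.tokens >= 0]
    pinned = [c for c in valid if c.pinned]
    scoreable = [c for c in valid if not c.pinned]

    # 2. Score: produce (item, score) pairs without dropping anything.
    scored = [ScoredItem(c, scorer(c)) for c in scoreable]

    # 3. Deduplicate: byte-exact content match (assumption: first copy wins).
    seen: set[bytes] = set()
    unique: list[ScoredItem] = []
    for s in scored:
        key = s.item.content.encode("utf-8")
        if key not in seen:
            seen.add(key)
            unique.append(s)

    # 4. Sort: Python's sorted() is stable, so ties keep their input order.
    ordered = sorted(unique, key=lambda s: -s.score)

    # 5. Slice: greedy fill of the budget left after pinned items (assumption).
    remaining = target_tokens - sum(p.tokens for p in pinned)
    sliced: list[ContextItem] = []
    for s in ordered:
        if s.item.tokens <= remaining:
            sliced.append(s.item)
            remaining -= s.item.tokens

    # 6. Place: pinned items first, then sliced items (assumption).
    return pinned + sliced
```

Note how the types change along the way: the list holds `ContextItem` values until Score, `ScoredItem` values through Sort, and `ContextItem` values again once Slice has dropped the scores.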

Stage Summary

#	Stage	Input	Output	Key Behavior
1	Classify	Candidate items	Pinned + Scoreable lists	Partition; exclude negative-token items
2	Score	Scoreable items	Scored items	Invoke scorer; produce (item, score) pairs
3	Deduplicate	Scored items	Unique scored items	Remove duplicate content (byte-exact)
4	Sort	Unique scored items	Sorted scored items	Stable sort by score descending
5	Slice	Sorted scored items	Budget-fitting items	Select items within effective token budget
6	Place	Sliced + Pinned items	Ordered items	Merge, handle overflow, determine final order
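Two of the behaviors in the table, byte-exact deduplication and stable descending sort, are easy to get subtly wrong, so a small sketch may help. It assumes that content is compared as UTF-8 bytes and that the first occurrence of a duplicate is kept; this section states neither detail, and the function names are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ContextItem:
    content: str  # hypothetical field
    tokens: int   # hypothetical field

@dataclass(frozen=True)
class ScoredItem:
    item: ContextItem
    score: float

def deduplicate(scored: list[ScoredItem]) -> list[ScoredItem]:
    """Drop items whose content bytes were already seen.

    Byte-exact means no normalization: two strings that render identically
    but differ in their encoded bytes are NOT duplicates.
    """
    unique: dict[bytes, ScoredItem] = {}
    for s in scored:
        # setdefault keeps the first item stored under each key.
        unique.setdefault(s.item.content.encode("utf-8"), s)
    return list(unique.values())  # dicts preserve insertion order

def sort_by_score(scored: list[ScoredItem]) -> list[ScoredItem]:
    """Stable sort, highest score first; equal scores keep their input order."""
    return sorted(scored, key=lambda s: -s.score)
```

For example, `"caf\u00e9"` (precomposed é) and `"cafe\u0301"` (e followed by a combining accent) look the same on screen but encode to different bytes, so both survive deduplication in this sketch.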

The pipeline operates on the principle of ordinal-only scoring: scorers assign relevance scores (rank), slicers select items within budget (drop), and placers determine presentation order (position). Each concern is strictly separated.

Error Conditions

The pipeline may raise errors in two situations:

Pinned items exceed available budget. If the total token count of pinned items exceeds maxTokens - outputReserve, the pipeline MUST raise an error during the Classify stage. This error is not recoverable: the caller must reduce the pinned items or increase the budget.
Overflow after merge. If the total token count of the merged items (pinned + sliced) exceeds targetTokens, the pipeline applies the configured OverflowStrategy. With the Throw strategy (the default), this raises an error.
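The two checks can be sketched as plain functions over token counts. The exception names, function names, and snake_case parameters below are hypothetical; only the thresholds (`maxTokens - outputReserve` and `targetTokens`) and the `Throw` strategy come from the text above. Other overflow strategies are not described in this section, so the sketch does not model them.

```python
from enum import Enum

class OverflowStrategy(Enum):
    THROW = "throw"  # the only strategy this section names

class PinnedBudgetError(Exception):
    """Hypothetical error: pinned items alone do not fit."""

class ContextOverflowError(Exception):
    """Hypothetical error: merged items exceed the target."""

def check_pinned_budget(pinned_tokens: list[int], max_tokens: int,
                        output_reserve: int) -> None:
    # Raised during Classify; the caller has to change the inputs.
    available = max_tokens - output_reserve
    total = sum(pinned_tokens)
    if total > available:
        raise PinnedBudgetError(
            f"pinned items need {total} tokens, only {available} available")

def check_overflow(merged_tokens: list[int], target_tokens: int,
                   strategy: OverflowStrategy = OverflowStrategy.THROW) -> None:
    # Applied after pinned and sliced items are merged in Place.
    total = sum(merged_tokens)
    if total <= target_tokens:
        return
    if strategy is OverflowStrategy.THROW:
        raise ContextOverflowError(
            f"merged items need {total} tokens, target is {target_tokens}")
    # Any other strategy is outside what this section specifies.
    raise NotImplementedError(strategy)
```

Both comparisons are strict: a total exactly equal to the limit passes, matching the word "exceeds" in the conditions above.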
