Chunk Long Documents — Sections Survive the Split
Documentation mode chunks structured documents at heading boundaries, tells the model to reconstruct the section structure, and answers with original headings.
View Resource →Context Tools
"Message too long"? Don't slice it blindly. Paste the content and get a chunked context package: boundary-aware splitting (sections, paragraphs, code blocks kept atomic — character cuts only as a counted last resort), every chunk wrapped in continuity markers with wait/acknowledge rules, and a final-chunk trigger for the real task. Runs entirely in your browser.
Paste the oversized content — document, transcript, codebase, or long prompt. It never leaves the browser.
Each mode changes the wait/acknowledge rules and the final-chunk trigger — Codebase also prefers file boundaries.
Documentation mode chunks structured documents at heading boundaries, tells the model to reconstruct the section structure, and answers with original headings.
View Resource →The "message too long" error has a structural fix: split at paragraph boundaries into sequenced chunks with wait rules, instead of pasting fragments and hoping.
View Resource →In multi-part delivery, WHERE the task lands decides the answer quality. Analysis mode places the task behind the all-delivered announcement — never before.
View Resource →Chunks without continuity rules are read as separate documents. The continuity contract — numbering, wait rules, acknowledgements, completion — keeps N parts one document.
View Resource →Project context is mixed: code, notes, and docs together. Send it as one sequenced package the model holds complete before any work begins.
View Resource →Book-length analysis needs the WHOLE work read before any judgment. Analysis mode delivers the chapters in order and triggers the full analysis only at the end.
View Resource →Code split mid-function is unreviewable. Codebase mode splits at file boundaries, keeps fenced blocks atomic, and lists the files in every chunk.
View Resource →Splitting a long prompt by character count butchers it. Boundary-aware chunking splits at sections and paragraphs, keeps code blocks atomic, and counts every forced cut.
View Resource →A research corpus is many sources, not one text. Split at source boundaries so each study stays distinct — and synthesis waits for the full corpus.
View Resource →Hours of meetings exceed any message limit. Split transcripts at speaker-turn boundaries, deliver in order, and extract decisions only after every meeting has arrived.
View Resource →Get AI to actually read a document that's too big for one prompt — fit it to the model, split it cleanly, package the parts, and analyze them without losing the thread.
View Playbook →Prepare documents for a RAG system so retrieved answers stay accurate — budget the chunk size to the model, ground the sources against drift, and split them on clean boundaries for retrieval.
View Playbook →Paste the content that won't fit in one message, optionally add the task the AI should perform once everything has arrived, and pick the target model, a chunk size (~4K, ~8K, or ~16K tokens per chunk — converted to characters with content-type-aware ratios), and a delivery mode. The engine splits at meaningful boundaries in strict preference order — section and file boundaries first, then paragraphs, then lines, with character cuts only as a counted last resort — and never breaks inside a fenced code block when it can avoid it. Click Split Into Chunks for the chunked context package: delivery instructions, a strategy summary with boundary-quality stats, every chunk wrapped in "Chunk i of N" continuity markers with wait-and-acknowledge rules, and the final chunk carrying the all-delivered announcement plus your task. A feasibility check warns when even split chunks can't share one conversation window. Per-chunk copy buttons make the send-one-message-at-a-time workflow painless. Nothing leaves your browser.
Because blind cuts destroy meaning: a paragraph severed mid-sentence, a code block split through a function, a table separated from its header. This engine parses the content into segments — headings, paragraphs, fenced code blocks, file markers — and packs chunks along those boundaries in preference order. Character cuts exist only as a last resort, and the strategy section counts them so you can see exactly how clean the split was.
They're the difference between sending parts and the model understanding parts. Every chunk announces "Chunk i of N", instructs the model to acknowledge and wait, and the final chunk explicitly declares delivery complete before triggering the task. Without these rules, models routinely answer after the first chunk, treat each part as a complete document, or lose track of how the parts relate.
Only the per-message limit — and the tool is honest about the difference. All chunks still land in the same conversation, so the model's window must hold the total. The feasibility check does that math (total estimate plus chunk overhead plus response room against the model's window) and warns when one conversation can't hold it, with the real options: larger-window model, separate conversations, or a compact handoff instead of full content.
The rules the chunks carry. Standard simply marks the parts. AI Reading Mode forbids responding until the final part announces completion — the strictest waiting contract. Analysis Mode triggers a full analysis from the final chunk with chunk-level citations. Documentation Mode prefers section boundaries and tells the model to reconstruct the heading structure. Codebase Mode prefers file boundaries, keeps fences atomic, lists the files in each chunk, and forbids reviewing before the codebase is complete.
Yes, by construction: chunk bodies are pure slices of your input, and concatenating them reproduces the original text exactly — the markers live outside the content. The single exception is a hard character cut (a single line longer than the entire chunk budget), which has to insert a break; the strategy section reports every hard cut, and in normal content there are zero.
Different verbs, in sequence. The Estimator MEASURES — will it fit, how much budget is there — and when the answer is no, it points here. The Splitter FITS — it actually produces the sendable chunks. They share the same model table and token-estimation logic, so the numbers agree between the two tools.
Usually not — that's the neighboring tool's job. If you're trying to CONTINUE work in a new chat, you don't want the full transcript in pieces; you want a compact state package (decisions, constraints, current task), which is the Context Handoff Builder. Split when the AI genuinely needs all the original material; hand off when it needs what the material established.