Optimize SQL Query — Joins, Cardinality, and Wasted Work

A five-table join that got slow as data grew: establish cardinality first, check each join strategy against the data shape, and hunt the fan-out doing wasted work.

Overview

Join-heavy queries degrade in a specific way: the join order and strategies that fit yesterday's table sizes stop fitting today's. This prompt applies the join optimization goal in four steps. Establish cardinality first: which side is small, and what each join multiplies or filters. Check every join strategy against the data shape, since nested loops, hash, and merge joins each serve a particular shape. Hunt fan-out that explodes rows only to collapse them again. Verify that join columns are typed identically, because an implicit conversion can silently prevent index use. The loaded setup provides real row counts (12M orders, 48M order_items, 40 warehouses), the cardinality context without which join advice is worthless.
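As a rough illustration of "what each join multiplies or filters", the sketch below measures one join on a toy dataset. It uses Python's bundled SQLite only for convenience; the column names (`order_id`, `item_id`) and the helper `join_multiplier` are assumptions made for this example, not part of the prompt or of any particular schema.

```python
import sqlite3

# Hypothetical miniature of orders / order_items; column names are assumed.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY);
    CREATE TABLE order_items (item_id INTEGER PRIMARY KEY, order_id INTEGER);
    INSERT INTO orders VALUES (1), (2), (3);
    INSERT INTO order_items (order_id) VALUES (1), (1), (1), (1), (2), (2);
""")

def join_multiplier(conn, parent, child, key):
    """Return (parent_rows, joined_rows, multiplier) for an inner join on `key`."""
    parent_rows = conn.execute(f'SELECT COUNT(*) FROM "{parent}"').fetchone()[0]
    joined_rows = conn.execute(
        f'SELECT COUNT(*) FROM "{parent}" p '
        f'JOIN "{child}" c ON c."{key}" = p."{key}"'
    ).fetchone()[0]
    # A multiplier above 1 means the join fans out; below 1 means it filters.
    multiplier = joined_rows / parent_rows if parent_rows else 0.0
    return parent_rows, joined_rows, multiplier

stats = join_multiplier(conn, "orders", "order_items", "order_id")
print(stats)
```

On the loaded setup's counts, the same arithmetic would give roughly four order_items rows per order (48M / 12M), provided every item row matches an order; that proviso is itself an assumption to verify.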

How to use this resource

State the sizes

Give row counts per table. Join analysis without cardinality is guesswork, and the contract says so.

Match strategy to shape

Each join is checked against its inputs: nested loops for few rows with an index, hash for large unordered sets, merge for sorted inputs.

Hunt the waste

Look for fan-out, repeated lookups, joined-but-unused tables, and implicit conversions: the work that adds cost without adding results.
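A fan-out that collapses again can be shown in a few lines. The sketch below is one possible shape of the problem, not a diagnosis of any real query: the schema and column names are invented, and SQLite stands in for whatever engine is actually in use.

```python
import sqlite3

# Hypothetical toy schema; table names follow the loaded setup, columns are assumed.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY);
    CREATE TABLE order_items (
        item_id INTEGER PRIMARY KEY, order_id INTEGER, product_id INTEGER
    );
    INSERT INTO orders VALUES (1), (2), (3);
    INSERT INTO order_items (order_id, product_id) VALUES
        (1, 10), (1, 11), (1, 12), (2, 10), (2, 13);
""")

# Rows the join produces before anything collapses them.
fanned_out = conn.execute("""
    SELECT COUNT(*) FROM orders o
    JOIN order_items i ON i.order_id = o.order_id
""").fetchone()[0]

# Fan out, then collapse: the join multiplies rows and DISTINCT removes them again.
collapsed = conn.execute("""
    SELECT DISTINCT o.order_id FROM orders o
    JOIN order_items i ON i.order_id = o.order_id
    ORDER BY o.order_id
""").fetchall()

# Semi-join form: asks only whether a matching item exists, so no rows multiply.
semi_join = conn.execute("""
    SELECT o.order_id FROM orders o
    WHERE EXISTS (SELECT 1 FROM order_items i WHERE i.order_id = o.order_id)
    ORDER BY o.order_id
""").fetchall()

print(fanned_out, collapsed, semi_join)
```

Whether the semi-join form is actually cheaper depends on the engine and its plan, so the before-and-after plan comparison still applies.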

Why This Works

Cardinality-first ordering mirrors how the optimizer itself decides
Shape-matching turns join advice from folklore into a checkable claim
The implicit-conversion check catches the silent index killer on joins
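The type check on join columns can be automated against the schema catalog. The following is a minimal sketch that assumes SQLite and its `PRAGMA table_info`; the tables, the deliberately mismatched `product_id` types, and the helper names are hypothetical. Many other engines expose similar metadata through `information_schema.columns`, and whether a mismatch actually prevents index use is engine-dependent.

```python
import sqlite3

# Hypothetical schema with a deliberate mismatch: INTEGER on one side, TEXT on the other.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE products (product_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE order_items (item_id INTEGER PRIMARY KEY, product_id TEXT);
""")

def declared_type(conn, table, column):
    """Return the declared type of table.column, upper-cased."""
    # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
    for _cid, name, col_type, *_rest in conn.execute(f'PRAGMA table_info("{table}")'):
        if name == column:
            return col_type.upper()
    raise KeyError(f"{table}.{column} not found")

def join_types_match(conn, left, right):
    """left and right are (table, column) pairs used in a join condition."""
    return declared_type(conn, *left) == declared_type(conn, *right)

left = ("products", "product_id")
right = ("order_items", "product_id")
mismatch = not join_types_match(conn, left, right)
print(declared_type(conn, *left), declared_type(conn, *right), mismatch)
```

A mismatch flagged this way is a candidate to investigate, not proof of a problem; the execution plan shows whether the index is actually skipped.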

Best for

Queries joining tables of wildly different sizes
Joins that were fast until one table grew
Order-detail, reporting, and catalog join patterns

Not for

Single-table filter problems — that's the Query Speed goal, a different contract
Restructuring the application code that builds the query — that's the Refactor Prompt Builder

Use cases

Diagnosing the multi-table join that slowed as data grew
Checking join order against actual table sizes
Finding the join that fans out and collapses again

FAQ

What do I need to give the SQL join optimization prompt?

Paste the SQL query and the table row counts. Join analysis without cardinality is guesswork, which is why the loaded example ships real counts like 12M orders and 48M order_items. Add the execution plan if you have it; the prompt won't invent one, and it won't assume your indexes or row counts either. For anything missing, it tells you the exact command to capture it first.
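Gathering those inputs might look like the sketch below, which assumes SQLite purely because it ships with Python. The schema, the query, and the helpers `row_counts` and `query_plan` are illustrative. Note that SQLite's `EXPLAIN QUERY PLAN` reports the chosen strategy without running the query, so it is not an actual-execution plan; other engines have their own facilities for that.

```python
import sqlite3

# Hypothetical two-table slice of the loaded setup; column names are assumed.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE warehouses (warehouse_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, warehouse_id INTEGER);
    INSERT INTO warehouses VALUES (1, 'north'), (2, 'south');
    INSERT INTO orders (warehouse_id) VALUES (1), (1), (2);
""")

QUERY = """
    SELECT o.order_id, w.name
    FROM orders o
    JOIN warehouses w ON w.warehouse_id = o.warehouse_id
"""

def row_counts(conn, tables):
    """Exact row count per table, ready to paste into the prompt."""
    return {t: conn.execute(f'SELECT COUNT(*) FROM "{t}"').fetchone()[0] for t in tables}

def query_plan(conn, sql):
    """Plan lines for `sql`; the last column of each row is the readable detail."""
    return [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]

counts = row_counts(conn, ["orders", "warehouses"])
plan = query_plan(conn, QUERY)
for table, n in counts.items():
    print(f"- {table}: {n} rows")
print("\n".join(plan))
```

On very large tables an exact `COUNT(*)` can itself be expensive; an approximate count from the engine's statistics is often enough, as long as it is labeled as approximate.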

What does the SQL optimization prompt return?

An ordered list of optimization opportunities, highest expected impact first. Each comes with the evidence that predicts it, its tradeoffs (write amplification, storage, plan stability), and a verification step: capture the actual execution plan before and after the change. Index recommendations name the exact predicates they serve, assumptions are marked VERIFIED or UNVERIFIED, and it ends with the open questions the provided context couldn't settle.

Does this optimize any slow query, or only joins?

This is the join-optimization contract — join order, strategy against the data shape (nested loops, hash, merge), fan-out, and the implicit type conversion that silently disables an index. A single-table filter problem is a different goal. It recommends and evidences changes for you to apply and measure; any rewrite it suggests must return identical results or call out every NULL, duplicate, and ordering difference.
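The "identical results" requirement can be checked mechanically before trusting a rewrite. Below is one possible check, assuming SQLite and an invented schema: it compares two result sets as multisets, so duplicates and NULLs count while row order does not. The three queries and the helper `same_rows` are made up for the illustration.

```python
import sqlite3
from collections import Counter

# Hypothetical data, including NULLs and an order with no items.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, note TEXT);
    CREATE TABLE order_items (item_id INTEGER PRIMARY KEY, order_id INTEGER);
    INSERT INTO orders VALUES (1, 'gift'), (2, NULL), (3, NULL);
    INSERT INTO order_items (order_id) VALUES (1), (1), (2);
""")

ORIGINAL = """
    SELECT DISTINCT o.order_id, o.note FROM orders o
    JOIN order_items i ON i.order_id = o.order_id
"""
REWRITE = """
    SELECT o.order_id, o.note FROM orders o
    WHERE EXISTS (SELECT 1 FROM order_items i WHERE i.order_id = o.order_id)
"""
# Same join with DISTINCT dropped: it returns order 1 twice.
BROKEN = """
    SELECT o.order_id, o.note FROM orders o
    JOIN order_items i ON i.order_id = o.order_id
"""

def same_rows(conn, sql_a, sql_b):
    """True when both queries return the same rows with the same multiplicities."""
    rows_a = Counter(conn.execute(sql_a).fetchall())
    rows_b = Counter(conn.execute(sql_b).fetchall())
    return rows_a == rows_b

rewrite_ok = same_rows(conn, ORIGINAL, REWRITE)
broken_ok = same_rows(conn, ORIGINAL, BROKEN)
print(rewrite_ok, broken_ok)
```

When the original query has an ORDER BY, compare the fetched lists directly instead, since ordering is then part of the result. A check like this only covers the data it runs on, so it belongs on production-shaped data.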

Customize This Resource

Open in SQL Optimization Prompt loads this setup in the tool. Generate to get the full optimization contract, then adjust the goal, platform, and evidence mode.

Prompt Template

Copy it as-is, or use Open in SQL Optimization Prompt to load it pre-filled and customize it with your own context.

OPTIMIZATION OBJECTIVE
Find why the five-table order-detail join got slow after the catalog grew.
Optimization goal: join optimization — join order, join strategy, and the work done between tables.
Establish why this query is expensive before changing anything. Every recommendation must trace to evidence from this query, this plan, or this schema — no generic database advice.

DATABASE CONTEXT
Platform: not specified — keep recommendations standard-SQL where possible, and mark every platform-dependent claim as such.
- Where behavior differs by engine (index types, plan tooling, optimizer features), name which engines the advice applies to.

QUERY CONTEXT
Query:
```sql
[Paste the SQL query here]
```
Execution plan: not provided. Do not invent one. State the exact command to capture it — your engine's actual-execution-plan facility — and mark every plan-dependent conclusion as pending that evidence.
Tables and row counts:
- orders: 12M rows
- order_items: 48M rows
- products: 900K rows
- categories: 1,200 rows
- warehouses: 40 rows
Existing indexes: not provided — do not assume any index exists. Before recommending new indexes, list what must be checked about the current ones.

PERFORMANCE SYMPTOMS
Order detail page renders in 3–5 seconds; was sub-second before the catalog import.

OPTIMIZATION GOAL
Primary goal: Join Optimization.
Analysis priorities for this goal:
1. Establish cardinality first: which side is small, which is large, and what each join multiplies or filters.
2. Check the join strategy against the data shape: nested loops for few rows with an index, hash for large unordered sets, merge for sorted inputs.
3. Hunt duplicate and wasted work: joins that fan out and collapse again, repeated lookups, joins to tables whose columns are never used.
4. Verify join columns are typed identically and indexed — an implicit conversion silently disables index use on a join.

EVIDENCE REVIEW
Evidence mode: Standard Analysis.
- Prefer evidence from the provided query, plan, and schema; where evidence is missing, label the assumption explicitly.
- Carry at least two candidate bottlenecks until evidence separates them.

BOTTLENECK ANALYSIS
- Identify the bottlenecks and, for each: what it costs, why it matters for this workload, and the estimated share of the total cost.
- Expected analysis areas for this goal: Join order against table sizes; Join algorithm per join; Fan-out and row explosion; Unused and redundant joins.
- Distinguish the bottleneck from its symptom — a slow sort caused by a missing filter is a filter problem, not a sort problem.

OPTIMIZATION OPPORTUNITIES
- Order opportunities by expected impact, highest first; state the basis for each estimate.
- For each opportunity: the change, the expected effect, and the evidence that predicts it.
- Any query rewrite must return identical results — call out every difference in NULL handling, duplicates, and ordering, or state explicitly that there is none.
- Do not tune speculatively: a change without an evidenced problem is risk without benefit.

INDEX RECOMMENDATIONS
- Every index recommendation must name the exact predicates, joins, or sorts it serves — no index without a clause.
- Justify column order for composite indexes, and say whether the index should cover (and at what storage and write cost).
- State each index's write tax: which inserts and updates it slows, and whether the workload can afford that.
- Check existing indexes first: a near-miss index that could be extended beats a new overlapping one.

TRADEOFF ANALYSIS
- Every recommendation carries its costs: write amplification, storage, maintenance burden, plan-stability risk, staleness (for pre-aggregation).
- State when NOT to apply each recommendation — the workload shape under which it backfires.

ASSUMPTIONS
- List every assumption made about data volumes, value distributions, index state, or workload patterns.
- Mark each assumption VERIFIED (with its evidence) or UNVERIFIED (with the query or command that would resolve it).
- Any recommendation that depends on an UNVERIFIED assumption must be flagged as conditional on it.

NON-GOALS
- Do not invent execution plans.
- Do not assume indexes exist.
- Do not assume row counts.
- Do not recommend changes without justification.
- Separate facts from assumptions throughout.
- Explain the tradeoffs of every change that has them.
- Schema redesign and application-level changes are out of scope — unless the evidence shows no query-level fix exists, in which case say so and stop.

OUTPUT REQUIREMENTS
- Present recommendations as an ordered list, highest expected impact first, each with its evidence and its tradeoffs.
- For each recommendation, include the verification step: how to measure the improvement — your engine's actual-execution-plan facility before and after, on production-shaped data.
- Where essential evidence is missing, the first recommendations are the commands to gather it — not guesses in its place.
- End with the open questions: what could not be determined from the provided context, and what would settle each.
