YouTube Scout Build Summary
YouTube Scout is a production-ready summarization pipeline that converts long-form AI conference talks into high-signal, structured summaries capped at 1000 words.
By Sean WeldonYouTube Scout — Build Summary
Overview
YouTube Scout is a production-ready summarization pipeline that converts long-form AI conference talks into high-signal, structured summaries capped at 1000 words. It emphasizes signal extraction, structure preservation, and artifact persistence for long-term reuse.
Objectives
- Extract durable insights from long-form technical talks
- Enforce strict output constraints (≤1000 words)
- Preserve structure and frameworks over raw transcript coverage
- Enable resumability, auditing, and reprocessing
Architecture Summary
Pipeline Flow:
Video → Transcript Intake → Normalization → Extraction (LLM) → Synthesis (LLM) → Formatting
Core Stack:
- Python 3.11 async pipeline
- Anthropic Claude (Sonnet 4.5) / OpenAI GPT-4 (configurable)
- YouTube Transcript API
- JSON schema validation
- File-based artifact persistence
Key Architectural Decisions:
- Two-stage LLM design (extract → synthesize) to prevent context poisoning
- Hard word-count enforcement (not advisory)
- JSON-only extraction stage with schema validation
- Artifact-first design (all intermediate outputs persisted)
Core Capabilities
- Transcript ingestion and normalization
- Structured extraction of thesis, arguments, frameworks, and quotes
- Deterministic summary synthesis with retries under word limits
- Markdown + JSON outputs suitable for blogs, Obsidian, and RAG
Outcomes
- Typical processing time: ~30–80 seconds per video
- Consistent, reusable summaries across large content libraries
- Reliable enforcement of quality and length constraints
- Clear separation of reasoning (LLMs) and control (deterministic logic)
Quality Guarantees
- Schema validation for extraction outputs
- Explicit failure on word-count violations
- Retry logic with escalating compression
- Artifact inspection for debugging and audits
Limitations
- No playlist-level batching (planned)
- No vector database integration yet
- English-only transcripts
Next Steps
- Playlist and batch ingestion
- Topic tagging and metadata enrichment
- RAG storage and retrieval
- Automated publishing workflows
Reflection
This build demonstrates how LLMs are most effective when tightly constrained and embedded within deterministic pipelines. By prioritizing structure, artifacts, and hard constraints, YouTube Scout produces outputs that remain valuable long after initial generation.