07 / THE CONDUCTOR
Agent Orchestration & Human-in-the-Loop
01 VERBATIM
Fidelity & Structure
ffmpeg, whisper, merge
02 DECODER
Cultural Translation
gemini-pro (low temp)
03 DISTILLER
Essence Extraction
grandma_test.py
04 AMPLIFIER
Contextual Resonance
context_injection.json
05 RETENTION
Networked Memory
Io_loop_strategy
06 CURATOR
Multimodal Indexing
scenedetect, imagehash
01
RAW FOOTAGE INGESTION
02
CULTURAL DECODING
✓
HUMAN QA: TRANSLATION CHECK
03
ESSENCE DISTILLATION
04
CONTEXT AMPLIFICATION
✓
HUMAN QA: ANALOGY FIT
06
ASSET GENERATION & FINAL BUILD
conductor.py
def run_pipeline(video_source):
    # 1. Structure Preservation
    transcript = agent_01.run(video_source)
    # 2. Cultural Decoding + Human QA
    zh_text = agent_02.run(transcript)
    if not human.review(zh_text):
        return "REFINE"
    # 3. Amplification (Injection)
    context = load_schema("Architecture")
    notes = agent_04.run(zh_text, context)
    return agent_06.publish(notes)
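The conductor function above depends on agents, a human reviewer, and a schema loader that the slide does not define. A minimal runnable sketch of the same control flow follows, assuming hypothetical stand-ins: StubAgent, the review callback, the "talk.mp4" source name, and the plain string used in place of load_schema("Architecture") are illustrative inventions, not part of the original pipeline.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class StubAgent:
    """Hypothetical stand-in for a real agent; it only wraps its inputs in its name."""
    name: str

    def run(self, *inputs: str) -> str:
        return f"{self.name}({', '.join(inputs)})"


def run_pipeline(video_source: str, review: Callable[[str], bool]) -> str:
    # Stub agents named after the stages on the slide (assumed wiring).
    agent_01 = StubAgent("verbatim")
    agent_02 = StubAgent("decoder")
    agent_04 = StubAgent("amplifier")
    agent_06 = StubAgent("curator")

    # 1. Structure Preservation
    transcript = agent_01.run(video_source)
    # 2. Cultural Decoding + Human QA: stop early if the reviewer rejects.
    zh_text = agent_02.run(transcript)
    if not review(zh_text):
        return "REFINE"
    # 3. Amplification (Injection); the string is a placeholder for a loaded schema.
    context = "Architecture"
    notes = agent_04.run(zh_text, context)
    # The stub uses run() where the original calls publish().
    return agent_06.run(notes)


approved = run_pipeline("talk.mp4", review=lambda text: True)
rejected = run_pipeline("talk.mp4", review=lambda text: False)
print(approved)
print(rejected)
```

Passing the reviewer in as a callback keeps the human checkpoint explicit and makes the gate easy to exercise: an approving reviewer lets the run reach the final stage, while a rejecting one short-circuits with "REFINE".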
MCP SPEC v1.0
{
  "protocol": "MCP-v1",
  "stage": "04_AMP",
  "payload": {
    "source_hash": "a1b2...",
    "context_tags": ["ARCH"],
    "status": "AWAIT_QA"
  }
}
A standardized handoff carries the stage, source hash, context tags, and QA status with every message, so no context is lost between agents.
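One way to enforce the handoff format shown above is to build and parse it through typed objects rather than raw dictionaries. The sketch below is an assumption-laden illustration: the field names come from the slide, but the Envelope and Payload classes, the parse_envelope helper, and every status value other than "AWAIT_QA" are hypothetical.

```python
import json
from dataclasses import asdict, dataclass
from typing import List

# Assumed status vocabulary; only "AWAIT_QA" appears in the source.
ALLOWED_STATUS = {"AWAIT_QA", "APPROVED", "REFINE"}


@dataclass
class Payload:
    source_hash: str
    context_tags: List[str]
    status: str


@dataclass
class Envelope:
    stage: str
    payload: Payload
    protocol: str = "MCP-v1"

    def to_json(self) -> str:
        # asdict() recurses into the nested Payload dataclass.
        return json.dumps(asdict(self), sort_keys=True)


def parse_envelope(raw: str) -> Envelope:
    """Parse a handoff message and reject anything off-spec."""
    data = json.loads(raw)
    if data.get("protocol") != "MCP-v1":
        raise ValueError(f"unsupported protocol: {data.get('protocol')!r}")
    payload = Payload(**data["payload"])
    if payload.status not in ALLOWED_STATUS:
        raise ValueError(f"unknown status: {payload.status!r}")
    return Envelope(stage=data["stage"], payload=payload, protocol=data["protocol"])


# The hash stays truncated exactly as it is on the slide.
message = Envelope(
    stage="04_AMP",
    payload=Payload(source_hash="a1b2...", context_tags=["ARCH"], status="AWAIT_QA"),
)
round_trip = parse_envelope(message.to_json())
print(round_trip == message)
```

Rejecting an unknown protocol or status at parse time means a malformed handoff fails at the boundary between two agents instead of surfacing several stages later.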
The Symphony Analogy
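Orchestrator (Code) = The Conductor
It decides who plays first and who follows.
It cues the cello's entrance and judges when the human (Soloist) needs to step in for a solo.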
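MCP (Protocol) = The Standard Score
It ensures everyone speaks the same language.
Without a standard score, the violinist cannot read the pianist's notes, and collaboration turns into noise.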
Partnership Protocol
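You are no longer the operator; you are the conductor.
AI handles the large-scale, tedious computation; humans inject soul and value judgment at the key checkpoints.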