video-ingest
Auto-generated from project docs
North Star
Booklet Status
| Booklet | Status | Appetite | Notes |
|---|---|---|---|
| B1 | DONE | 3 hours | Core pipeline working. YouTube captions via cmd-aorus, summary via LiteLLM, corpus indexed. |
| B2 | PLANNED | 3 hours | Min Min /ingest command + MeTube web UI + media file watcher |
| B3 | PLANNED | 3 hours | RSS auto-ingest + playlist batch + speaker diarization |
| ID | Type | Deliverable | Priority |
| ---- | ------ | ------------- | ---------- |
| B1-D1 | infra | Install yt-dlp + youtube-transcript-api + ffmpeg on VPS | P0 |
Recent Decisions
| Date | Decision | Rationale |
|---|---|---|
| ------ | ---------- | ----------- |
| 2026-04-14 | Project kickstarted | Replace Gemini manual workflow with automated pipeline |
| 2026-04-14 | yt-dlp over reclip/MeTube/cobalt | Best API for programmatic use, 1700+ sites (RESEARCH-247) |
| 2026-04-14 | faster-whisper over Ollama Whisper | Full control, VAD filter, word timestamps, int8 GPU (RESEARCH-247) |
| 2026-04-14 | large-v3-turbo model | Best speed/accuracy for RTX 3050 8GB (~45s/20min, 5.5% WER) |
| 2026-04-14 | Tiered transcription (captions → Whisper → Deepgram) | Skip GPU when good captions exist |
| 2026-04-14 | Bucket B | Corpus feeds agent quality for revenue work |
| 2026-04-14 | YouTube captions via cmd-aorus SSH | VPS Hetzner IP blocked by YouTube bot detection. Residential IP works. |
| 2026-04-14 | gpt-4.1-nano for summaries (not Haiku) | Fast + cheap, good structured output quality |
| 2026-04-14 | B1 DONE | YouTube caption pipeline working end-to-end. Whisper path tested but non-YouTube blocked from VPS. |
Source: /root/projects/video-ingest/