Session 1 — 2026-06-07 (Cron Run)

What was done

Kanban setup

  • Created kanban card t_e52a7709: “Synthetic Avatar Pipeline (Mick)”
  • Status: ready → in progress for this session

Audit of existing work

The research phase is complete (done on 2026-06-05). Key findings:

ItemStatusDetails
Model selected✅ DoneSoulX-FlashHead (Apache 2.0, 96 FPS on RTX 4090)
Research doc✅ Doneresearch.md — full model comparison matrix written
Reference photos✅ Available14 PNGs in /opt/data/creative/faces/png/ (IMG_7985–IMG_7999)
Dockerfile✅ WrittenCUDA 12.4, Torch + Diffusers + SoulX-FlashHead from source
Inference server✅ WrittenFastAPI POST /animate, audio + ref image → WebM stream
K8s manifests✅ WrittenDeployment (GPU nodeSelector), Service, Ingress at flashhead.hermes.paralla.org

Integration design confirmed

Text → F5-TTS (.193) → Audio WAV → SoulX-FlashHead (.106, RTX 4090) → WebM Video Stream
                                         ↑
                             Reference Image (from /opt/data/creative/faces/png/)

What has NOT been done (Phase 2 gaps)

  1. No inference test — model installed in Dockerfile but never tested end-to-end
  2. Best reference photo not selected — need visual assessment to pick the best of 14 candidates
  3. GPU K8s node registration — verify .106 has nvidia.com/gpu resource
  4. Audio format compatibility — confirm F5-TTS output matches SoulX-FlashHead expected input (likely 16kHz mono WAV)
  5. Docker image not built/pushed to registry

Blocker for Phase 2

Cannot proceed with actual inference testing without:

  • SSH access to .106 or kubectl exec into GPU pod
  • Visual inspection capability to select reference photo
  • Docker build environment with CUDA/registry access

Next action required: pvs needs to approve Phase 2 kickoff, then either:

  • Schedule interactive session to test inference on .106
  • Delegate to a worker that has SSH/GPU node access
  • Provide credentials for remote testing

Session log summary

TimestampAction
07:20 UTCKanban card t_e52a7709 created, status=ready
07:21 UTCAudit of existing research/infrastructure completed
07:21 UTCPhase transition documented — ready for implementation
07:21 UTCSession file written