This content originally appeared on DEV Community and was authored by Asen Mitrev
Day 10 of migrating video cration to an entirely self-hosted model. The goal is to open-source video compilation with AI.
The video below was made using only 2 API dependencies. 11labs for voice and Vertex AI for multimodal embeddings.
The rest is locally run. LLM is Qwen 3.6 27B. Impeccable for agentic tasks like scriptwriting and RAG.
Hoping to switch to Qwen VL embeddings next, so embedding costs go to the local power plant instead. At a significant discount.
Still no capable open source model for text-to-speech, although if you know one, drop it in a comment. It's the last missing piece.
This content originally appeared on DEV Community and was authored by Asen Mitrev
Asen Mitrev | Sciencx (2026-06-03T07:01:39+00:00) Self-hosted video creation is coming. Retrieved from https://www.scien.cx/2026/06/03/self-hosted-video-creation-is-coming/
Please log in to upload a file.
There are no updates yet.
Click the Upload button above to add an update.