Org Status: Dormant | Cloudflare: N/A | Last Audited: 2026-04-28
Every tool, API, stock library, and AI service you need to produce short-form and long-form video content: inventoried, compared, and rated so you can decide "use this" or "skip this" in under 10 seconds.
This is a shopping list, not a tutorial. Before building anything custom, know what already exists.
What you'll learn
- 150+ resources across 14 categories, each with pricing, API status, and quality rating
- Which free tools are genuinely production-quality and which are traps
- Three complete stacks at $0/mo, $50/mo, and $200/mo price points
- What you must build custom vs. what you should never build custom
- Which resources have APIs for automation vs. manual-only workflows
Video content is the highest-ROI marketing channel in 2026. Finance YouTube channels earn $9-$21 RPM. TikTok creators monetize at $0.50-$3.00 per 1K views. A single viral short can drive more traffic than months of SEO work.
But the production pipeline is fragmented across hundreds of tools. Creators waste weeks evaluating options, overpay for services with free alternatives, or build custom tools that already exist. The result: paralysis, wasted money, or reinventing the wheel.
What changes if you get this right: You assemble a production pipeline in a day, not a month. You spend $0-50/mo instead of $200+. You automate 80% of the workflow using APIs and existing services. You focus your custom development on the 20% that actually differentiates your content.
Stock footage is the backbone of faceless video content. The difference between free and paid libraries is real but narrower than most people think.
Free Libraries
| Resource | URL | API | Vertical (9:16) | Quality | Commercial | Best For |
|---|---|---|---|---|---|---|
| Pexels | pexels.com | Yes (docs) | Yes, filterable | 4/5 | Yes, no attribution | General B-roll, business, nature, lifestyle |
| Pixabay | pixabay.com | Yes (docs) | Limited | 3/5 | Yes, no attribution | Supplementary clips, illustrations, vectors |
| Coverr | coverr.co | No | Some | 4/5 | Yes, no attribution | Tech, startup, modern aesthetic |
| Mixkit | mixkit.co | No | Some | 4/5 | Yes (check license type) | Cinematic B-roll, transitions, overlays |
Pexels: The Default Choice
- API rate limit: 200 requests/hour, 20,000/month
- Search filters: orientation (landscape, portrait, square), minimum quality (HD, Full HD, 4K)
- Video resource: Returns id, width, height, duration, video files with multiple quality levels and fps
- Why it wins: Best API of any free library. Filter by orientation for vertical content. Every video includes multiple resolution downloads. Commercial use with zero attribution.
- Gotcha: 20K monthly request cap is tight for automated pipelines. Cache aggressively.
// Pexels API: search for vertical stock footage
interface PexelsVideo {
  id: number;
  width: number;
  height: number;
  duration: number;
  url: string;
  video_files: {
    id: number;
    quality: "sd" | "hd" | "uhd";
    file_type: string;
    width: number;
    height: number;
    fps: number;
    link: string;
  }[];
  video_pictures: { id: number; picture: string }[];
}

interface PexelsSearchResponse {
  page: number;
  per_page: number;
  total_results: number;
  videos: PexelsVideo[];
}

async function searchVerticalFootage(
  query: string,
  apiKey: string
): Promise<PexelsVideo[]> {
  const url = new URL("https://api.pexels.com/videos/search");
  url.searchParams.set("query", query);
  url.searchParams.set("orientation", "portrait");
  url.searchParams.set("size", "medium"); // minimum size: Full HD
  url.searchParams.set("per_page", "15");

  const res = await fetch(url.toString(), {
    headers: { Authorization: apiKey },
  });
  // Surface rate-limit and auth failures instead of parsing an error body as results
  if (!res.ok) {
    throw new Error(`Pexels search failed: ${res.status} ${res.statusText}`);
  }
  const data: PexelsSearchResponse = await res.json();
  return data.videos;
}
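Since the monthly request cap is the binding constraint for automated pipelines, one way to "cache aggressively" is to memoize search results per query. The following is a minimal in-memory sketch; `TtlCache`, `withCache`, and the idea of keying on the lowercased query are illustrative assumptions, not part of the Pexels API, and a real pipeline would probably persist results to disk or a key-value store instead.

```typescript
// Hypothetical helper: a tiny in-memory cache with a time-to-live.
class TtlCache<V> {
  private readonly entries = new Map<string, { value: V; expiresAt: number }>();
  private readonly ttlMs: number;
  private readonly now: () => number;

  // `now` is injectable so expiry can be exercised without waiting.
  constructor(ttlMs: number, now: () => number = () => Date.now()) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  get(key: string): V | undefined {
    const entry = this.entries.get(key);
    if (entry === undefined) return undefined;
    if (entry.expiresAt <= this.now()) {
      this.entries.delete(key); // expired: drop it and report a miss
      return undefined;
    }
    return entry.value;
  }

  set(key: string, value: V): void {
    this.entries.set(key, { value, expiresAt: this.now() + this.ttlMs });
  }
}

// Wrap any async search function so repeated queries inside the TTL
// are answered from memory and cost no API requests.
function withCache<V>(
  cache: TtlCache<V>,
  search: (query: string) => Promise<V>
): (query: string) => Promise<V> {
  return async (query: string): Promise<V> => {
    const key = query.trim().toLowerCase();
    const hit = cache.get(key);
    if (hit !== undefined) return hit;
    const value = await search(query);
    cache.set(key, value);
    return value;
  };
}
```

Wrapping the search function shown earlier would look like `withCache(new TtlCache<PexelsVideo[]>(24 * 60 * 60 * 1000), (q) => searchVerticalFootage(q, apiKey))`; the 24-hour TTL is an arbitrary starting point.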
Pixabay: The Supplement
- API rate limit: 100 requests/minute (more generous per-minute than Pexels)
- Search filters: type, category, min_width, min_height, editors_choice
- Key difference: Also covers images, illustrations, vectors, and music, so one API serves multiple asset types
- Gotcha: Video quality is generally lower than Pexels. Use for variety, not primary footage.
// Pixabay API: video search
async function searchPixabayVideos(
  query: string,
  apiKey: string
): Promise<any> {
  const url = new URL("https://pixabay.com/api/videos/");
  url.searchParams.set("key", apiKey);
  url.searchParams.set("q", query);
  url.searchParams.set("video_type", "film");
  url.searchParams.set("min_width", "1080");
  url.searchParams.set("per_page", "20");

  const res = await fetch(url.toString());
  if (!res.ok) {
    throw new Error(`Pixabay search failed: ${res.status} ${res.statusText}`);
  }
  return res.json();
}
Coverr: The Aesthetic Pick
- No API: download only via website
- Best for: Modern, clean footage that looks more premium than typical stock
- 2026 update: Now includes AI-generated videos alongside human-shot footage, all clearly labeled
- Restriction: Cannot use to train AI models or build competing services
Mixkit: The Template Bonus
- No API: manual download
- Bonus: Also offers free Premiere Pro, After Effects, Final Cut Pro, and DaVinci Resolve templates (851+)
- Two license types: Free License (commercial OK) and Restricted License (non-commercial only); check per video
- Music restriction: Cannot use Mixkit music in CDs, DVDs, video games, or TV/radio broadcasts
Paid Libraries
| Resource | URL | Pricing | API | Quality | Best For |
|---|---|---|---|---|---|
| Storyblocks | storyblocks.com | $21-65/mo (unlimited) | Yes (enterprise) | 4/5 | Unlimited downloads, team workflows |
| Artgrid | artgrid.io | ~$25-50/mo | No | 5/5 | Cinematic, documentary-quality footage |
| Envato Elements | elements.envato.com | $16.50/mo | No | 4/5 | All-in-one (video + audio + graphics + templates) |
| iStock/Getty | istockphoto.com | Per-clip ($12-300+) | Yes | 5/5 | Premium, exclusive footage |
Storyblocks: Best Paid Value
- Unlimited downloads on all subscription plans, with no per-clip fees
- API available for enterprise/custom integrations with unlimited search and download calls
- Free API trial: Test keys with 5 free downloads per content type
- Essentials: $252/year ($21/mo); Unlimited All Access: $360/year ($30/mo)
- Enterprise: $6,000-12,000+/year with full API access
Key insight: Storyblocks is the only stock library that combines unlimited downloads with API access at a reasonable price. If you're producing more than 10 videos/month, it pays for itself vs. per-clip pricing.
Envato Elements: The Bundle Play
- One subscription covers everything: stock video, music, sound effects, graphics, templates, fonts
- 1.3M+ stock videos included
- No API: all downloads are manual through the website
- Best for: Solo creators who need a bit of everything and want one bill
Vertical Format Availability
| Library | Portrait/Vertical Filter | 9:16 Content Volume |
|---|---|---|
| Pexels | Yes (orientation=portrait) | High |
| Pixabay | No dedicated filter | Low |
| Coverr | Manual browsing | Medium |
| Mixkit | Manual browsing | Medium |
| Storyblocks | Yes (orientation filter) | High |
| Artgrid | Limited | Low (cinematic focus) |
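Where a library has little native 9:16 content, landscape clips can still be used by center-cropping them. The sketch below computes the largest centered 9:16 rectangle inside an arbitrary source frame; `centerCrop` and `CropRect` are hypothetical names, and rounding down to even dimensions is an assumption made because libx264 with the common yuv420p pixel format needs even width and height.

```typescript
interface CropRect {
  width: number;
  height: number;
  x: number; // left offset of the crop window
  y: number; // top offset of the crop window
}

// Largest centered rectangle with the target aspect ratio that fits in the source.
function centerCrop(
  srcWidth: number,
  srcHeight: number,
  ratioW: number = 9,
  ratioH: number = 16
): CropRect {
  if (srcWidth <= 0 || srcHeight <= 0) {
    throw new RangeError("source dimensions must be positive");
  }
  let width: number;
  let height: number;
  if (srcWidth / srcHeight > ratioW / ratioH) {
    // Source is wider than the target: keep full height, trim the sides.
    height = srcHeight;
    width = Math.floor((srcHeight * ratioW) / ratioH);
  } else {
    // Source is narrower or equal: keep full width, trim top and bottom.
    width = srcWidth;
    height = Math.floor((srcWidth * ratioH) / ratioW);
  }
  // Round down to even numbers for encoder compatibility.
  width -= width % 2;
  height -= height % 2;
  return {
    width,
    height,
    x: Math.floor((srcWidth - width) / 2),
    y: Math.floor((srcHeight - height) / 2),
  };
}
```

For a 1920x1080 landscape clip this keeps a 606x1080 window, so roughly two thirds of the frame is discarded; footage with a centered subject survives this far better than wide establishing shots.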
Music makes or breaks video content. The wrong track gets your video muted or taken down. The right track doubles watch time.
Free Music Libraries
| Resource | URL | Attribution | Commercial | Library Size | Best For |
|---|---|---|---|---|---|
| Pixabay Music | pixabay.com/music | No | Yes | 100K+ | Background tracks, no-hassle licensing |
| Mixkit Music | mixkit.co/free-stock-music | No | Yes (with limits) | 1,000+ | Curated, high-quality tracks |
| YouTube Audio Library | studio.youtube.com/channel/…/music | Some tracks | YouTube only | 3,000+ | YouTube-specific content |
| Freesound | freesound.org | CC license varies | Depends on license | 500K+ sounds | Sound effects, ambient, field recordings |
| Uppbeat | uppbeat.io | Free tier: yes | Yes (Business plan) | 10K+ | Curated, genre-organized |
Pixabay Music: Zero-Friction Default
- No attribution required for commercial use including video projects
- No sign-up needed to download
- Gotcha: Cannot distribute via DSPs (Spotify, Apple Music). Cannot claim or monetize the track itself.
- API: Uses same Pixabay API; search music by genre, mood, duration
- Best for: TikTok, YouTube, Reels background music where you want zero legal overhead
Freesound: The Sound Effects Goldmine
- API available: freesound.org/help/developers (search, filter, download, content analysis)
- 500K+ sounds: largest CC-licensed sound database in the world (20th anniversary in 2026)
- Multiple CC licenses per sound; check each sound's specific license
- API features: Search by tags, filter by license type, retrieve audio analysis features, find similar sounds
- Commercial API use requires a license; individual sound CC requirements still apply
- Best for: Sound effects, ambient audio, foley, field recordings (not music tracks)
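// Freesound API: search for sound effects
async function searchFreesound(
  query: string,
  token: string,
  filter?: { license?: string; duration_min?: number; duration_max?: number }
): Promise<any> {
  const url = new URL("https://freesound.org/apiv2/search/text/");
  url.searchParams.set("query", query);
  url.searchParams.set("token", token);
  url.searchParams.set("fields", "id,name,tags,license,duration,previews,download");

  // Filters use Solr-style syntax; separate clauses are joined with spaces.
  const clauses: string[] = [];
  if (filter?.license) {
    clauses.push(`license:"${filter.license}"`);
  }
  if (filter?.duration_min !== undefined || filter?.duration_max !== undefined) {
    // Previously these two options were accepted but never applied.
    const min = filter?.duration_min ?? 0;
    const max = filter?.duration_max ?? "*";
    clauses.push(`duration:[${min} TO ${max}]`);
  }
  if (clauses.length > 0) {
    url.searchParams.set("filter", clauses.join(" "));
  }

  const res = await fetch(url.toString());
  if (!res.ok) {
    throw new Error(`Freesound search failed: ${res.status} ${res.statusText}`);
  }
  return res.json();
}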
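Because every sound carries its own Creative Commons license, an automated pipeline needs a per-result check before a clip goes into a monetized video. The sketch below classifies a license string conservatively. It assumes the `license` field comes back as a Creative Commons URL (for example one containing `/licenses/by/` or `/publicdomain/zero/`); `classifyCcLicense` and `LicenseVerdict` are hypothetical names, and anything unrecognized is treated as not commercially usable.

```typescript
interface LicenseVerdict {
  commercialOk: boolean;
  attributionRequired: boolean;
}

// Conservative classification of a Creative Commons license URL.
function classifyCcLicense(licenseUrl: string): LicenseVerdict {
  const url = licenseUrl.toLowerCase();
  // CC0: public domain dedication, no attribution needed.
  if (url.includes("publicdomain/zero")) {
    return { commercialOk: true, attributionRequired: false };
  }
  // Any NonCommercial variant (by-nc, by-nc-sa, by-nc-nd) is out for monetized video.
  if (/\/licenses\/by-nc/.test(url)) {
    return { commercialOk: false, attributionRequired: true };
  }
  // Attribution variants without the NC clause allow commercial use with credit.
  if (/\/licenses\/by(-sa|-nd)?\//.test(url)) {
    return { commercialOk: true, attributionRequired: true };
  }
  // Unknown or legacy license: refuse rather than guess.
  return { commercialOk: false, attributionRequired: true };
}
```

This is a filter, not legal advice: ShareAlike and NoDerivatives terms add obligations of their own that this check does not model.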
YouTube Audio Library: YouTube Only
- Free for all YouTube creators; access via YouTube Studio
- Some tracks require attribution (marked in library)
- Cannot use outside YouTube: license is platform-specific
- Best for: YouTube-first creators who don't need cross-platform music
Paid Music Libraries
| Resource | URL | Pricing | Library Size | API | Best For |
|---|---|---|---|---|---|
| Epidemic Sound | epidemicsound.com | $10-75/mo | 50K+ tracks, 200K+ SFX | Yes (docs) | Professional quality, AI-powered search |
| Artlist | artlist.io | ~$10-34/mo | Large | No | Unlimited downloads, lifetime usage |
| Soundstripe | soundstripe.com | $15-40/mo | 50K+ | No | Budget-friendly unlimited |
| Bensound | bensound.com | Free + $6-26/mo | 2K+ | No | Small but curated library |
Epidemic Sound: The Professional Standard
- 50,000+ tracks across 160 genres, 200,000+ sound effects
- API launched in 2026: Embed licensed music into third-party platforms
  - Text prompt search (genre, tempo, length parameters)
  - Video prompt analysis (pacing, transitions, style → music recommendations)
  - Compatible with desktop, web, and mobile apps
  - Supports sublicensing for end users
- Pricing: $9.99-29.99/mo (annual) or $19.99-74.99/mo (monthly)
- All royalty-free: no per-use fees, no Content ID claims
- Artist compensation: $2,000-8,000 per track (as of Q1 2026)
Key insight: Epidemic Sound's API with AI-powered music matching is the most advanced programmatic music integration available. If you're building an automated video pipeline, this is the only music API worth integrating.
Artlist: The Creator Favorite
- Social plan: Unlimited music + SFX for one channel per platform
- Pro plan: Universal license, commercial + advertising use
- Lifetime usage rights: downloads during subscription never expire
- AI Max plan (~$34/mo): Includes footage, templates, and plugins alongside music
- No API: all downloads are manual
Music Licensing for TikTok/Commercial
| Library | TikTok Safe | Instagram Safe | YouTube Safe | Ads/Commercial |
|---|---|---|---|---|
| Pixabay Music | Yes | Yes | Yes | Yes |
| Mixkit Music | Yes | Yes | Yes | No (broadcast) |
| Epidemic Sound | Yes | Yes | Yes | Yes (paid plan) |
| Artlist | Yes | Yes | Yes | Yes (Pro plan) |
| YouTube Audio Library | No | No | Yes | No |
| Freesound | Check per sound | Check per sound | Check per sound | Check per sound |
| Uppbeat | Yes (Business) | Yes (Business) | Yes | Yes (Business) |
Images serve as video backgrounds, slide content, thumbnails, and overlay graphics.
Free Image Libraries
| Resource | URL | API | Transparent BG | Quality | Best For |
|---|---|---|---|---|---|
| Pexels | pexels.com | Yes | No | 4/5 | Photos for video backgrounds |
| Unsplash | unsplash.com | Yes (docs) | No | 5/5 | High-art photography |
| Pixabay | pixabay.com | Yes | Yes (illustrations) | 3/5 | Illustrations, vectors, photos |
| Freepik | freepik.com | No | Yes (vectors/PSD) | 4/5 | Vectors, PSD templates, illustrations |
Unsplash: Highest Quality Free Photos
- API: 50 requests/hour (demo), higher with production approval
- 2M+ photos from professional photographers
- No attribution required (but appreciated)
- Best for: Hero images, thumbnails, video slide backgrounds
- Gotcha: No illustration or vector content; photos only
Freepik: The Vector/Template Source
- Free tier: Limited downloads with attribution required
- Premium: $8-15/mo for unlimited, no attribution
- Transparent backgrounds on vectors and PSD files
- AI image generation included in premium plans
- Best for: Graphics, icons, illustrations for video overlays
Design Tools with Templates
| Resource | URL | Free Tier | Video Templates | API | Best For |
|---|---|---|---|---|---|
| Canva | canva.com | Yes (generous) | Yes (extensive) | No public API | Thumbnails, social graphics, simple video |
| Figma | figma.com | Yes | Community files | REST API | Custom thumbnails, design systems |
Canva: The Non-Designer's Best Friend
- Free tier: 5GB storage, 1M+ free templates, limited AI features
- Pro ($13/mo): Brand Kit, background remover, 100M+ premium assets
- Video templates: Thousands of pre-made templates for TikTok, Reels, YouTube
- Thumbnail maker: AI thumbnail generator included
- No public API for automation (editor-only)
- Best for: Thumbnails, social media graphics, simple slideshow videos
Voice is the most critical quality differentiator in faceless video. Viewers tolerate mediocre footage but abandon robotic voices.
Cloud TTS Services
| Resource | URL | Pricing | Quality | Voice Cloning | API | Languages |
|---|---|---|---|---|---|---|
| ElevenLabs | elevenlabs.io | Free-$1,320/mo | 5/5 | Yes | Yes (docs) | 32+ |
| Play.ht | play.ht | ~$31/mo+ | 4/5 | Yes | Yes | 142 |
| Amazon Polly | aws.amazon.com/polly | $4-100/1M chars | 3/5 | No | Yes | 30+ |
| Google Cloud TTS | cloud.google.com/text-to-speech | $4-16/1M chars | 4/5 | No | Yes | 40+ |
ElevenLabs: The Quality King
- Plans: Free (10K credits/mo), Starter ($5/mo, 30K), Creator ($11/mo, 100K), Pro ($99/mo, 500K)
- Credit system: 1 character = 1 credit (standard models), 0.5-1 credit (Flash/Turbo models)
- Overage: $0.12-0.30 per 1K characters depending on plan
- Voice cloning: Available on all paid plans; clone any voice from audio samples
- Models: V1 English, V1 Multilingual, V2 Multilingual, V2.5 Flash/Turbo
- API: Full REST API, WebSocket streaming, SDKs for Python/JS/Go
- Quality: Indistinguishable from human speech for most listeners
// ElevenLabs TTS API
interface ElevenLabsRequest {
  text: string;
  model_id: "eleven_monolingual_v1" | "eleven_multilingual_v2" | "eleven_turbo_v2_5";
  voice_settings?: {
    stability: number; // 0-1
    similarity_boost: number; // 0-1
    style?: number; // 0-1
    use_speaker_boost?: boolean;
  };
}

async function generateSpeech(
  voiceId: string,
  text: string,
  apiKey: string
): Promise<ArrayBuffer> {
  const res = await fetch(
    `https://api.elevenlabs.io/v1/text-to-speech/${voiceId}`,
    {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        "xi-api-key": apiKey,
      },
      body: JSON.stringify({
        text,
        model_id: "eleven_turbo_v2_5",
        voice_settings: {
          stability: 0.5,
          similarity_boost: 0.75,
        },
      } satisfies ElevenLabsRequest),
    }
  );
  // Without this check an error response (bad key, exhausted credits) would be saved as "audio"
  if (!res.ok) {
    throw new Error(`ElevenLabs request failed: ${res.status} ${res.statusText}`);
  }
  return res.arrayBuffer();
}
Key insight: ElevenLabs at $11/mo (Creator plan, 100K characters) gives you roughly 100 minutes of generated speech, assuming about 1,000 characters per minute of narration. That's around 10 ten-minute long-form videos or about 100 one-minute shorts per month. For most creators, this is the sweet spot.
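How far a credit allowance stretches depends almost entirely on the assumed speaking rate, so it is worth making that assumption explicit. The sketch below converts a monthly credit budget into minutes of speech and a video count. `planCapacity` is a hypothetical helper, and the default of 1,000 characters per minute is an assumption (a common rule of thumb for conversational narration); measure your own scripts and voice before relying on it.

```typescript
interface PlanCapacity {
  minutes: number; // total minutes of generated speech
  videos: number;  // whole videos of the given length that fit in the budget
}

function planCapacity(
  creditsPerMonth: number,
  videoSeconds: number,
  charsPerMinute: number = 1000, // assumed speaking rate, not a published figure
  creditsPerChar: number = 1     // 1 for standard models per the credit system above
): PlanCapacity {
  if (videoSeconds <= 0 || charsPerMinute <= 0 || creditsPerChar <= 0) {
    throw new RangeError("videoSeconds, charsPerMinute and creditsPerChar must be positive");
  }
  const characters = creditsPerMonth / creditsPerChar;
  const minutes = characters / charsPerMinute;
  const videos = Math.floor((minutes * 60) / videoSeconds);
  return { minutes, videos };
}
```

Passing `creditsPerChar = 0.5` models the cheaper Flash/Turbo rate and doubles the capacity for the same plan.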
Amazon Polly: The Cheap Workhorse
- Standard voices: $4/1M characters (cheapest cloud TTS)
- Neural voices: $16/1M characters (good quality, still cheap)
- Generative voices: $30/1M characters
- Long-form voices: $100/1M characters
- Free tier: 5M standard chars/mo, 1M neural chars/mo (first 12 months)
- Best for: Bulk generation where cost matters more than premium quality
- Cost example: 100K characters (roughly 100 minutes of speech at ~1,000 characters per minute) = $0.40 standard, $1.60 neural
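The cost example is simple per-character arithmetic, which the sketch below generalizes to the four Polly tiers listed above. The rates are copied from this section; `pollyCostUsd` and the tier keys are hypothetical names, and the free tier is deliberately ignored.

```typescript
// USD per 1M characters, taken from the list above.
const POLLY_USD_PER_MILLION = {
  standard: 4,
  neural: 16,
  generative: 30,
  longForm: 100,
} as const;

type PollyTier = keyof typeof POLLY_USD_PER_MILLION;

// Cost of synthesizing `characters` characters on a given tier, rounded to cents.
function pollyCostUsd(characters: number, tier: PollyTier): number {
  if (characters < 0) {
    throw new RangeError("characters must be non-negative");
  }
  const usd = (characters / 1_000_000) * POLLY_USD_PER_MILLION[tier];
  return Math.round(usd * 100) / 100;
}
```

With these rates, `pollyCostUsd(100_000, "standard")` gives 0.4 and `pollyCostUsd(100_000, "neural")` gives 1.6, matching the figures in the cost example.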
Google Cloud TTS: The Middle Ground
- Standard voices: $4/1M characters
- WaveNet voices: Higher quality, priced toward the top of the $4-16/1M range listed in the table above
- Free tier: 4M standard chars/mo, 1M WaveNet chars/mo
- $300 new customer credits for testing
- Best for: Multi-language content, integration with Google Cloud ecosystem
Free / Local TTS Solutions
| Resource | URL | Quality | Speed | Voice Cloning | GPU Required |
|---|---|---|---|---|---|
| edge-tts | github.com/rany2/edge-tts | 4/5 | Fast (cloud) | No | No |
| Coqui TTS | github.com/coqui-ai/TTS | 3/5 | Medium | Yes (XTTS) | Recommended |
| Bark | github.com/suno-ai/bark | 4/5 | Slow | No | Yes |
| XTTS v2 | Coqui AI | 4/5 | Medium | Yes (6s clip) | Yes |
| Piper | github.com/rhasspy/piper | 3/5 | Very fast | No | No |
edge-tts: The Free Secret Weapon
- Completely free: uses Microsoft Edge's online TTS service
- No API key required: no Microsoft account, no Edge browser, no Windows needed
- Python: pip install edge-tts (PyPI)
- Node.js: edge-tts-universal on npm
- Quality: Surprisingly good, better than most paid standard voices
- 300+ voices across dozens of languages
- Gotcha: Depends on Microsoft's service availability. Not officially supported. Could break without warning.
// edge-tts via Node.js (edge-tts-universal package)
// Uses the streaming Communicate class as described in the package README;
// confirm the class and method names against the version you install.
import { Communicate } from "edge-tts-universal";

async function generateFreeSpeech(
  text: string,
  voice: string = "en-US-AriaNeural"
): Promise<Buffer> {
  const communicate = new Communicate(text, { voice });
  const chunks: Buffer[] = [];
  // The stream mixes audio chunks with timing metadata; keep only the audio.
  for await (const chunk of communicate.stream()) {
    if (chunk.type === "audio" && chunk.data) {
      chunks.push(chunk.data);
    }
  }
  return Buffer.concat(chunks);
}
Key insight: edge-tts is genuinely good enough for production short-form video. It's free, fast, and has 300+ voices. Start here. Upgrade to ElevenLabs only when voice quality becomes a bottleneck (usually for long-form YouTube content where viewers listen for 10+ minutes).
Bark: The Creative Wild Card
- Open source (MIT license): full commercial use
- Generates non-speech sounds: laughter, sighing, music, ambient
- Multilingual without separate model downloads
- 2x faster on GPU, 10x faster on CPU (recent optimizations)
- Gotcha: Slow generation. A 10-second clip can take 30+ seconds. Not viable for bulk production.
- Best for: Creative intros, emotional moments, variety in faceless content
Piper: The Speed Demon
- Fastest local TTS: real-time even on Raspberry Pi 4
- ONNX models trained with VITS architecture
- Latest release: v1.4.1 (February 2026)
- No GPU required: runs on CPU
- Best for: Real-time applications, embedded systems, bulk generation where speed matters
- Gotcha: Voice quality is functional but not premium. Good for drafts, not final output.
XTTS v2: The Voice Cloner
- Zero-shot voice cloning from 6 seconds of audio
- 17 languages supported
- Part of Coqui TTS toolkit: pip install TTS
- Runs locally with Docker support
- Best for: Consistent brand voice, custom character voices
TTS Comparison Matrix
| Service | Cost/30 min | Quality | Speed | Cloning | Offline |
|---|---|---|---|---|---|
| ElevenLabs (Creator) | ~$3.30 | 5/5 | Fast | Yes | No |
| Amazon Polly (Neural) | ~$0.48 | 3/5 | Fast | No | No |
| Google Cloud TTS | ~$2.00 | 4/5 | Fast | No | No |
| edge-tts | $0 | 4/5 | Fast | No | No |
| Bark | $0 (GPU cost) | 4/5 | Slow | No | Yes |
| Piper | $0 | 3/5 | Very fast | No | Yes |
| XTTS v2 | $0 (GPU cost) | 4/5 | Medium | Yes | Yes |
| Play.ht | ~$10+ | 4/5 | Fast | Yes | No |
When stock music doesn't fit, generate custom tracks. Quality has reached the point where AI music is indistinguishable from stock library tracks for background use.
Cloud Music Generation
| Resource | URL | Free Tier | Pricing | Commercial | Quality |
|---|---|---|---|---|---|
| Suno | suno.com | 50 credits/day | $10-30/mo | Paid plans only | 5/5 |
| Udio | udio.com | 10 daily + 100 monthly | $10-30/mo | Paid plans (no attribution) | 4/5 |
| Stable Audio | stableaudio.com | Yes (limited) | Paid tiers | Creator/Enterprise license | 4/5 |
Suno: Best Quality, Most Mature
- Free: 50 credits/day, enough to generate ~25 songs/day for personal use
- Pro ($10/mo): 2,500 credits/mo, commercial license, ownership rights
- Premier ($30/mo): 10,000 credits/mo
- No official public API; third-party access via partners ($0.02-0.05/track)
- Commercial license: Paid plans grant full ownership and commercial rights
- Warner Music Group partnership (2026): licensed AI models coming
- Best for: Background music, intros/outros, custom jingles
Key insight: Suno's free tier (50 credits/day) generates more than enough tracks for drafting and personal projects. Because commercial rights come only with paid plans, move to Pro ($10/mo) before publishing anything monetized or client-facing.
Udio: The Competitor
- Free: 10 daily + 100 monthly credits
- Standard ($10/mo): 1,200 credits (~600 songs at $0.017/song)
- Pro ($30/mo): 4,800 credits
- Commercial use: All plans, but free requires Udio attribution
- 2026 warning: Downloads temporarily disabled during licensing transition with Warner Music Group and Universal
- Best for: Variety in style, covers and remixes (when licensed service launches)
Local / Open Source Music Generation
| Resource | URL | License | GPU Required | Quality | Best For |
|---|---|---|---|---|---|
| MusicGen (Meta) | github.com/facebookresearch/audiocraft | MIT (code) / CC-BY-NC 4.0 (models) | Yes (16GB rec.) | 4/5 | Research, non-commercial |
| AudioCraft | ai.meta.com/audiocraft | Same as above | Yes | 4/5 | Full audio toolkit |
MusicGen: The Open Source Option
- Trained on 400K recordings (20,000 hours) of Meta-owned and licensed music
- Text-to-music: Describe the style, get a track
- Melodic conditioning: Hum a melody, get a full arrangement
- Models: Small, Medium, Large (larger = better but slower)
- 16GB GPU recommended for large model
- Critical licensing note: Model weights are CC-BY-NC 4.0, which means non-commercial only
- Replicate API: Run MusicGen via replicate.com/meta/musicgen for ~$0.01-0.05/generation
// MusicGen via Replicate API
// Starts a prediction and returns the URL to poll; it does not wait for the audio.
async function generateMusic(
  prompt: string,
  durationSeconds: number = 30
): Promise<string> {
  const res = await fetch("https://api.replicate.com/v1/predictions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.REPLICATE_API_TOKEN}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      version: "b05b1dff1d8c6dc63d14b0cdb42135571e41c36ec76ef8bf37c9e36b395767a4",
      input: {
        prompt,
        duration: durationSeconds,
        model_version: "stereo-melody-large",
        output_format: "mp3",
      },
    }),
  });
  if (!res.ok) {
    throw new Error(`Replicate request failed: ${res.status} ${res.statusText}`);
  }
  const prediction = await res.json();
  return prediction.urls.get; // Poll this URL for result
}
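The function above only starts a prediction, so the caller still has to poll until it finishes. A minimal polling loop could look like the sketch below. `waitForPrediction`, its options, and the injected `getPrediction` callback are hypothetical; the status names are the ones Replicate documents for predictions, and the 2-second interval and 60-attempt limit are arbitrary defaults.

```typescript
type PredictionStatus = "starting" | "processing" | "succeeded" | "failed" | "canceled";

interface Prediction {
  status: PredictionStatus;
  output?: unknown;
  error?: unknown;
}

// A prediction is finished once it has succeeded, failed, or been canceled.
function isTerminal(status: PredictionStatus): boolean {
  return status === "succeeded" || status === "failed" || status === "canceled";
}

// Polls via the injected `getPrediction` until a terminal status or the attempt limit.
async function waitForPrediction(
  getPrediction: () => Promise<Prediction>,
  options: {
    intervalMs?: number;
    maxAttempts?: number;
    sleep?: (ms: number) => Promise<void>;
  } = {}
): Promise<Prediction> {
  const intervalMs = options.intervalMs ?? 2000;
  const maxAttempts = options.maxAttempts ?? 60;
  const sleep =
    options.sleep ??
    ((ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms)));

  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    const prediction = await getPrediction();
    if (isTerminal(prediction.status)) {
      if (prediction.status !== "succeeded") {
        throw new Error(`prediction ${prediction.status}: ${String(prediction.error)}`);
      }
      return prediction;
    }
    await sleep(intervalMs);
  }
  throw new Error(`prediction still running after ${maxAttempts} polls`);
}
```

In practice `getPrediction` would fetch the polling URL with the same Authorization header and return the parsed JSON; injecting it keeps the loop testable without network access.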
Key insight: For commercial video production, use Suno ($10/mo) or stock libraries. MusicGen is non-commercial only (model license). Use MusicGen for prototyping and personal projects, then swap to licensed music for published content.
Every video starts with a script. LLMs have made script generation nearly free. The differentiator is prompt engineering, not the model.
| Resource | URL | Free Tier | Cost/Script | Speed | Quality |
|---|---|---|---|---|---|
| Claude API | anthropic.com | No | ~$0.02-0.10 | Fast | 5/5 |
| Gemini API | ai.google.dev | Yes (generous) | $0-0.05 | Fast | 4/5 |
| OpenRouter | openrouter.ai | No (pay-as-you-go) | $0.001-0.10 | Varies | 4/5 |
| Cloudflare Workers AI | developers.cloudflare.com | Yes (included) | ~$0.001 | Fast | 3/5 |
| Ollama | ollama.ai | Unlimited (local) | $0 (electricity) | Varies | 3-4/5 |
Claude API: Best Script Quality
- Sonnet: ~$0.02/script (60-second video), the best balance of quality and cost
- Opus: ~$0.10/script, for complex, nuanced narratives
- Best for: Video scripts that need to sound human, maintain narrative arc, hit emotional beats
- SDK: @anthropic-ai/sdk for Node.js
Gemini API: Best Free Option
- Free tier: 15 RPM, 1M tokens/min, 1.5K requests/day
- Paid: $0.075/1M input tokens (Flash), $1.25/1M (Pro)
- Best for: Bulk script generation, iterative drafting, research synthesis
- Flash model: Fast enough for real-time script generation in pipelines
OpenRouter: Multi-Model Access
- Single API, 100+ models: Claude, GPT, Gemini, Llama, Mistral, and more
- Pay-as-you-go: no subscriptions, charged per token
- Best for: A/B testing different models for script quality, routing to cheapest adequate model
- Key advantage: Switch models without changing code
// OpenRouter: model-agnostic script generation
interface VideoScript {
  hook: string;
  scenes: { visual: string; narration: string; duration: number }[];
  cta: string;
  totalDuration: number;
}

// The model never sees the TypeScript interface, so the shape is spelled out in the prompt.
const SCRIPT_SHAPE = `{
  "hook": string,
  "scenes": [{ "visual": string, "narration": string, "duration": number }],
  "cta": string,
  "totalDuration": number
}`;

async function generateVideoScript(
  topic: string,
  durationSeconds: number,
  model: string = "anthropic/claude-sonnet-4"
): Promise<VideoScript> {
  const res = await fetch("https://openrouter.ai/api/v1/chat/completions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.OPENROUTER_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      model,
      messages: [
        {
          role: "system",
          content: `You are a viral video scriptwriter. Respond with a single JSON object of this shape: ${SCRIPT_SHAPE}. Durations are in seconds. Target duration: ${durationSeconds}s. Every script needs a hook in the first 3 seconds.`,
        },
        {
          role: "user",
          content: `Write a ${durationSeconds}-second video script about: ${topic}`,
        },
      ],
      response_format: { type: "json_object" },
    }),
  });
  if (!res.ok) {
    throw new Error(`OpenRouter request failed: ${res.status} ${res.statusText}`);
  }
  const data = await res.json();
  return JSON.parse(data.choices[0].message.content);
}
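`JSON.parse` only guarantees syntactically valid JSON, not that the model returned the expected fields. A small runtime check before the script enters the rest of the pipeline might look like the sketch below. `parseVideoScript`, `ValidatedScript`, and `ScriptScene` are hypothetical names that mirror the `VideoScript` shape above; a schema library would do the same job with less code.

```typescript
interface ScriptScene {
  visual: string;
  narration: string;
  duration: number;
}

interface ValidatedScript {
  hook: string;
  scenes: ScriptScene[];
  cta: string;
  totalDuration: number;
}

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Parses raw model output and throws a descriptive error if the shape is wrong.
function parseVideoScript(raw: string): ValidatedScript {
  const data: unknown = JSON.parse(raw);
  if (!isRecord(data)) {
    throw new Error("script must be a JSON object");
  }
  const { hook, scenes, cta, totalDuration } = data;
  if (typeof hook !== "string" || typeof cta !== "string") {
    throw new Error("hook and cta must be strings");
  }
  if (typeof totalDuration !== "number" || !Number.isFinite(totalDuration)) {
    throw new Error("totalDuration must be a finite number");
  }
  if (!Array.isArray(scenes) || scenes.length === 0) {
    throw new Error("scenes must be a non-empty array");
  }
  const checked: ScriptScene[] = scenes.map((scene: unknown, index: number) => {
    if (!isRecord(scene)) {
      throw new Error(`scene ${index} must be an object`);
    }
    const { visual, narration, duration } = scene;
    if (
      typeof visual !== "string" ||
      typeof narration !== "string" ||
      typeof duration !== "number"
    ) {
      throw new Error(`scene ${index} is missing visual, narration, or duration`);
    }
    return { visual, narration, duration };
  });
  return { hook, scenes: checked, cta, totalDuration };
}
```

Failing loudly here is usually cheaper than discovering a missing `narration` field after voiceover and rendering have already been paid for.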
Ollama: The Local Free Option
- Run any open model locally: Llama 3, Mistral, Phi, Gemma
- Zero cost beyond hardware/electricity
- Best for: Drafting, iteration, private content that shouldn't leave your machine
- Gotcha: Quality of Llama 3 70B approaches but doesn't match Claude/GPT-4 for creative writing
This is where scripts, footage, voiceover, and music become an actual video. The choice between cloud APIs and local tools defines your cost structure and flexibility.
Cloud Video APIs
| Resource | URL | Pricing | Free Tier | Templates | Quality |
|---|---|---|---|---|---|
| Creatomate | creatomate.com | $41-249/mo | 50 free credits | Yes (visual editor) | 5/5 |
| Shotstack | shotstack.io | $0.20-0.40/min | 20 min/mo | Yes (JSON + editor) | 4/5 |
| JSON2Video | json2video.com | $20-200/mo | 600 sec | Yes | 3/5 |
Creatomate: Best Cloud API
- Credit-based pricing: Starter ($41/mo), Professional ($99/mo), Enterprise ($249/mo)
- Visual template editor: build templates in-browser, render via API
- API on all plans including starter
- Template gallery with customizable starting points
- No After Effects import: must use Creatomate's own editor or built-in templates
- Best for: Branded template videos, social media content at scale
- API: REST API with webhooks for render completion
// Creatomate API: render from template
interface CreatomateRender {
  template_id: string;
  modifications: Record<string, string | number | boolean>;
}

async function renderVideo(
  templateId: string,
  modifications: CreatomateRender["modifications"],
  apiKey: string
): Promise<{ id: string; url: string; status: string }> {
  // Send one render request object, as in the documented request examples
  const payload: CreatomateRender = { template_id: templateId, modifications };
  const res = await fetch("https://api.creatomate.com/v1/renders", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(payload),
  });
  if (!res.ok) {
    throw new Error(`Creatomate request failed: ${res.status} ${res.statusText}`);
  }
  // The response is an array of renders; one template request yields one entry
  const renders = await res.json();
  return renders[0];
}
Shotstack: Best Value Cloud API
- Free tier: 20 minutes/month of video rendering
- Pay-as-you-go: $0.40/min (or $0.20/min on subscription)
- 1 credit = 1 minute regardless of resolution
- JSON-based editing: Define timeline, clips, transitions, overlays, audio in JSON
- Merge fields: Swap text, images, video, colors via placeholders
- Zapier/Make integration for no-code workflows
- Free sandbox for unlimited testing (watermarked output)
// Shotstack API: JSON-based video composition
interface ShotstackEdit {
  timeline: {
    tracks: {
      clips: {
        asset: {
          type: "video" | "image" | "title" | "audio";
          src?: string;
          text?: string;
          style?: string;
        };
        start: number;
        length: number;
        transition?: { in?: string; out?: string };
      }[];
    }[];
  };
  output: {
    format: "mp4" | "gif" | "mp3";
    resolution: "sd" | "hd" | "1080" | "4k";
    aspectRatio?: "16:9" | "9:16" | "1:1" | "4:5";
  };
}

async function renderShotstack(
  edit: ShotstackEdit,
  apiKey: string
): Promise<{ id: string }> {
  // Production endpoint; the watermarked sandbox uses /edit/stage/render instead
  const res = await fetch("https://api.shotstack.io/edit/v1/render", {
    method: "POST",
    headers: {
      "x-api-key": apiKey,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(edit),
  });
  if (!res.ok) {
    throw new Error(`Shotstack request failed: ${res.status} ${res.statusText}`);
  }
  // The render id is nested under `response` in the returned envelope
  const data = await res.json();
  return { id: data.response.id };
}
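Merge fields are what make one edit reusable across many videos: the edit JSON carries placeholders such as `{{HEADLINE}}`, and each render supplies a `merge` array of find/replace pairs alongside the timeline. The sketch below builds that array from a plain object and previews the substitution locally. `toMergeFields` and `previewMerge` are hypothetical helpers, and the preview only approximates what the service does; it exists so templates can be checked without spending render credits.

```typescript
interface MergeField {
  find: string; // placeholder name, without the surrounding braces
  replace: string | number | boolean;
}

// Turn { HEADLINE: "..." } into the find/replace array used in the render request.
function toMergeFields(
  values: Record<string, string | number | boolean>
): MergeField[] {
  return Object.entries(values).map(([find, replace]) => ({ find, replace }));
}

function escapeRegExp(text: string): string {
  return text.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Local approximation of placeholder substitution, tolerant of spaces inside the braces.
function previewMerge(template: string, fields: MergeField[]): string {
  return fields.reduce((text, field) => {
    const pattern = new RegExp(
      `\\{\\{\\s*${escapeRegExp(field.find)}\\s*\\}\\}`,
      "g"
    );
    // A replacer function avoids "$" in values being treated as a capture reference.
    return text.replace(pattern, () => String(field.replace));
  }, template);
}
```

A render request would then send something like `{ ...edit, merge: toMergeFields({ HEADLINE: "Rates fall again" }) }` as the body.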
JSON2Video: Budget Option
- Credit system: 1 credit/second (1080p), 4 credits/second (4K)
- Free plan: 600 seconds (10 minutes) of rendering
- Professional ($50/mo): 200 minutes, max 10 min video length
- Startup ($100/mo): 500 minutes, max 30 min
- Best for: Simple text-over-image videos, slideshows, social content
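The credit system makes 4K four times as expensive as 1080p, which is easy to underestimate when budgeting. The sketch below applies the per-second rates from the list above. `creditsFor` and `videosWithinBudget` are hypothetical helpers, and rounding partial seconds up is an assumption about billing, not a documented rule.

```typescript
type Json2VideoResolution = "1080p" | "4k";

// Credits consumed per rendered second, as listed above.
const CREDITS_PER_SECOND: Record<Json2VideoResolution, number> = {
  "1080p": 1,
  "4k": 4,
};

// Credits needed for one video; partial seconds are rounded up (assumption).
function creditsFor(seconds: number, resolution: Json2VideoResolution): number {
  if (seconds <= 0) {
    throw new RangeError("seconds must be positive");
  }
  return Math.ceil(seconds) * CREDITS_PER_SECOND[resolution];
}

// How many videos of a given length fit in a credit budget.
function videosWithinBudget(
  budgetCredits: number,
  seconds: number,
  resolution: Json2VideoResolution
): number {
  return Math.floor(budgetCredits / creditsFor(seconds, resolution));
}
```

Treating the free plan's 600 seconds as 600 credits, that budget covers ten 60-second videos at 1080p but only two at 4K.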
Local / Self-Hosted Video Composition
| Resource | URL | Cost | Flexibility | Learning Curve | Best For |
|---|---|---|---|---|---|
| FFmpeg | ffmpeg.org | Free | 5/5 | Steep | Everything; the gold standard |
| Remotion | remotion.dev | Free/$100+/mo | 5/5 | Medium (React) | Programmatic, complex animations |
| MoviePy | zulko.github.io/moviepy | Free | 3/5 | Easy (Python) | Quick scripts, prototyping |
FFmpeg: The Foundation
Everything else is built on top of FFmpeg. If you learn one tool, learn this.
- Free and open source: runs everywhere
- Does everything: Transcode, cut, merge, overlay, add audio, add subtitles, resize, crop, filter
- No API: command-line tool, but wrappable in any language
- GPU acceleration: NVENC (NVIDIA), AMF (AMD), VideoToolbox (Apple)
ffmpeg \
  -i background.mp4 \
  -i voiceover.mp3 \
  -i music.mp3 \
  -filter_complex "\
    [0:v]scale=1080:1920:force_original_aspect_ratio=increase,crop=1080:1920[bg]; \
    [1:a][2:a]amix=inputs=2:duration=first:weights='1 0.15'[audio]; \
    [bg]subtitles=captions.srt:force_style='FontSize=24,PrimaryColour=&Hffffff&,Bold=1,Alignment=2,MarginV=100'[out]" \
  -map "[out]" -map "[audio]" \
  -c:v libx264 -preset fast -crf 23 \
  -c:a aac -b:a 192k \
  -t 60 \
  output.mp4
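Wrapping the command in code is mostly a matter of building the argument list safely, so that no shell quoting is involved. The sketch below produces the same arguments as the command above as a string array. `buildFfmpegArgs` and `ComposeOptions` are hypothetical names, and the caption path is inserted into the filter unescaped, so this version assumes simple file names without colons, quotes, or commas.

```typescript
interface ComposeOptions {
  background: string;
  voiceover: string;
  music: string;
  captions: string;     // .srt path; assumed to need no filter escaping
  output: string;
  musicVolume?: number; // relative weight of the music track, default 0.15
  maxSeconds?: number;  // hard cap on output length, default 60
}

// Builds the argument array for a 1080x1920 render with mixed audio and burned-in captions.
function buildFfmpegArgs(options: ComposeOptions): string[] {
  const musicVolume = options.musicVolume ?? 0.15;
  const maxSeconds = options.maxSeconds ?? 60;
  const filter = [
    // Fill the vertical frame, then crop the overflow.
    "[0:v]scale=1080:1920:force_original_aspect_ratio=increase,crop=1080:1920[bg]",
    // Voiceover at full weight, music underneath; output ends with the voiceover.
    `[1:a][2:a]amix=inputs=2:duration=first:weights='1 ${musicVolume}'[audio]`,
    `[bg]subtitles=${options.captions}:force_style='FontSize=24,PrimaryColour=&Hffffff&,Bold=1,Alignment=2,MarginV=100'[out]`,
  ].join(";");

  return [
    "-i", options.background,
    "-i", options.voiceover,
    "-i", options.music,
    "-filter_complex", filter,
    "-map", "[out]",
    "-map", "[audio]",
    "-c:v", "libx264", "-preset", "fast", "-crf", "23",
    "-c:a", "aac", "-b:a", "192k",
    "-t", String(maxSeconds),
    options.output,
  ];
}
```

In Node.js the array can be passed straight to `spawn("ffmpeg", args)` from `node:child_process`, which avoids the shell entirely.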
Remotion: React-Based Video
- Write videos as React components, with full programmatic control
- Free for individuals and small businesses
- Company license: $100+/mo minimum ($25/dev/mo + $10/1K renders)
- Lambda rendering: Distribute to AWS Lambda, ~$0.001/render for simple videos
- Self-hosted: Run on your own infrastructure
- Best for: Complex, data-driven videos with dynamic content
// Remotion: React video component
import React from "react";
import { AbsoluteFill, Audio, Sequence, Video } from "remotion";

// Each scene gets a fixed slot: 150 frames is 5 seconds at 30 fps
const FRAMES_PER_SCENE = 150;

export const FinanceExplainer: React.FC<{
  title: string;
  scenes: { footage: string; narration: string }[];
  voiceover: string;
  music: string;
}> = ({ scenes, voiceover, music }) => {
  return (
    <AbsoluteFill style={{ backgroundColor: "#0a0a0a" }}>
      {/* Music sits well under the voiceover */}
      <Audio src={music} volume={0.15} />
      <Audio src={voiceover} volume={1} />
      {scenes.map((scene, i) => (
        <Sequence key={i} from={i * FRAMES_PER_SCENE} durationInFrames={FRAMES_PER_SCENE}>
          <AbsoluteFill>
            <Video src={scene.footage} style={{ width: "100%", height: "100%" }} />
            <div
              style={{
                position: "absolute",
                bottom: 100,
                left: 40,
                right: 40,
                color: "white",
                fontSize: 28,
                fontWeight: "bold",
                textShadow: "2px 2px 4px rgba(0,0,0,0.8)",
              }}
            >
              {scene.narration}
            </div>
          </AbsoluteFill>
        </Sequence>
      ))}
    </AbsoluteFill>
  );
};
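The component above gives every scene the same 150-frame slot. When scenes carry their own durations (as generated scripts usually do), the `from` and `durationInFrames` values for each `Sequence` can be derived instead. The sketch below does that conversion; `sceneTimings` and its types are hypothetical, and 30 fps is an assumed default that should match the composition's configured frame rate.

```typescript
interface TimedScene {
  durationSeconds: number;
}

interface SceneTiming {
  from: number;             // first frame of the scene
  durationInFrames: number; // scene length in frames
}

// Convert per-scene durations in seconds into back-to-back frame ranges.
function sceneTimings(scenes: TimedScene[], fps: number = 30): SceneTiming[] {
  let cursor = 0;
  return scenes.map(({ durationSeconds }) => {
    // Keep every scene at least one frame long so it still renders.
    const durationInFrames = Math.max(1, Math.round(durationSeconds * fps));
    const timing: SceneTiming = { from: cursor, durationInFrames };
    cursor += durationInFrames;
    return timing;
  });
}
```

The sum of all `durationInFrames` values is also what the composition's total length should be set to.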
MoviePy: Quick Python Scripts
- pip install moviepy: Python library built on FFmpeg
- Easy API for concatenation, text overlays, transitions, compositing
- Best for: Prototyping pipelines, batch processing, simple compositions
- Gotcha: Slower than direct FFmpeg. Limited animation support. Not production-grade for complex video.
Video Composition Comparison
| Tool | Cost/min (at scale) | Templates | 9:16 Support | API Quality | Automation |
|---|---|---|---|---|---|
| Creatomate | ~$0.30-0.50 | Visual editor | Yes | Excellent | Full |
| Shotstack | $0.20-0.40 | JSON + editor | Yes | Excellent | Full |
| JSON2Video | ~$0.25 | Limited | Yes | Good | Full |
| Remotion | ~$0.001 (Lambda) | React code | Yes | N/A (self-hosted) | Full |
| FFmpeg | $0 (compute only) | None | Yes | N/A (CLI) | Full |
| MoviePy | $0 (compute only) | None | Yes | N/A (Python) | Full |
Key insight: For <50 videos/month, use Shotstack or Creatomate (cloud APIs, zero infrastructure). For >50 videos/month, invest in Remotion or FFmpeg pipelines; the per-video cost drops 100x.
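The trade-off in the key insight can be checked with a few lines of arithmetic. The sketch below is illustrative only: the `RenderOption` shape and the `monthlyRenderCost` helper are hypothetical names, and the per-minute prices you pass in should come from the comparison table or from the vendors' current pricing pages.

```typescript
// Hypothetical helper for comparing rendering options by monthly cost.
// Prices are inputs, not facts: take them from the comparison table
// or from each vendor's current pricing page.
interface RenderOption {
  name: string;
  costPerMinute: number; // USD per rendered minute
  fixedMonthly: number;  // USD per month regardless of volume (plan fee, servers)
}

function monthlyRenderCost(
  option: RenderOption,
  videosPerMonth: number,
  minutesPerVideo: number
): number {
  const renderedMinutes = videosPerMonth * minutesPerVideo;
  return option.fixedMonthly + option.costPerMinute * renderedMinutes;
}

// Returns the cheapest option for a given volume.
function cheapestOption(
  options: RenderOption[],
  videosPerMonth: number,
  minutesPerVideo: number
): RenderOption {
  if (options.length === 0) {
    throw new Error("At least one render option is required");
  }
  return options.reduce((best, candidate) =>
    monthlyRenderCost(candidate, videosPerMonth, minutesPerVideo) <
    monthlyRenderCost(best, videosPerMonth, minutesPerVideo)
      ? candidate
      : best
  );
}
```

Setting `fixedMonthly` to your own estimate of engineering and hosting overhead for a self-hosted option gives a more honest break-even point than per-minute price alone.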
Templates eliminate design decisions. A good template turns "script + footage + audio" into a finished video with consistent branding.
| Resource | URL | Free Tier | API/Programmatic | 9:16 Vertical | Best For |
|---|---|---|---|---|---|
| Canva | canva.com | Yes | No | Yes (extensive) | Non-designers, quick social content |
| CapCut | capcut.com | Yes | No (plugin-only SDK) | Yes (native) | TikTok-optimized, trending styles |
| InVideo AI | invideo.io | Yes (limited) | No | Yes | AI-generated from text prompts |
| Envato Elements | elements.envato.com | No | No | Some | After Effects, Premiere Pro templates |
| Creatomate | creatomate.com | 50 credits | Yes (API) | Yes | Programmatic template rendering |
| Motion Array | motionarray.com | Limited | No | Some | Premiere Pro, After Effects templates |
CapCut: TikTok's Native Editor
- Free: Auto-captions, text-to-speech, AI avatars, script-to-video, trending templates
- Pro (optional): Remove watermark, more export options
- No public API: Open Platform is editor-plugin only, not server-side rendering
- AutoCut feature: AI identifies viral moments in long footage, auto-clips to vertical
- Best for: Manual short-form creation, fast iteration, TikTok-native styles
- Gotcha: No automation path. Every video requires manual editor interaction.
Creatomate Templates: The Automation Winner
- Only template system with a full REST API for programmatic rendering
- Visual editor: Build templates in-browser with drag-and-drop
- Template gallery: Pre-built templates for social media, ads, product demos
- Dynamic elements: Swap text, images, video, colors, fonts via API
- Best for: Branded content at scale with consistent look
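Swapping dynamic elements via the API amounts to sending a template ID plus a map of modifications. The sketch below only builds the request; the endpoint URL, the `template_id` and `modifications` field names, and the bearer-token header are assumptions based on Creatomate's public REST documentation and should be checked against the current docs. `buildCreatomateRender` and the element names in the usage note are hypothetical.

```typescript
// Describes an HTTP request without sending it, so it can be inspected or tested.
interface RenderRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Assumption: renders are created by POSTing a template ID and a map of
// element modifications to the v1 renders endpoint. Verify against the docs.
function buildCreatomateRender(
  apiKey: string,
  templateId: string,
  modifications: Record<string, string>
): RenderRequest {
  return {
    url: "https://api.creatomate.com/v1/renders",
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ template_id: templateId, modifications }),
  };
}
```

A caller would pass the result to `fetch(request.url, request)`, with modification keys such as `"Title"` or `"Background-Video"` matching whatever element names the template defines in the visual editor.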
Captions are non-negotiable. 85% of Facebook video is watched without sound. TikTok's algorithm favors captioned content. Animated word-by-word captions are the current standard.
Caption Generation Tools
| Resource | URL | Free Tier | Word-Level | Styling | API |
|---|---|---|---|---|---|
| CapCut | capcut.com | Yes | Yes | Extensive | No |
| Kapwing | kapwing.com | 10 min/mo | Yes | 100+ presets | No |
| Descript | descript.com | 1 hr transcription | Yes | Limited | No |
| Whisper | github.com/openai/whisper | Unlimited (local) | Sentence-level | None (SRT output) | N/A |
| WhisperX | github.com/m-bain/whisperX | Unlimited (local) | Yes (word-level) | None (SRT output) | N/A |
| whisper-timestamped | github.com/linto-ai/whisper-timestamped | Unlimited (local) | Yes (word-level) | None | N/A |
Whisper + WhisperX: The Pipeline Solution
Native Whisper produces segment-level (roughly sentence-level) timestamps by default, accurate to about 1 second. For the word-level sync that animated captions need, use WhisperX or whisper-timestamped.
- Whisper: Free, open source by OpenAI. Runs locally. Multiple model sizes (tiny → large).
- WhisperX: Adds word-level timestamps via forced alignment. Speaker diarization. Faster inference.
- whisper-timestamped: DTW-based word timestamps on cross-attention weights. Drop-in Whisper extension.
// Pipeline: Whisper transcription → word-level SRT → FFmpeg burn-in
import { execFileSync } from "child_process";
import { readFileSync } from "fs";

function transcribeWithWordTimestamps(audioPath: string): string {
  // whisper-timestamped outputs word-level JSON.
  // execFileSync passes arguments without a shell, so paths with spaces are safe.
  execFileSync("whisper_timestamped", [
    audioPath, "--model", "medium", "--language", "en", "--output_format", "json",
  ]);
  // Assumes the JSON lands next to the audio file with a .json extension;
  // check the output name produced by your whisper-timestamped version.
  return audioPath.replace(/\.\w+$/, ".json");
}

function generateAnimatedSRT(transcriptionJson: string): string {
  const data = JSON.parse(readFileSync(transcriptionJson, "utf-8"));
  let srt = "";
  let index = 1;
  // One SRT cue per word produces the word-by-word caption effect.
  for (const segment of data.segments) {
    for (const word of segment.words) {
      const start = formatTimestamp(word.start);
      const end = formatTimestamp(word.end);
      srt += `${index}\n${start} --> ${end}\n${word.text.trim()}\n\n`;
      index++;
    }
  }
  return srt;
}

function formatTimestamp(seconds: number): string {
  // Work in integer milliseconds to avoid floating-point rounding errors.
  const totalMs = Math.round(seconds * 1000);
  const h = Math.floor(totalMs / 3_600_000);
  const m = Math.floor((totalMs % 3_600_000) / 60_000);
  const s = Math.floor((totalMs % 60_000) / 1000);
  const ms = totalMs % 1000;
  const pad = (n: number, width: number) => String(n).padStart(width, "0");
  return `${pad(h, 2)}:${pad(m, 2)}:${pad(s, 2)},${pad(ms, 3)}`;
}

function burnCaptions(videoPath: string, srtPath: string, outputPath: string): void {
  const style =
    "FontSize=22,PrimaryColour=&H00FFFFFF,Bold=1,Alignment=2,MarginV=80,BackColour=&H80000000,BorderStyle=4";
  // Re-encodes video with burned-in subtitles; audio is copied unchanged.
  execFileSync("ffmpeg", [
    "-i", videoPath,
    "-vf", `subtitles=${srtPath}:force_style='${style}'`,
    "-c:a", "copy",
    outputPath,
  ]);
}
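One-word-per-cue captions can flicker on fast speech, so a common variation is to group a few words into each cue. The following is a minimal sketch of that idea, not part of any of the tools above: `groupWords`, its `maxWords` and `maxGapSeconds` parameters, and the `Word`/`Cue` shapes are hypothetical, with `Word` mirroring the `text`/`start`/`end` fields used in the pipeline code.

```typescript
interface Word {
  text: string;
  start: number; // seconds
  end: number;   // seconds
}

interface Cue {
  text: string;
  start: number;
  end: number;
}

// Groups consecutive words into short caption cues. A new cue starts when the
// current one is full or when there is a pause longer than maxGapSeconds.
function groupWords(words: Word[], maxWords = 3, maxGapSeconds = 0.6): Cue[] {
  const cues: Cue[] = [];
  let current: Word[] = [];

  const flush = (): void => {
    if (current.length === 0) return;
    cues.push({
      text: current.map((w) => w.text.trim()).join(" "),
      start: current[0].start,
      end: current[current.length - 1].end,
    });
    current = [];
  };

  for (const word of words) {
    const previous = current[current.length - 1];
    const cueIsFull = current.length >= maxWords;
    const longPause = previous !== undefined && word.start - previous.end > maxGapSeconds;
    if (cueIsFull || longPause) flush();
    current.push(word);
  }
  flush();
  return cues;
}
```

Each resulting cue can be written as one SRT entry using the same timestamp formatting as above.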
Kapwing: Best Browser-Based
- Free: 10 min auto-subtitling/mo, watermark on exports, 720p max
- Pro ($16/mo): 300 min, no watermark, 4K, AI editing tools
- 99% accuracy claim on auto-subtitle generation
- 100+ caption presets: animated word-by-word, karaoke style, etc.
- Best for: Quick manual captioning when you don't want to run Whisper locally
Descript: The Editor That Transcribes
- Free: 1 hour transcription
- Hobbyist ($16/mo): 10 hours
- Creator ($24/mo): 30 hours, Overdub voice, AI Green Screen, Studio Sound
- Text-based editing: Edit video by editing the transcript; delete words to delete video
- 25 languages supported
- Speaker detection: Auto-labels multiple speakers
- Best for: Podcast-to-video, interview content, audio-first workflows
Thumbnails determine click-through rate. A 2-percentage-point CTR difference (for example, 2% versus 4%) can double your views from the same number of impressions.
| Resource | URL | Free Tier | AI Generation | Quality | Best For |
|---|---|---|---|---|---|
| Canva | canva.com | Yes | Yes (AI maker) | 4/5 | Template-based, consistent branding |
| Figma | figma.com | Yes | Community plugins | 5/5 | Custom design, pixel-perfect |
| Thumbly | thumbly.ai | $0.01/thumbnail | Yes | 3/5 | Bulk generation, budget option |
| Midjourney | midjourney.com | No | Yes | 5/5 | AI-generated hero images |
| DALL-E | openai.com/dall-e | Via API | Yes | 4/5 | Programmatic generation |
| Flux | blackforestlabs.ai | Via Replicate | Yes | 5/5 | Open-source, self-hostable |
Thumbly: AI Thumbnails for YouTube
- $0.01/thumbnail ($1 for 100 credits)
- $3.99 per on-demand thumbnail (pay-as-you-go)
- Generates multiple options in under 10 seconds
- Full commercial rights on all generated thumbnails
- Best for: High-volume channels that need fast, cheap thumbnail options
- Gotcha: Quality is functional but not premium. Use for A/B testing, then refine winners in Canva/Figma.
Thumbnail Best Practices for CTR
- Face with emotion (if applicable): 2-3x higher CTR than no face
- Large, bold text: 3-5 words max, readable at mobile thumbnail size
- High contrast: bright colors against dark, or vice versa
- Rule of thirds: face on one side, text on the other
- Curiosity gap: show enough to intrigue, not enough to satisfy
- 1280x720px minimum (16:9 ratio) for YouTube
- A/B test: tools like ThumbnailTest.com or YouTube's built-in test feature
Publishing what's trending multiplies views 5-10x versus publishing what you feel like making. Trend intelligence is the highest-leverage investment in a video pipeline.
| Resource | URL | Free | Pricing | API | Platforms Covered |
|---|---|---|---|---|---|
| Virlo | virlo.ai | No | $29-49/mo | Yes (dev.virlo.ai) | TikTok, YouTube Shorts, Reels |
| Google Trends | trends.google.com | Yes | Free | Unofficial scrapers | Web search, YouTube |
| TikTok Creative Center | ads.tiktok.com/business/creativecenter | Yes | Free | No (3rd-party scrapers) | TikTok |
| YouTube Trending | youtube.com/feed/trending | Yes | Free | YouTube Data API | YouTube |
| Exploding Topics | explodingtopics.com | Limited | $39/mo+ | Pro Business only | Cross-platform |
| VidIQ | vidiq.com | Yes (limited) | $17.50/mo+ | No | YouTube |
| TubeBuddy | tubebuddy.com | Yes (limited) | $2.25-14.50/mo | No | YouTube |
Virlo: The Short-Form Intelligence API
- 25+ API endpoints: Social listening, monitoring, creator intelligence, hashtags, trends
- Research Analyst ($29/mo): Unlimited viral outliers, niche analysis, trend data
- Starter Creator ($49/mo): + 1,000 credits, TikTok/YouTube searches, AI script/audio/video generation
- API features:
  - Keyword search across TikTok, YouTube, Instagram: returns paginated videos + creator outliers
  - Trending niches with virality scores
  - Cultural velocity measurement (trend rise/decay speed)
  - Historical tracking for trend forecasting
  - Creator lookup by platform/username: profile stats, engagement, top content
Key insight: Virlo is the only API that provides structured, machine-readable trend data for short-form video. Google Trends and TikTok Creative Center are free but require scraping. Virlo at $29/mo eliminates the scraping overhead and adds virality scoring.
TikTok Creative Center: Free Manual Intelligence
- Trending hashtags (last 7 days) with view counts and growth
- Trending songs: what audio is going viral
- Top ads: see what's working in paid TikTok campaigns
- No official API; third-party scrapers available via Apify ($5-20/mo)
- Best for: Manual weekly research, understanding what formats work
VidIQ vs. TubeBuddy: YouTube SEO
| Feature | VidIQ | TubeBuddy |
|---|---|---|
| Keyword research | Strongest: Keyword Inspector with volume, competition, score | Good: SEO Studio |
| Bulk processing | No | Yes: cards, end screens, descriptions |
| Pricing | $17.50/mo (Boost) | $2.25/mo (Pro) to $14.50/mo (Legend) |
| A/B testing | No | Yes: thumbnail + title testing |
| Best for | Keyword research, competitor analysis | Channel management, optimization at scale |
Publishing to one platform is easy. Publishing to six platforms with correct formats, captions, and scheduling is a workflow nightmare without tools.
| Resource | URL | Free Tier | Platforms | Auto-Post | API |
|---|---|---|---|---|---|
| Buffer | buffer.com | 3 channels, 10 posts each | All major | Yes | No public API |
| Later | later.com | Limited | Instagram, TikTok, Facebook, X, Pinterest | Yes | No |
| Repurpose.io | repurpose.io | No | YouTube, TikTok, IG, FB, Snapchat, Pinterest, LinkedIn, X, Bluesky | Yes (auto cross-post) | No |
| YouTube Data API | developers.google.com/youtube/v3 | 10K units/day | YouTube | Yes (programmatic) | Yes |
| TikTok API | developers.tiktok.com | Limited | TikTok | Limited | Yes (restricted) |
| Meta Graph API | developers.facebook.com | Yes | Instagram, Facebook | Yes (programmatic) | Yes |
Buffer: Simple Multi-Platform Scheduling
- Free: 3 channels, 10 scheduled posts per channel
- Essentials ($5/mo/channel): Unlimited scheduling, analytics
- Team ($10/mo/channel): Unlimited users, approval workflows
- Supports: Facebook, Instagram, LinkedIn, Pinterest, TikTok, Twitter/X, YouTube, Mastodon, Bluesky, Threads
- No public developer API; scheduling through their interface only
Repurpose.io: The Cross-Post Automator
- $29-149/mo (annual billing saves ~17%)
- Auto-reformats content per platform: resizes, adds captions, adjusts aspect ratios
- Workflow automation: YouTube long-form → auto-clip to TikTok, Reels, Shorts
- Supports 9+ platforms including YouTube, TikTok, Instagram, Facebook, LinkedIn, X, Snapchat, Pinterest, Bluesky
- Best for: "Publish once, distribute everywhere" workflow
YouTube Data API: Programmatic Upload
- Free: 10,000 quota units/day (1 upload = 1,600 units, so about 6 uploads/day)
- OAuth 2.0 required for uploads
- Post-July 2020 projects: Uploads default to private until API project is verified/audited
- Set metadata: Title, description, tags, category, privacy, thumbnails
- Best for: Automated publishing pipelines
// YouTube Data API v3: programmatic video upload
import { createReadStream } from "fs";
import { google } from "googleapis";

async function uploadToYouTube(
  // An authorized OAuth 2.0 client (uploads require OAuth, not an API key).
  auth: InstanceType<typeof google.auth.OAuth2>,
  videoPath: string,
  metadata: {
    title: string;
    description: string;
    tags: string[];
    categoryId: string;
    privacyStatus: "private" | "unlisted" | "public";
  }
): Promise<string> {
  const youtube = google.youtube({ version: "v3", auth });
  const res = await youtube.videos.insert({
    // `part` lists the resource sections being set in requestBody.
    part: ["snippet", "status"],
    requestBody: {
      snippet: {
        title: metadata.title,
        description: metadata.description,
        tags: metadata.tags,
        categoryId: metadata.categoryId,
      },
      status: {
        privacyStatus: metadata.privacyStatus,
        selfDeclaredMadeForKids: false,
      },
    },
    media: {
      // Stream the file from disk rather than loading it into memory.
      body: createReadStream(videoPath),
    },
  });
  if (!res.data.id) {
    throw new Error("Upload succeeded but the response contained no video ID");
  }
  return res.data.id;
}
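The quota figures in the bullets above (10,000 units per day, 1,600 units per upload) determine how quickly a backlog can be published. A small sketch of that arithmetic follows; the function names are hypothetical, and the defaults are the numbers quoted in this section, which should be re-checked against the current quota documentation.

```typescript
// Defaults come from the figures quoted in this section, not from a live API.
function uploadsPerDay(dailyQuotaUnits = 10_000, unitsPerUpload = 1_600): number {
  return Math.floor(dailyQuotaUnits / unitsPerUpload);
}

// Days needed to publish a backlog when uploads are the only quota consumer.
function daysToPublish(
  backlog: number,
  dailyQuotaUnits = 10_000,
  unitsPerUpload = 1_600
): number {
  const perDay = uploadsPerDay(dailyQuotaUnits, unitsPerUpload);
  if (perDay === 0) {
    throw new Error("Daily quota is too small for a single upload");
  }
  return Math.ceil(backlog / perDay);
}
```

Other calls (setting thumbnails, listing videos) also consume quota, so the real ceiling in a busy pipeline is somewhat lower than this estimate.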
These tools promise to handle the entire pipeline, from script to published video. They trade flexibility for convenience.
| Resource | URL | Pricing | Script | Footage | Voice | Captions | Publish | API |
|---|---|---|---|---|---|---|---|---|
| AutoShorts.ai | autoshorts.ai | $19-69/mo | Yes | Yes | Yes | Yes | Yes (TikTok, YouTube, IG) | No |
| InVideo AI | invideo.io | $0-100/mo | Yes | Yes (16M+ clips) | Yes (50+ langs) | Yes | No | No |
| Pictory | pictory.ai | $25-119/mo | No | Yes (3M+ clips) | Yes (ElevenLabs) | Yes | No | Team plan+ |
| Fliki | fliki.ai | $0-44/mo | Yes | Yes | Yes (2K+ voices) | Yes | No | No |
| Vizard.ai | vizard.ai | $0-30/seat/mo | No | N/A (input video) | No | Yes (32 langs) | No | No |
| OpusClip | opus.pro | $0-29/mo | No | N/A (input video) | No | Yes | Yes (scheduler) | Business only |
| ShortsDaily | shortsdaily.com | Credit-based | Yes | Yes (AI visuals) | Yes | Yes | Yes | No |
| Syllaby | syllaby.io | $29-99/mo | Yes | Yes | Yes | Yes | No | No |
AutoShorts.ai: Faceless Shorts on Autopilot
- Free: 1 video
- Starter ($19/mo): 3 videos/week
- Daily ($39/mo): 1 video/day
- Hardcore ($69/mo): 2 videos/day
- Handles: Script → stock footage → voiceover → captions → auto-publish
- Platforms: TikTok, YouTube Shorts, Instagram Reels
- Best for: Set-and-forget faceless short-form channels
- Gotcha: Limited customization. Videos look similar to other AutoShorts users.
InVideo AI: Most Capable All-in-One
- Free tier available with limited features
- Plus ($28/mo): HD exports, 100GB storage, iStock media
- Max ($50/mo): Unlimited exports, 400GB storage
- Generative ($100/mo): 200 min/mo AI generation (Sora 2 + VEO 3.1)
- Text-based editing: Type changes into "Magic Box" and AI applies them
- 16M+ stock assets from iStock integrated
- 50+ language voiceovers
- Brand Kit for consistent styling
- Best for: Creators who want maximum AI assistance with manual polish
OpusClip: Best Long-to-Short Converter
- Free: 60 credits/mo (~60 min processing), watermarked
- Starter ($15/mo): 150 credits
- Pro ($29/mo): 300 credits, AI B-roll, social scheduler, XML export
- Business (custom): API access
- AI features: Auto-clip viral moments, facial tracking, dynamic captions, B-roll insertion
- Viral score: Predicts clip potential
- Best for: Repurposing long-form content into shorts
Vizard.ai: The Smart Clipper
- Free: 120 upload minutes/mo
- Creator ($20/mo): 1,200 minutes, unlimited exports
- Team ($30/seat/mo): 6,000 minutes, faster processing
- AI reframing: Facial tracking keeps speakers centered in vertical crop
- 32-language captions with auto-translation
- Viral score per clip
- Best for: Podcast/interview content → social clips
Pictory: Article-to-Video Specialist
- Free trial: 14 days, 3 projects, all features, watermarked
- Starter ($25/mo annual): 200 video minutes
- Professional ($35/mo): 600 min, ElevenLabs integration, 29 languages
- Team ($119/mo): API access, collaboration
- Workflows: Text-to-Video, URL-to-Video, PPT-to-Video, Audio-to-Video
- 3M+ video clips and 15K+ music tracks included
- Best for: Blog-to-video conversion, article repurposing
All-in-One Comparison
| Criteria | AutoShorts | InVideo AI | Pictory | OpusClip | Vizard |
|---|---|---|---|---|---|
| Input type | Topic/prompt | Text/URL/script | Text/URL/media | Long video | Long video |
| Output | Shorts | Any format | Any format | Shorts | Shorts |
| Customization | Low | High | Medium | Medium | Medium |
| Auto-publish | Yes | No | No | Yes (Pro) | No |
| API | No | No | Team+ | Business | No |
| Starting cost | $19/mo | $0 | $25/mo | $0 | $0 |
| Best quality | 3/5 | 4/5 | 4/5 | 4/5 | 4/5 |
Key insight: All-in-one tools are great for getting started fast but terrible for scale. They limit customization, charge per-video premiums, and lock you into their rendering pipeline. Use them to validate which video formats work, then rebuild winners with composable tools (FFmpeg + Remotion + your own templates).
Making videos is a cost center until you connect them to revenue. These are the primary monetization paths for video content.
| Platform | URL | Revenue Model | Threshold | RPM Range | Best For |
|---|---|---|---|---|---|
| YouTube Partner Program | youtube.com/creators | AdSense (ads) | 1K subs + 4K watch hours | $2-21 | Long-form, evergreen |
| TikTok Creator Rewards | tiktok.com/creators | Per-view payment | 10K followers + 100K views/30d | $0.50-3.00/1K | Short-form, viral |
| Content Rewards | contentrewards.com | Brand campaigns (per view) | None | $1-2.50/1K | UGC, brand content |
| Instagram Bonuses | instagram.com | Reels Play bonuses | Invite-only | Varies | Reels creators |
| Grin | grin.co | Sponsorship marketplace | Brand-dependent | Per-deal | DTC brand sponsorships |
| Aspire | aspire.io | Sponsorship marketplace | Brand-dependent | $150-300/video | UGC, influencer deals |
YouTube Partner Program: The Long Game
- Threshold: 1,000 subscribers + 4,000 watch hours (or 10M Shorts views in 90 days)
- Finance niche RPM: $9-$21 (highest of any niche)
- At 50K views/mo: $450-1,050/mo
- At 500K views/mo: $4,500-10,500/mo
- Evergreen: Videos keep earning for years
- Sponsor-friendly: $500-5K per integration for finance niche
- Time to first dollar: 1-3 months
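The revenue figures above follow directly from RPM (revenue per 1,000 views). A minimal sketch of the calculation, using a hypothetical `monthlyAdRevenue` helper and the $9-$21 finance RPM range quoted in this section:

```typescript
interface RevenueRange {
  low: number;  // USD per month at the low RPM
  high: number; // USD per month at the high RPM
}

// RPM is revenue per 1,000 views, so revenue = (views / 1000) * RPM.
function monthlyAdRevenue(monthlyViews: number, rpmLow: number, rpmHigh: number): RevenueRange {
  const thousands = monthlyViews / 1000;
  return { low: thousands * rpmLow, high: thousands * rpmHigh };
}

// Views needed per month to reach a revenue target at a given RPM.
function viewsForTarget(targetUsd: number, rpm: number): number {
  return Math.ceil((targetUsd / rpm) * 1000);
}
```

For example, `monthlyAdRevenue(50_000, 9, 21)` reproduces the $450-1,050 range listed for 50K views per month.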
Content Rewards: Fastest to First Dollar
- No audience required: anyone can post
- Earn per view on brand campaigns ($1-2.50/1K views)
- Browse campaigns with specific creative requirements
- Creator testimonials: Claims of $4K-$11K/month
- Best for: New creators who need revenue validation before building audience
- Platform: Runs on Whop
Sponsorship Marketplaces
- Grin: Built for DTC/ecommerce brands. Subscription-based (contact sales). Best for ongoing partnerships.
- Aspire: Starts at ~$2,000/mo. World's largest sponsorship marketplace. Best for brand campaigns.
- Average UGC creator rate (2026): $150-300/video, average $212/video
- UGC market growth: 93% increase in creators
| Don't | Do Instead | Why |
|---|---|---|
| Build a stock footage scraper | Use Pexels/Pixabay APIs | Free, legal, better indexed, maintained by someone else |
| Build custom TTS | Use edge-tts (free) or ElevenLabs (paid) | Voice quality requires billions in training data |
| Build a music library | Use Pixabay Music or Suno | Licensing alone makes this impossible to self-host |
| Build caption rendering from scratch | Use Whisper + FFmpeg subtitle filter | Solved problem with production-grade open source |
| Pay for all-in-one before validating | Use free tiers of 3-4 composable tools | All-in-ones charge 10x per video vs. DIY at scale |
| Automate posting before manual validation | Post manually for 2 weeks first | You'll change strategy 3x before it stabilizes |
| Skip trend research | Use TikTok Creative Center (free) or Virlo (paid) | Publishing without trend data is random, and you'll waste months |
| Build a video editor | Use FFmpeg + Remotion | Video editing is a billion-dollar problem. Use the solutions. |
| Pay for YouTube SEO tools on day one | Use free tiers of VidIQ/TubeBuddy first | Most value is in keyword research, available on free/cheap plans |
| Render in the cloud at low volume | Use local FFmpeg/Remotion | Cloud rendering has minimums and per-minute charges that don't make sense below 50 videos/month |
| Build a cross-posting tool | Use Buffer (free) or Repurpose.io ($29/mo) | Each platform API has quirks that take months to master |
| Generate AI music for commercial use with MusicGen | Use Suno Pro ($10/mo) | MusicGen model weights are CC-BY-NC (non-commercial only) |
The $0/mo Stack: Minimum Viable Pipeline
Everything here is genuinely free with no catch (no watermarks, no attribution required for most).
| Stage | Tool | Notes |
|---|---|---|
| Script | Gemini API (free tier) or Ollama (local) | 1.5K requests/day on Gemini free |
| Stock footage | Pexels API | 20K requests/mo, portrait filter |
| Stock music | Pixabay Music | No attribution, commercial OK |
| Sound effects | Freesound | CC-licensed, check per sound |
| Voice | edge-tts | 300+ voices, no API key |
| Captions | Whisper + whisper-timestamped | Local, word-level timestamps |
| Video composition | FFmpeg | The gold standard, steep learning curve |
| Thumbnails | Canva free tier | Templates + AI maker |
| Trend research | TikTok Creative Center + Google Trends | Manual but free |
| Publishing | Manual or Buffer free (3 channels) | 10 posts/channel |
| Monetization | Content Rewards, TikTok Creator Rewards | No threshold for Content Rewards |
What you get: Fully functional pipeline for 10-20 videos/month. Voice quality is good (edge-tts), not great. Video composition requires FFmpeg knowledge. Manual trend research takes 1-2 hours/week.
What you sacrifice: Premium voice quality, automated posting, trend intelligence, visual template system.
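The Pexels row above relies on the API's portrait filter to source vertical footage. The sketch below shows one way to build that search and pick a 9:16 file from a result. It assumes the video search endpoint at `https://api.pexels.com/videos/search`, the `orientation` and `per_page` query parameters, an `Authorization` header carrying the API key, and a response whose videos each contain a `video_files` array with `link`, `width`, and `height`; confirm all of these against the Pexels API documentation. The helper names are hypothetical.

```typescript
// Minimal subset of the response shape this sketch relies on (an assumption).
interface PexelsVideoFile {
  link: string;
  width: number;
  height: number;
}

interface PexelsVideo {
  id: number;
  duration: number;
  video_files: PexelsVideoFile[];
}

// Builds the search URL; send it with the header { Authorization: <API key> }.
function buildPexelsSearchUrl(query: string, perPage = 10): string {
  return (
    "https://api.pexels.com/videos/search" +
    `?query=${encodeURIComponent(query)}&orientation=portrait&per_page=${perPage}`
  );
}

// Picks the vertical rendition whose height is closest to the target (1920 for 1080x1920).
function pickVerticalFile(video: PexelsVideo, targetHeight = 1920): PexelsVideoFile | undefined {
  const vertical = video.video_files.filter((f) => f.height > f.width);
  vertical.sort(
    (a, b) => Math.abs(a.height - targetHeight) - Math.abs(b.height - targetHeight)
  );
  return vertical[0];
}
```

Choosing the rendition closest to the output size avoids downloading 4K files that FFmpeg would scale down anyway.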
The $50/mo Stack: Quality Pipeline
| Stage | Tool | Cost | Why |
|---|---|---|---|
| Script | Claude API (Sonnet) | ~$5/mo | Best script quality |
| Stock footage | Pexels API + Storyblocks | $21/mo | Unlimited downloads |
| Stock music | Pixabay Music + Suno Pro | $10/mo | Custom tracks when needed |
| Sound effects | Freesound | $0 | CC-licensed |
| Voice | ElevenLabs Creator | $11/mo | 100K chars (~35 min), voice cloning |
| Captions | WhisperX (local) | $0 | Word-level, speaker diarization |
| Video composition | FFmpeg + Shotstack | ~$0-5/mo | Local for most, cloud for complex |
| Thumbnails | Canva Pro | $0 (bundled with Canva sub if needed) | Or Figma free |
| Trend research | TikTok Creative Center | $0 | Manual research |
| Publishing | Buffer Essentials (3 channels) | $15/mo | Unlimited scheduling |
| Monetization | YouTube Partner + Content Rewards | $0 | Dual-track revenue |
| Total | | ~$62-67/mo | Sum of the line items above |
What you get: Production-quality voice (ElevenLabs), unlimited stock (Storyblocks), custom music when stock doesn't fit (Suno), automated scheduling (Buffer), best scripts (Claude).
What you sacrifice: API-level trend intelligence, automated cross-posting, all-in-one convenience.
The $200/mo Stack: Scale Pipeline
| Stage | Tool | Cost | Why |
|---|---|---|---|
| Script | Claude API (Sonnet + Opus) | ~$15/mo | Opus for long-form narratives |
| Stock footage | Storyblocks All Access | $30/mo | Unlimited everything |
| Stock music | Epidemic Sound | $15/mo | 50K+ tracks, API integration |
| Sound effects | Epidemic Sound (included) | $0 | 200K+ SFX |
| Voice | ElevenLabs Pro | $99/mo | 500K chars, voice cloning, lower overage |
| Captions | WhisperX (local) | $0 | Word-level |
| Video composition | Remotion (self-hosted) | $0 (under threshold) | React-based, Lambda rendering |
| Thumbnails | Canva Pro + Midjourney | ~$10/mo | AI generation for hero images |
| Trend research | Virlo Research Analyst | $29/mo | API access, 25+ endpoints |
| Publishing | Buffer + Repurpose.io | $15+29/mo | Auto cross-post to all platforms |
| Monetization | YouTube Partner + Sponsorships | $0 | Multiple revenue streams |
| Total | | ~$242/mo | |
What you get: Full automation pipeline. 100+ videos/month capacity. API-driven trend intelligence. Cross-platform auto-publishing. Premium voice and music. Data-driven content decisions.
What you sacrifice: Nothing meaningful. This stack covers 95% of use cases.
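Stack totals are easy to get wrong as tools are swapped in and out, so it helps to keep the line items as data and compute the sum. The sketch below uses the costs listed in the $200/mo table; `LineItem` and `stackTotal` are hypothetical names, and approximate costs (marked with ~ in the table) are entered at face value.

```typescript
interface LineItem {
  stage: string;
  monthlyUsd: number;
}

function stackTotal(items: LineItem[]): number {
  return items.reduce((sum, item) => sum + item.monthlyUsd, 0);
}

// Line items copied from the $200/mo table above.
const scaleStack: LineItem[] = [
  { stage: "Script", monthlyUsd: 15 },
  { stage: "Stock footage", monthlyUsd: 30 },
  { stage: "Stock music", monthlyUsd: 15 },
  { stage: "Sound effects", monthlyUsd: 0 },
  { stage: "Voice", monthlyUsd: 99 },
  { stage: "Captions", monthlyUsd: 0 },
  { stage: "Video composition", monthlyUsd: 0 },
  { stage: "Thumbnails", monthlyUsd: 10 },
  { stage: "Trend research", monthlyUsd: 29 },
  { stage: "Publishing", monthlyUsd: 15 + 29 }, // Buffer + Repurpose.io
  { stage: "Monetization", monthlyUsd: 0 },
];
```

`stackTotal(scaleStack)` gives 242, matching the table's total; voice is the single largest line, so that is the first place to look when trimming the budget.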
- Pipeline orchestration: The glue that connects script generation → asset selection → composition → captioning → publishing. No existing tool handles your specific workflow end-to-end.
- Template system: Your brand-specific video templates (intro, outro, lower thirds, text styles). Build once in Remotion or Creatomate, reuse forever.
- Content intelligence layer: Connecting trend data (Virlo) to your content library to decide what to produce next. This is your competitive advantage.
- Analytics aggregation: Pulling performance data from YouTube, TikTok, Instagram into one view to decide what's working.
- A/B testing framework: Publishing variants and measuring which hooks, thumbnails, and formats perform best.

- Text-to-speech engine: Requires billions in training data. Use ElevenLabs or edge-tts.
- Stock footage library: Pexels and Pixabay have millions of clips with free APIs.
- Music library or music generation model: Use Pixabay Music, Epidemic Sound, or Suno.
- Speech-to-text / captioning engine: Whisper is free and state-of-the-art.
- Video encoding/rendering: FFmpeg is the gold standard. Don't rewrite it.
- Social media posting APIs: Each platform has quirks. Use Buffer or Repurpose.io.
- Video editor: Adobe, CapCut, and DaVinci Resolve have spent billions. You won't beat them.
- Thumbnail design tool: Canva exists. Use it.
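Pipeline orchestration, the first item on the build-custom list, can start very small. The sketch below is one possible shape rather than a prescribed design: stages run in order over a shared context object, and each stage is retried a fixed number of times before the run fails. All names (`PipelineContext`, `Stage`, `runPipeline`) and the context fields are hypothetical.

```typescript
// Shared state passed from stage to stage; fields fill in as the run progresses.
interface PipelineContext {
  topic: string;
  script?: string;
  assets?: string[];
  videoPath?: string;
  captionsPath?: string;
  publishedUrl?: string;
}

interface Stage {
  name: string;
  run: (ctx: PipelineContext) => Promise<PipelineContext>;
}

// Runs stages sequentially, retrying each one up to maxAttempts times.
async function runPipeline(
  stages: Stage[],
  initial: PipelineContext,
  maxAttempts = 2
): Promise<PipelineContext> {
  let ctx = initial;
  for (const stage of stages) {
    let succeeded = false;
    let lastError: unknown;
    for (let attempt = 1; attempt <= maxAttempts && !succeeded; attempt++) {
      try {
        ctx = await stage.run(ctx);
        succeeded = true;
      } catch (err) {
        lastError = err;
      }
    }
    if (!succeeded) {
      throw new Error(
        `Stage "${stage.name}" failed after ${maxAttempts} attempts: ${String(lastError)}`
      );
    }
  }
  return ctx;
}
```

Each stage would wrap one of the tools from this catalog (an LLM call for the script, a Pexels search for assets, FFmpeg for composition), which keeps individual tools swappable without touching the runner.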
Quick-reference table of every resource in this catalog.
Stock Video (4 free, 4 paid)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 1 | Pexels | pexels.com | Yes | Yes | 4/5 |
| 2 | Pixabay | pixabay.com | Yes | Yes | 3/5 |
| 3 | Coverr | coverr.co | Yes | No | 4/5 |
| 4 | Mixkit | mixkit.co | Yes | No | 4/5 |
| 5 | Storyblocks | storyblocks.com | No | Enterprise | 4/5 |
| 6 | Artgrid | artgrid.io | No | No | 5/5 |
| 7 | Envato Elements | elements.envato.com | No | No | 4/5 |
| 8 | iStock/Getty | istockphoto.com | No | Yes | 5/5 |
Stock Music & SFX (5 free, 4 paid)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 9 | Pixabay Music | pixabay.com/music | Yes | Yes | 3/5 |
| 10 | Mixkit Music | mixkit.co/free-stock-music | Yes | No | 4/5 |
| 11 | YouTube Audio Library | YouTube Studio | Yes (YT only) | No | 3/5 |
| 12 | Freesound | freesound.org | Yes (CC) | Yes | 4/5 |
| 13 | Uppbeat | uppbeat.io | Yes (attribution) | No | 4/5 |
| 14 | Epidemic Sound | epidemicsound.com | No | Yes | 5/5 |
| 15 | Artlist | artlist.io | No | No | 5/5 |
| 16 | Soundstripe | soundstripe.com | No | No | 4/5 |
| 17 | Bensound | bensound.com | Partial | No | 3/5 |
Stock Images & Graphics (6)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 18 | Pexels | pexels.com | Yes | Yes | 4/5 |
| 19 | Unsplash | unsplash.com | Yes | Yes | 5/5 |
| 20 | Pixabay | pixabay.com | Yes | Yes | 3/5 |
| 21 | Freepik | freepik.com | Partial | No | 4/5 |
| 22 | Canva | canva.com | Yes | No | 4/5 |
| 23 | Figma | figma.com | Yes | REST | 5/5 |
TTS / Voice (8)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 24 | ElevenLabs | elevenlabs.io | 10K chars | Yes | 5/5 |
| 25 | Play.ht | play.ht | No | Yes | 4/5 |
| 26 | Amazon Polly | aws.amazon.com/polly | 5M chars/mo | Yes | 3/5 |
| 27 | Google Cloud TTS | cloud.google.com/text-to-speech | 4M chars/mo | Yes | 4/5 |
| 28 | edge-tts | github.com/rany2/edge-tts | Unlimited | N/A | 4/5 |
| 29 | Coqui TTS / XTTS | github.com/coqui-ai/TTS | Unlimited | N/A | 4/5 |
| 30 | Bark | github.com/suno-ai/bark | Unlimited | N/A | 4/5 |
| 31 | Piper | github.com/rhasspy/piper | Unlimited | N/A | 3/5 |
Music Generation AI (4)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 32 | Suno | suno.com | 50 credits/day | 3rd party | 5/5 |
| 33 | Udio | udio.com | 10 daily + 100/mo | No | 4/5 |
| 34 | Stable Audio | stableaudio.com | Limited | No | 4/5 |
| 35 | MusicGen/AudioCraft | github.com/facebookresearch/audiocraft | Unlimited | Replicate | 4/5 |
Script Generation / LLMs (5)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 36 | Claude API | anthropic.com | No | Yes | 5/5 |
| 37 | Gemini API | ai.google.dev | Yes | Yes | 4/5 |
| 38 | OpenRouter | openrouter.ai | No | Yes | 4/5 |
| 39 | Cloudflare Workers AI | developers.cloudflare.com | Yes | Yes | 3/5 |
| 40 | Ollama | ollama.ai | Unlimited | Local | 3-4/5 |
Video Composition (6)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 41 | Creatomate | creatomate.com | 50 credits | Yes | 5/5 |
| 42 | Shotstack | shotstack.io | 20 min/mo | Yes | 4/5 |
| 43 | JSON2Video | json2video.com | 600 sec | Yes | 3/5 |
| 44 | Remotion | remotion.dev | Yes | Self-hosted | 5/5 |
| 45 | FFmpeg | ffmpeg.org | Yes | CLI | 5/5 |
| 46 | MoviePy | zulko.github.io/moviepy | Yes | Python | 3/5 |
Templates (6)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 47 | Canva | canva.com | Yes | No | 4/5 |
| 48 | CapCut | capcut.com | Yes | No | 4/5 |
| 49 | InVideo AI | invideo.io | Yes | No | 4/5 |
| 50 | Envato Elements | elements.envato.com | No | No | 4/5 |
| 51 | Creatomate | creatomate.com | 50 credits | Yes | 5/5 |
| 52 | Motion Array | motionarray.com | Limited | No | 4/5 |
Captions (6)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 53 | CapCut | capcut.com | Yes | No | 4/5 |
| 54 | Kapwing | kapwing.com | 10 min/mo | No | 4/5 |
| 55 | Descript | descript.com | 1 hr | No | 4/5 |
| 56 | Whisper | github.com/openai/whisper | Unlimited | Local | 4/5 |
| 57 | WhisperX | github.com/m-bain/whisperX | Unlimited | Local | 5/5 |
| 58 | whisper-timestamped | github.com/linto-ai/whisper-timestamped | Unlimited | Local | 4/5 |
Thumbnails (6)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 59 | Canva | canva.com | Yes | No | 4/5 |
| 60 | Figma | figma.com | Yes | REST | 5/5 |
| 61 | Thumbly | thumbly.ai | $0.01/each | No | 3/5 |
| 62 | Midjourney | midjourney.com | No | No | 5/5 |
| 63 | DALL-E | openai.com/dall-e | Via API | Yes | 4/5 |
| 64 | Flux | blackforestlabs.ai | Replicate | Yes | 5/5 |
Trend Discovery (7)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 65 | Virlo | virlo.ai | No | Yes | 5/5 |
| 66 | Google Trends | trends.google.com | Yes | Scrapers | 3/5 |
| 67 | TikTok Creative Center | ads.tiktok.com | Yes | 3rd party | 4/5 |
| 68 | YouTube Trending | youtube.com | Yes | YT API | 3/5 |
| 69 | Exploding Topics | explodingtopics.com | Limited | Pro only | 4/5 |
| 70 | VidIQ | vidiq.com | Yes | No | 4/5 |
| 71 | TubeBuddy | tubebuddy.com | Yes | No | 4/5 |
Distribution (6)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 72 | Buffer | buffer.com | 3 channels | No | 4/5 |
| 73 | Later | later.com | Limited | No | 3/5 |
| 74 | Repurpose.io | repurpose.io | No | No | 4/5 |
| 75 | YouTube Data API | developers.google.com | 10K units/day | Yes | 5/5 |
| 76 | TikTok API | developers.tiktok.com | Limited | Yes | 2/5 |
| 77 | Meta Graph API | developers.facebook.com | Yes | Yes | 3/5 |
All-in-One Pipeline (8)
| # | Resource | URL | Free | API | Rating |
|---|---|---|---|---|---|
| 78 | AutoShorts.ai | autoshorts.ai | 1 video | No | 3/5 |
| 79 | InVideo AI | invideo.io | Yes | No | 4/5 |
| 80 | Pictory | pictory.ai | 14-day trial | Team+ | 4/5 |
| 81 | Fliki | fliki.ai | Yes | No | 3/5 |
| 82 | Vizard.ai | vizard.ai | 120 min/mo | No | 4/5 |
| 83 | OpusClip | opus.pro | 60 credits/mo | Business | 4/5 |
| 84 | ShortsDaily | shortsdaily.com | 500 credits | No | 3/5 |
| 85 | Syllaby | syllaby.io | No | No | 3/5 |
Monetization (6)
| # | Resource | URL | Free | Threshold | Rating |
|---|---|---|---|---|---|
| 86 | YouTube Partner | youtube.com/creators | Yes | 1K subs + 4K hrs | 5/5 |
| 87 | TikTok Creator Rewards | tiktok.com | Yes | 10K followers | 3/5 |
| 88 | Content Rewards | contentrewards.com | Yes | None | 4/5 |
| 89 | Instagram Bonuses | instagram.com | Invite only | Invite only | 2/5 |
| 90 | Grin | grin.co | No | Brand-dependent | 3/5 |
| 91 | Aspire | aspire.io | No | ~$2K/mo | 3/5 |
Total: 91 resources across 14 categories.
Official Documentation
- Pexels API Documentation: Video and photo search API with orientation filters
- Pixabay API Documentation: Multi-asset search (images, videos, music)
- Freesound API: Creative Commons sound effects API
- ElevenLabs API Docs: Text-to-speech and voice cloning API
- Amazon Polly Pricing: AWS TTS pricing tiers
- Google Cloud TTS Pricing: GCP TTS pricing
- Creatomate API Introduction: Cloud video rendering API
- Shotstack API: JSON-based cloud video editing API
- JSON2Video API: Credit-based video generation API
- Remotion Documentation: React-based video with Lambda rendering
- Remotion Lambda Cost: Lambda rendering cost examples
- YouTube Data API v3: Programmatic video upload and management
- YouTube Upload Guide: Video upload API guide
- Epidemic Sound API: AI-powered music licensing API
- Virlo Developer API: Short-form video trend intelligence API
- Storyblocks API: Enterprise stock media API
- AudioCraft by Meta: Open-source audio generation
- MusicGen on GitHub: Music generation model
- Coverr License: Free stock video commercial license terms
- Mixkit License: Free assets license terms
- Suno Pricing: AI music generation plans
- Udio Pricing: AI music generation plans
- Buffer Pricing: Social media scheduling plans
- OpusClip Pricing: AI video clipping plans
- Vizard Pricing: Long-to-short video plans
- Pictory Pricing: AI video creation plans
- InVideo Plans: InVideo AI pricing details
- Kapwing Pricing: Online video editor plans
- Descript Pricing: Transcription-based editor plans
Open Source Tools
- edge-tts (Python): Free Microsoft Edge TTS, no API key required
- edge-tts-universal (npm): TypeScript port for Node.js
- openai-edge-tts: OpenAI-compatible endpoint wrapper
- Coqui TTS: Deep learning TTS toolkit, 1100+ languages
- XTTS v2: Zero-shot voice cloning from 6 seconds of audio
- Bark by Suno: Transformer text-to-audio with non-speech sounds
- Piper TTS: Fast local neural TTS (v1.4.1, Feb 2026)
- Whisper by OpenAI: Speech recognition model
- WhisperX: Word-level timestamps and speaker diarization
- whisper-timestamped: DTW-based word timestamps
- Remotion: Make videos programmatically with React
- FFmpeg: Universal multimedia framework
- MoviePy: Python video editing library
- MusicGen on Replicate: Cloud API for Meta's music generation
Comparison Articles and Reviews
- 7 Best Video Editing APIs (2026): Plainly's comparison of video APIs
- Creatomate Review 2026: Detailed Creatomate assessment
- Shotstack Review 2026: Detailed Shotstack assessment
- Best AI Video Generation APIs: Creatomate's API comparison
- VidIQ vs TubeBuddy: YouTube SEO tool comparison
- Best AI Music Generators 2026: Suno vs Udio vs ElevenLabs
- 6 Popular Open-Source TTS Models 2026: Open-source TTS comparison
- Best Open-Source TTS Models 2026: BentoML's TTS survey
- ElevenLabs Plans Breakdown: Detailed pricing analysis
- Suno Pricing 2026: Free vs Pro vs Premier analysis
- Suno Commercial License: License terms analysis
- Udio Pricing Review: Plans and real cost per song
- OpusClip Review 2026: Detailed OpusClip assessment
- Repurpose.io Review 2026: Cross-posting tool review
- Buffer Pricing 2026: Plan comparison
- AutoShorts.ai Review: Honest faceless video assessment
- Pixabay Music Review: Licensing risk analysis
- UGC Rates 2026: Creator pricing guide
- TikTok Trend Discovery 2026: How to find viral trends
- Virlo Trends API Launch: API announcement
- YouTube API Guide 2026: Data API v3 setup and quotas
- YouTube Upload API Guide: Programmatic upload walkthrough
Last updated: March 2026. Prices and features change frequently. Verify current pricing at each resource's official site before committing.