Content Rewards Clipping Guide

Org Status: 🟡 Dormant Cloudflare: N/A Last Audited: 2026-04-28

Content Rewards is a pay-per-view marketplace where brands fund campaigns and creators (called “clippers”) earn $1-2.50 per 1,000 views by posting short-form video content on TikTok, Instagram Reels, and YouTube Shorts. Unlike traditional influencer marketing, you don’t need a following, a product, or even a camera. You clip, post, submit, and get paid based on actual views.

This guide covers the business model end-to-end — from your first $50 payout to building an automated pipeline that produces and posts AI-generated UGC at scale. The target audience is developers and indie hackers who can write code, not just drag-and-drop editors.

What you’ll learn:

How the Content Rewards marketplace works (for clippers AND for brands funding campaigns)
The exact step-by-step process from sign-up to first payout
Five content formats ranked by effort vs. earnings potential
How to build an AI automation pipeline: LLM scripts, AI avatars, TTS, stock footage, FFmpeg composition
Real earnings data — what outliers make, what most people actually make, and what’s realistic
When to use Content Rewards as a brand to market your own products
Full cost analysis: manual vs. AI-assisted vs. fully automated production
Code examples you can run today

The Problem: Why Traditional Creator Monetization Is Broken
What Is Content Rewards?
The Clipping Business Model
How It Works: Step by Step
Content Formats That Work
Real Earnings Data
The AI Automation Pipeline
Script Generation with LLMs
AI UGC Avatars
Text-to-Speech: Paid vs. Open Source
Stock Footage APIs
Video Composition: FFmpeg, Remotion, and Creatomate
Auto-Posting Pipeline Architecture
Local GPU Rendering
Using Content Rewards as a Brand
Cost Analysis: Manual vs. AI-Assisted vs. Fully Automated
Platform Comparison: Content Rewards vs. Everything Else
Anti-Patterns
The Scaling Playbook
References

The creator economy has a structural problem. Every major monetization path requires you to build an audience first, then figure out how to extract revenue from it. This creates a chicken-and-egg problem that kills most aspiring creators before they earn their first dollar.

The Audience-First Trap

TikTok Creator Rewards Program requires 10,000 followers and 100,000 views in the last 30 days just to apply. Even then, it pays $0.40-$1.00 per 1,000 views — and only for original videos over 1 minute long. Most creators spend 6-12 months building to eligibility while earning exactly $0.

YouTube AdSense requires 1,000 subscribers and 4,000 watch hours. The average RPM is $3-15 per 1,000 views, but reaching the threshold takes most channels 12-18 months of consistent posting. During that entire period: $0.

Instagram Reels pays $0.07-$0.50 per 1,000 views through its bonus program — when it’s even available. The program is invite-only and has been paused and restarted multiple times.

Freelance UGC creation pays $150-2,000 per video, but requires a portfolio, client relationships, invoicing, revisions, and creative direction calls. It’s a service business with all the overhead that implies.

What Changes If You Get This Right

Content Rewards inverts the model. Brands pre-fund campaigns. You create content and post it. Views are tracked automatically. Payouts happen instantly. No follower count required. No invoicing. No client management. No waiting months to qualify.

The question becomes: can you make content that gets views? If yes, you get paid. If no, you iterate.

Key insight: Content Rewards decouples earning from audience-building. You don’t need followers — you need views. And views are a function of content quality and volume, both of which are automatable.

Content Rewards is a performance-based marketplace built on the Whop platform. It connects brands who want organic reach with creators who can produce and distribute short-form video content.

The Core Loop

Brand funds campaign ($1K-$50K budget)
    → Sets payout rate ($1-5 per 1K views)
    → Defines content guidelines
    → Creators browse and join campaigns
    → Creators produce content (clips, UGC, reactions)
    → Creators post on TikTok / IG / YouTube
    → Creators submit post URL to Content Rewards
    → Platform verifies views hourly
    → Creators get paid automatically
    → Brand gets organic reach at $1-5 CPM vs. $25+ CPM for ads

Key Platform Details

Detail	Value
Platform fee	7% of clipper payouts
Payout frequency	Weekly (every 7 days)
Payout method	Instant via Whop (Stripe)
Minimum payout	Varies by campaign
Submission limit	Unlimited per campaign
Follower requirement	None
Account requirement	Free Whop account
Content ownership	Creator retains ownership
Geographic restrictions	Some campaigns restrict view geography (e.g., US/UK/CA/AU/NZ only)

How It Differs from Traditional Influencer Marketing

Traditional influencer deals: Brand pays $5,000 flat fee to creator with 100K followers. Creator posts one video. Maybe it gets 50K views ($100 CPM). Maybe it gets 500K views ($10 CPM). Brand takes all the risk.

Content Rewards: Brand deposits $5,000 into campaign at $2/1K views. 200 creators each post 3 videos. The ones that get views earn money. The ones that don’t cost the brand nothing. Brand gets exactly 2,500,000 views for their $5,000 ($2 CPM). Risk shifts to creators.

Key insight: Content Rewards is arbitrage. Brands pay $1-5 per 1K views for organic-looking content. Facebook/Instagram ads cost $25+ per 1K impressions. The 5-25x cost advantage is what funds the entire ecosystem.

Why It Works

The clipping model is attractive because it strips away nearly every barrier to entry in the creator economy:

No product needed. You’re promoting someone else’s brand/product/content. You never handle inventory, customer service, or refunds.

No audience needed. Content Rewards doesn’t require follower counts. A brand-new TikTok account with 0 followers can earn if the content gets views.

No invoicing. Payments are automated through the platform. You never send an invoice, chase a payment, or negotiate a rate.

No creative direction calls. Campaign briefs are self-serve. You read the requirements, create content, submit. No Zoom calls.

No exclusivity. You can work multiple campaigns simultaneously. One video might serve 1 campaign, but you can have 50 campaigns running at once across different brands.

The Unit Economics

A single video takes 15-60 minutes to produce manually (depending on format). At a $2/1K views payout rate:

Views	Earnings	Time to Produce (Manual)	Effective Hourly Rate
1,000	$2	30 min	$4/hr
10,000	$20	30 min	$40/hr
100,000	$200	30 min	$400/hr
1,000,000	$2,000	30 min	$4,000/hr

The variance is extreme. Most videos get 1,000-10,000 views. A few go viral. The business model works because:

Cost per video approaches zero with automation — AI can produce videos for $0.10-$2.00 each
Volume compensates for variance — 100 videos at 5,000 average views = 500K views = $1,000 at $2 CPM
Long tail earnings — Videos continue earning views (and money) for weeks after posting

Revenue Math at Scale

Assumptions:
- 10 videos/day (automated pipeline)
- Average 5,000 views per video
- $2/1K views payout rate
- 30 days/month

Daily: 10 videos × 5,000 views × $2/1K = $100/day
Monthly: $100 × 30 = $3,000/month
Annual: $3,000 × 12 = $36,000/year

Cost (automated):
- AI avatar/TTS: ~$1/video = $300/month
- Stock footage APIs: Free (Pexels/Pixabay)
- Compute: $0 (local GPU) or ~$50/month (cloud)
- Total: ~$350/month

Net profit: ~$2,650/month = ~$31,800/year

This is the “realistic automation” scenario. Not the $45K/36-hours outlier. Not the $50/month beginner. The achievable middle ground for someone who builds the pipeline.

Step 1: Create a Whop Account

Go to whop.com and sign up for a free account. You’ll need:

Email address
Payment method (for receiving payouts via Stripe)
Social media accounts (TikTok, Instagram, YouTube)

Step 2: Browse Active Campaigns

Navigate to Content Rewards Discovery or the Whop Content Rewards hub. You’ll see active campaigns with:

Brand name — who’s paying
Budget remaining — how much money is left in the campaign
Payout rate — $ per 1,000 views (typically $1-5)
Max payout per clip — cap per individual video (e.g., $1,000-$3,000)
Platform requirements — which social platforms are accepted
Content guidelines — what the brand wants (and doesn’t want)
Geographic restrictions — which countries’ views count

Key insight: Only join campaigns with at least 60% of their budget remaining. If a campaign is 90%+ drained, you risk posting content that earns views but doesn’t get paid because the budget ran out.

Step 3: Read the Campaign Brief Carefully

Every campaign has specific requirements. Common ones include:

No AI voiceovers — some brands (like Lovable) explicitly ban obvious AI content
Geographic targeting — views must come from specific countries
Comments enabled — engagement metrics matter
Minimum video length — usually 15-60 seconds
Content type — podcast clips, UGC, reaction videos, tutorials, etc.
Prohibited content — misleading claims, competitor mentions, etc.

Missing a single requirement can get your submission rejected, wasting the views you already generated.

Step 4: Warm Up New Accounts (Critical)

If you’re posting from new social media accounts, spend 3-4 days acting like a normal user before posting campaign content:

Day 1-2: Scroll, like, comment on 20-30 videos in your niche
Day 3: Follow 10-20 accounts in your niche
Day 4: Post your first video

Why: Platforms detect and suppress new accounts that immediately
start posting promotional content. The "warmup" period signals
to the algorithm that you're a real user.

This is not optional. New accounts that skip warmup consistently get shadowbanned, meaning your videos get shown to almost nobody.

Step 5: Create Your Content

Choose a format (see Content Formats That Work). Create the video using any combination of:

Screen recording + voiceover
Stock footage + text overlay
AI avatar + script
Reaction video + commentary
Slideshow + music

Step 6: Post and Submit Within 1 Hour

This is the most critical timing requirement:

Post the video on TikTok/Instagram/YouTube
Copy the post URL
Go to the Content Rewards campaign page
Click “Submit”
Paste the URL and upload the media file
Submit

You only get paid for views that happen after submission. If your video goes viral before you submit, those views don’t count. Submit within 1 hour of posting.

Step 7: Track and Collect

The platform checks views hourly. Payouts are processed weekly. You can track earnings in real-time on the Whop dashboard.

Payout calculation example:
- Payout rate: $2/1K views
- Your video gets 50,000 views
- Campaign max payout per clip: $3,000
- Your earnings: min(50,000 × $2/1,000, $3,000) = $100

Another example (viral clip):
- Payout rate: $2/1K views
- Your video gets 2,000,000 views
- Campaign max payout per clip: $3,000
- Your earnings: min(2,000,000 × $2/1,000, $3,000) = $3,000 (capped)

Step 8: Iterate and Scale

Your first few videos will probably underperform. That’s normal. The iteration loop:

Post 5-10 videos across different formats
Check which ones get traction (1,000+ views in 24 hours)
Double down on the winning format
Increase volume on what works
Automate production of the winning format

Not all content formats perform equally. Here they are ranked by effort-to-earnings ratio, from lowest effort to highest production value.

Format 1: Faceless Slideshows (Lowest Effort)

What it is: A series of images/screenshots with text overlay, background music, and optional voiceover. No camera, no face, no editing skill required.

Example topics:

“5 apps that pay you to do nothing”
“Things you didn’t know about [brand]”
“This AI tool replaced my [job function]”

Production time: 5-15 minutes (manual), < 1 minute (automated)

Typical performance: 1K-50K views. Low ceiling but extremely high volume potential.

Tools needed:
- CapCut (free) or Canva (free tier)
- Stock images from Pexels/Unsplash
- Background music (TikTok library or royalty-free)
- Text: bold, centered, large font

Why it works: TikTok’s algorithm rewards watch time and completion rate. Short slideshows (15-30 seconds) with curiosity-driven text get high completion rates. The algorithm doesn’t care if you showed your face.

Format 2: Screen Recording + Voiceover

What it is: Record your screen showing a product, tool, or process. Add voiceover explaining what’s happening.

Example topics:

“I tested [brand’s product] for 7 days”
“Watch me build a website with [tool] in 60 seconds”
“This is what [app] looks like from the inside”

Production time: 15-30 minutes (manual), 5-10 minutes (semi-automated)

Typical performance: 5K-200K views. Higher ceiling because it demonstrates real product usage.

Tools needed:
- Screen recorder (OBS free, or QuickTime on Mac)
- Audio editor or TTS for voiceover
- CapCut for assembly + captions

Format 3: “Did You Know” / Facts Format

What it is: Quick-fire facts or surprising information about a topic, presented with kinetic text, stock footage, and an engaging voiceover.

Example topics:

“Did you know this AI can clone your voice in 5 seconds?”
“3 things nobody tells you about [product]”
“I found out [brand] has a secret feature”

Production time: 20-40 minutes (manual), 2-5 minutes (automated)

Typical performance: 10K-500K views. This format has the highest viral potential for faceless content.

Pipeline:
1. LLM generates 5 "hook + fact + CTA" scripts
2. TTS converts script to voiceover
3. Stock footage matched to keywords
4. FFmpeg composites video + audio + text overlay
5. Output: 5 ready-to-post videos

Format 4: Reaction Videos

What it is: Record yourself reacting to the brand’s existing content. Show genuine surprise, interest, or commentary.

Production time: 15-30 minutes per video

Typical performance: 10K-1M+ views. Human faces dramatically increase engagement.

Limitation: Cannot be fully automated (requires a real human reacting). Can be semi-automated with templates, auto-captions, and batch processing.

Format 5: AI UGC Avatars (Highest Automation Potential)

What it is: An AI-generated human avatar delivers a scripted message about the brand’s product. Looks like a real person talking to camera.

Production time: 2-5 minutes per video (once pipeline is built)

Typical performance: 5K-100K views. Performance depends heavily on avatar quality.

Critical caveat: Some campaigns explicitly ban AI-generated content. Read the brief before using this format.

Pipeline:
1. LLM generates script
2. AI avatar service renders video (HeyGen, Arcads, Creatify)
3. Add captions + background music
4. Post and submit

Format Comparison

Format	Effort	Automation	Avg. Views	Scalability	Campaign Restrictions
Faceless Slideshow	Very Low	Full	1K-50K	Excellent	Rarely restricted
Screen Recording	Low	Partial	5K-200K	Good	Rarely restricted
”Did You Know”	Medium	Full	10K-500K	Excellent	Rarely restricted
Reaction Video	Medium	None	10K-1M+	Poor	Never restricted
AI UGC Avatar	Low	Full	5K-100K	Excellent	Sometimes banned

Key insight: The “Did You Know” format hits the sweet spot of high automation potential, high average views, and few campaign restrictions. It’s the format most suited to the developer/automation approach.

Let’s be honest about what people actually earn. The outliers get all the attention. The median earner makes far less.

The Outliers (Top 0.1%)

@jessieclipping — Reported earning $45K in 36 hours from Content Rewards. This tweet received 15,489 bookmarks, making it one of the most-saved clipping posts on X. This is not a realistic benchmark. This is the equivalent of winning a lottery ticket — it happened because multiple videos went massively viral simultaneously during a high-budget campaign.

In a more instructive post, @jessieclipping reported earning $340 in her first month — a much more realistic entry point.

@reyaffrev — Reports earning $50K+ total at age 17, with $1,600/week from reposting clips. This represents consistent high performance over months, not a single viral moment. Still top 1% territory.

Agency model — The clippa.net guide documents an agency owner with $664,000 in lifetime Stripe revenue from managing multiple clipping accounts and campaigns.

The Realistic Middle (Top 10-30%)

Based on aggregated data from multiple sources:

Timeline	Realistic Earnings	Assumptions
Month 1	$50-200	2-3 videos/day, learning what works
Month 2	$200-500	5 videos/day, one format dialed in
Month 3	$500-1,000	10 videos/day, multi-platform posting
Month 6	$1,000-3,000	Automated pipeline, 15+ videos/day
Year 1	$3,000-10,000/month	Full automation, multiple campaigns

Source: The Complete Whop Clipping Guide

The Median Earner (Most People)

Most clippers earn $50-200/month. That’s $600-$2,400/year from a side hustle with zero startup costs. Not life-changing, but not nothing.

The clippers who stay at this level typically:

Post inconsistently (a few videos per week, not per day)
Use only one platform (missing the 3x multiplier from cross-posting)
Don’t iterate on formats (keep posting what doesn’t work)
Don’t read campaign briefs carefully (submissions get rejected)

Campaign-Level Data

Lovable AI campaign:

Budget: $10,000
Payout rate: $2 per 1K views
Requirements: No AI voiceovers, US/UK/CA/AU/NZ views only, comments enabled
Content types: Podcast clips and UGC

Hostage Tape campaign:

Result: Sold out entire inventory
2,000+ Whop community members
~400 clips produced by creators

Lil Baby music campaign:

Payout rate: $0.30 per 1K views
Result: 6,000+ fans joined, 2,500+ TikTok videos created

High-CPM campaigns:

Iman Gadzhi: $5-$50 per 1K views plus up to $50K in bonus rewards
Whop itself: $50 RPM for long-form content about the Whop platform

The Honest Take

Earnings distribution (estimated):

Bottom 50%:  $0 - $100/month   (gave up or post < 5 videos/week)
50th-75th:   $100 - $500/month (consistent posting, one platform)
75th-90th:   $500 - $2,000/month (multi-platform, 5+ videos/day)
90th-95th:   $2,000 - $5,000/month (automation + multiple campaigns)
95th-99th:   $5,000 - $20,000/month (agency model or viral consistency)
Top 1%:      $20,000+/month (viral outliers, huge scale)

The people making $10K+/month aren’t hoping for one video to blow up. They’re posting 100+ clips per week, tracking what performs, and systemizing the entire process. Which is exactly what automation enables.

Here’s the architecture for a fully automated content production pipeline. Each component is replaceable — use the specific tools that fit your budget and technical skill.

Pipeline Architecture

┌─────────────────────────────────────────────────────────────────┐
│                    CONTENT PRODUCTION PIPELINE                   │
├─────────────────────────────────────────────────────────────────┤
│                                                                  │
│  ┌──────────┐    ┌──────────┐    ┌──────────┐    ┌──────────┐  │
│  │  Campaign │    │  Script  │    │  Media   │    │  Video   │  │
│  │  Scanner  │───▶│Generator │───▶│ Fetcher  │───▶│Compositor│  │
│  │          │    │  (LLM)   │    │(Pexels/  │    │(FFmpeg/  │  │
│  │          │    │          │    │ Pixabay) │    │Remotion) │  │
│  └──────────┘    └──────────┘    └──────────┘    └──────────┘  │
│                                                       │          │
│                                                       ▼          │
│  ┌──────────┐    ┌──────────┐    ┌──────────┐    ┌──────────┐  │
│  │ Analytics │    │  Submit  │    │  Auto    │    │  Audio   │  │
│  │ Tracker  │◀───│  to CR   │◀───│ Poster   │◀───│Generator │  │
│  │          │    │          │    │(API/Bot) │    │  (TTS)   │  │
│  └──────────┘    └──────────┘    └──────────┘    └──────────┘  │
│                                                                  │
└─────────────────────────────────────────────────────────────────┘

Component Breakdown

Component	Purpose	Options (Free → Paid)
Campaign Scanner	Find high-budget campaigns	Manual → Whop API scraping
Script Generator	Write hooks + scripts	Local LLM → GPT-4o/Claude
Media Fetcher	Get stock footage/images	Pexels API (free) → Shutterstock
Audio Generator	TTS voiceover	Piper/Coqui (free) → ElevenLabs
Video Compositor	Assemble final video	FFmpeg (free) → Remotion → Creatomate
Auto Poster	Post to TikTok/IG/YT	Manual → API bots → OpenClaw
Submission	Submit URL to Content Rewards	Manual (required for most)
Analytics	Track views/earnings	Whop dashboard + custom tracking

The script is the most important part of any video. A great script with mediocre visuals outperforms great visuals with a weak script every time.

Script Structure

Every high-performing short-form video follows this structure:

HOOK (0-3 seconds):
  Provocative statement, question, or visual that stops the scroll.
  "This AI tool just replaced my entire marketing team."

BODY (3-25 seconds):
  2-3 key points delivered rapidly.
  Show the product/tool/concept in action.
  Each point should build on the previous one.

CTA (25-30 seconds):
  Clear next step.
  "Link in bio" / "Follow for more" / "Try it free"

TypeScript Script Generator

import Anthropic from "@anthropic-ai/sdk";

interface VideoScript {
  hook: string;
  body: string[];
  cta: string;
  searchTerms: string[]; // for stock footage matching
  duration: number; // estimated seconds
}

interface CampaignBrief {
  brandName: string;
  productDescription: string;
  targetAudience: string;
  keyBenefits: string[];
  restrictions: string[];
  tone: "casual" | "professional" | "excited" | "educational";
}

async function generateScripts(
  brief: CampaignBrief,
  count: number = 5
): Promise<VideoScript[]> {
  const client = new Anthropic();

  const prompt = `Generate ${count} short-form video scripts for a Content Rewards campaign.

Brand: ${brief.brandName}
Product: ${brief.productDescription}
Target audience: ${brief.targetAudience}
Key benefits: ${brief.keyBenefits.join(", ")}
Restrictions: ${brief.restrictions.join(", ")}
Tone: ${brief.tone}

Each script must:
1. Have a hook that stops the scroll (first 3 seconds)
2. Deliver 2-3 key points in the body (3-25 seconds)
3. End with a clear CTA (25-30 seconds)
4. Total duration: 15-30 seconds
5. Include search terms for matching stock footage

Return as JSON array of objects with fields:
hook, body (string array), cta, searchTerms (string array), duration (number)

Focus on "Did You Know" and educational formats.
Avoid clickbait that doesn't deliver.
Each script should take a different angle on the product.`;

  const response = await client.messages.create({
    model: "claude-sonnet-4-20250514",
    max_tokens: 2000,
    messages: [{ role: "user", content: prompt }],
  });

  const text =
    response.content[0].type === "text" ? response.content[0].text : "";
  const jsonMatch = text.match(/\[[\s\S]*\]/);
  if (!jsonMatch) throw new Error("Failed to parse script response");

  return JSON.parse(jsonMatch[0]) as VideoScript[];
}

// Usage
const scripts = await generateScripts({
  brandName: "Lovable",
  productDescription: "AI-powered web app builder",
  targetAudience: "Indie hackers, non-technical founders",
  keyBenefits: [
    "Build full-stack apps with prompts",
    "No coding required",
    "Ships in minutes not months",
  ],
  restrictions: ["No AI voiceovers", "Must show real product usage"],
  tone: "excited",
});

console.log(`Generated ${scripts.length} scripts`);
for (const script of scripts) {
  console.log(`\nHook: "${script.hook}"`);
  console.log(`Duration: ${script.duration}s`);
  console.log(`Stock footage terms: ${script.searchTerms.join(", ")}`);
}

Prompt Engineering Tips for Video Scripts

const HOOK_FORMULAS = [
  // Curiosity gap
  "This {tool} just {unexpected_result} and nobody is talking about it.",
  // Contrarian take
  "Stop {common_advice}. Here's what actually works.",
  // Social proof
  "{number} people switched to {product} last month. Here's why.",
  // Fear of missing out
  "If you're still {old_way}, you're leaving money on the table.",
  // Direct challenge
  "I bet you didn't know {product} could do this.",
];

const BODY_STRUCTURES = {
  // Problem → Solution → Proof
  problemSolutionProof: [
    "State a pain point the audience feels daily",
    "Show how the product solves it (screen recording or demo)",
    "Show a result/metric/testimonial",
  ],
  // Before → After → How
  beforeAfterHow: [
    "Show the old/manual/painful way",
    "Show the result with the product",
    "Quick walkthrough of the process",
  ],
  // Three reasons
  threeReasons: [
    "Reason 1: The obvious benefit",
    "Reason 2: The unexpected benefit",
    "Reason 3: The emotional/status benefit",
  ],
};

Cost Per Script

Provider	Cost per Script	Quality	Speed
Claude 3.5 Sonnet	~$0.003	Excellent	2-3s
GPT-4o	~$0.005	Excellent	2-3s
GPT-4o-mini	~$0.0005	Good	1-2s
Llama 3 (local)	$0	Good	5-10s
Gemini Flash	~$0.001	Good	1-2s

At $0.003/script, generating 100 scripts/day costs $0.30. Script generation is essentially free.

AI-generated avatars deliver scripted content that looks like a real person talking to camera. Quality has improved dramatically — the best tools are nearly indistinguishable from real humans in short-form content.

Tool Comparison

Tool	Starting Price	Per-Video Cost	Avatar Quality	Lip Sync	Gestures	API Available
HeyGen	$29/mo	~$0.50-2.00	Excellent	Excellent	Yes	Yes
Arcads	$100/mo (10 videos)	~$10.00	Best-in-class	Excellent	Natural	Yes
Creatify	$39/mo	~$0.50-1.50	Very Good	Good	Limited	Yes
MakeUGC	Credit-based	~$5-10	Good	Good	Limited	No
Synthesia	$29/mo	~$1-3	Excellent	Excellent	Yes	Yes

Key insight: Creatify delivers 85-90% of Arcads’ quality at 60-70% of the price. For high-volume clipping, Creatify or HeyGen are the best value. Arcads is for when quality matters more than cost.

HeyGen API Example

interface HeyGenVideoRequest {
  video_inputs: Array<{
    character: {
      type: "avatar";
      avatar_id: string;
      avatar_style: "normal" | "circle" | "closeUp";
    };
    voice: {
      type: "text";
      input_text: string;
      voice_id: string;
      speed?: number;
    };
    background?: {
      type: "color" | "image" | "video";
      value: string;
    };
  }>;
  dimension?: { width: number; height: number };
  aspect_ratio?: "16:9" | "9:16" | "1:1";
}

async function generateAvatarVideo(
  script: string,
  avatarId: string
): Promise<string> {
  const response = await fetch("https://api.heygen.com/v2/video/generate", {
    method: "POST",
    headers: {
      "X-Api-Key": process.env.HEYGEN_API_KEY!,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({
      video_inputs: [
        {
          character: {
            type: "avatar",
            avatar_id: avatarId,
            avatar_style: "normal",
          },
          voice: {
            type: "text",
            input_text: script,
            voice_id: "en-US-JennyNeural",
            speed: 1.1, // slightly faster for short-form
          },
        },
      ],
      aspect_ratio: "9:16", // vertical for TikTok/Reels/Shorts
    } satisfies HeyGenVideoRequest),
  });

  const data = await response.json();
  return data.data.video_id; // poll for completion
}

async function pollVideoStatus(videoId: string): Promise<string> {
  while (true) {
    const res = await fetch(
      `https://api.heygen.com/v1/video_status.get?video_id=${videoId}`,
      { headers: { "X-Api-Key": process.env.HEYGEN_API_KEY! } }
    );
    const data = await res.json();

    if (data.data.status === "completed") {
      return data.data.video_url;
    }
    if (data.data.status === "failed") {
      throw new Error(`Video generation failed: ${data.data.error}`);
    }

    await new Promise((resolve) => setTimeout(resolve, 5000));
  }
}

Important: Campaign Restrictions on AI Content

Many campaigns explicitly ban AI-generated content. The Lovable campaign states “No AI voiceovers or obvious AI content.” Before using AI avatars for any campaign, check the brief.

Campaigns that typically allow AI content:

Tech/SaaS product promotions
Educational content campaigns
“Spread the word” brand awareness campaigns

Campaigns that typically ban AI content:

Lifestyle/authenticity-focused brands
Food/beauty/fashion brands
Campaigns specifically requesting “real UGC”

TTS is the audio backbone of faceless content. You need natural-sounding voiceover that doesn’t trigger the “this is AI” detector in viewers’ brains.

Paid Options

Service	Cost	Quality	Latency	Languages	API
ElevenLabs	$5/mo starter, ~$180/1M chars	Best-in-class	1-3s	29+	Yes
OpenAI TTS	$15/1M chars	Very Good	1-2s	50+	Yes
Google Cloud TTS	$4/1M chars (standard)	Good	<1s	40+	Yes
Amazon Polly	$4/1M chars	Good	<1s	30+	Yes

Open Source Options (Free, Run Locally)

Project	Quality	Speed (GPU)	Languages	Voice Cloning
Piper	Good	Real-time+	30+	No
Coqui TTS	Very Good	Near real-time	16+	Yes (XTTS-v2)
Bark	Excellent	Slow (10-30s)	10+	Limited
StyleTTS2	Excellent	Real-time	English	Yes

Key insight: For high-volume clipping, OpenAI TTS at $15/1M characters is the best value. A typical 30-second script is ~500 characters, so $15 buys you 2,000 voiceovers. At 10 videos/day, that’s 200 days of production for $15. ElevenLabs sounds better but costs 12x more.

OpenAI TTS Example

import OpenAI from "openai";
import { writeFile } from "node:fs/promises";

const openai = new OpenAI();

async function generateVoiceover(
  text: string,
  outputPath: string,
  voice: "alloy" | "echo" | "fable" | "onyx" | "nova" | "shimmer" = "nova"
): Promise<void> {
  const response = await openai.audio.speech.create({
    model: "tts-1",
    voice,
    input: text,
    speed: 1.1, // slightly faster for short-form engagement
    response_format: "mp3",
  });

  const buffer = Buffer.from(await response.arrayBuffer());
  await writeFile(outputPath, buffer);
}

// Generate voiceovers for multiple scripts
async function batchVoiceover(
  scripts: Array<{ id: string; text: string }>
): Promise<Map<string, string>> {
  const results = new Map<string, string>();

  // Process in parallel batches of 5
  for (let i = 0; i < scripts.length; i += 5) {
    const batch = scripts.slice(i, i + 5);
    await Promise.all(
      batch.map(async (script) => {
        const path = `/tmp/voiceover-${script.id}.mp3`;
        await generateVoiceover(script.text, path);
        results.set(script.id, path);
      })
    );
  }

  return results;
}

Piper (Free, Local) Example

pip install piper-tts

wget https://huggingface.co/rhasspy/piper-voices/resolve/main/en/en_US/amy/medium/en_US-amy-medium.onnx
wget https://huggingface.co/rhasspy/piper-voices/resolve/main/en/en_US/amy/medium/en_US-amy-medium.onnx.json

echo "Did you know this AI tool can build entire websites from a single prompt?" | \
  piper --model en_US-amy-medium.onnx --output_file voiceover.wav

ffmpeg -i voiceover.wav -codec:a libmp3lame -qscale:a 2 voiceover.mp3

TTS Cost Comparison for 100 Videos/Day

Service	Cost/Video	Cost/Day (100)	Cost/Month (3,000)
ElevenLabs	~$0.09	$9.00	$270.00
OpenAI TTS	~$0.0075	$0.75	$22.50
Google Cloud TTS	~$0.002	$0.20	$6.00
Piper (local)	$0.00	$0.00	$0.00
Coqui (local)	$0.00	$0.00	$0.00

Assumes 500 characters per script average.

Stock footage gives faceless content visual variety without needing to record anything. Two APIs dominate the free tier.

Pexels API

Pexels offers completely free access to their entire library of photos and videos. Rate-limited to 200 requests/hour and 20,000 requests/month.

interface PexelsVideo {
  id: number;
  width: number;
  height: number;
  duration: number;
  url: string;
  video_files: Array<{
    id: number;
    quality: "hd" | "sd" | "uhd";
    file_type: string;
    width: number;
    height: number;
    fps: number;
    link: string;
  }>;
}

interface PexelsSearchResponse {
  page: number;
  per_page: number;
  total_results: number;
  videos: PexelsVideo[];
}

async function searchPexelsVideos(
  query: string,
  options: {
    perPage?: number;
    orientation?: "landscape" | "portrait" | "square";
    minDuration?: number;
    maxDuration?: number;
  } = {}
): Promise<PexelsVideo[]> {
  const params = new URLSearchParams({
    query,
    per_page: String(options.perPage ?? 10),
    orientation: options.orientation ?? "portrait", // 9:16 for TikTok
  });

  if (options.minDuration) {
    params.set("min_duration", String(options.minDuration));
  }
  if (options.maxDuration) {
    params.set("max_duration", String(options.maxDuration));
  }

  const response = await fetch(
    `https://api.pexels.com/videos/search?${params}`,
    {
      headers: {
        Authorization: process.env.PEXELS_API_KEY!,
      },
    }
  );

  const data: PexelsSearchResponse = await response.json();
  return data.videos;
}

async function downloadBestQualityVideo(
  video: PexelsVideo,
  outputPath: string
): Promise<void> {
  // Prefer HD portrait video files
  const file =
    video.video_files
      .filter((f) => f.quality === "hd" && f.height > f.width) // portrait
      .sort((a, b) => b.height - a.height)[0] ??
    video.video_files.sort((a, b) => b.height - a.height)[0];

  if (!file) throw new Error(`No video files for video ${video.id}`);

  const response = await fetch(file.link);
  const buffer = Buffer.from(await response.arrayBuffer());
  const { writeFile } = await import("node:fs/promises");
  await writeFile(outputPath, buffer);
}

// Usage: fetch footage matching script search terms
async function fetchFootageForScript(
  searchTerms: string[],
  clipsNeeded: number = 3
): Promise<string[]> {
  const paths: string[] = [];

  for (let i = 0; i < Math.min(searchTerms.length, clipsNeeded); i++) {
    const videos = await searchPexelsVideos(searchTerms[i], {
      perPage: 3,
      orientation: "portrait",
      minDuration: 5,
      maxDuration: 15,
    });

    if (videos.length > 0) {
      const randomIdx = Math.floor(Math.random() * videos.length);
      const path = `/tmp/footage-${i}.mp4`;
      await downloadBestQualityVideo(videos[randomIdx], path);
      paths.push(path);
    }
  }

  return paths;
}

Pixabay API

Pixabay also offers free video access, with a slightly different rate limit structure.

interface PixabayVideo {
  id: number;
  pageURL: string;
  type: string;
  tags: string;
  duration: number;
  videos: {
    large: { url: string; width: number; height: number; size: number };
    medium: { url: string; width: number; height: number; size: number };
    small: { url: string; width: number; height: number; size: number };
    tiny: { url: string; width: number; height: number; size: number };
  };
}

async function searchPixabayVideos(
  query: string,
  perPage: number = 10
): Promise<PixabayVideo[]> {
  const params = new URLSearchParams({
    key: process.env.PIXABAY_API_KEY!,
    q: query,
    video_type: "film",
    per_page: String(perPage),
    safesearch: "true",
  });

  const response = await fetch(
    `https://pixabay.com/api/videos/?${params}`
  );
  const data = await response.json();
  return data.hits as PixabayVideo[];
}

Stock Footage API Comparison

API	Free Tier	Rate Limit	Video Quality	Portrait Videos	Attribution Required
Pexels	Unlimited	200/hr, 20K/mo	Up to 4K	Many	Photographer credit appreciated but not required
Pixabay	Unlimited	100/min	Up to 1080p	Limited	Not required
Unsplash	50/hr	50/hr	Photos only	Photos only	Required
Shutterstock	Paid only	Plan-based	Up to 4K	Many	License required
Storyblocks	Subscription	Plan-based	Up to 4K	Many	No

Key insight: Pexels is the best free option by far. Good portrait video selection, generous rate limits, high quality, and no attribution required for commercial use. Use Pixabay as a backup when Pexels doesn’t have what you need.

This is where all the components come together. You need to combine stock footage, text overlays, voiceover audio, and captions into a final MP4 file optimized for short-form platforms.

Option 1: FFmpeg (Free, Local, Maximum Control)

FFmpeg is the Swiss Army knife of video processing. Every other video tool uses it under the hood. It’s free, runs locally, and gives you complete control.

Basic Composition: Images + Text + Audio → MP4

#!/bin/bash

INPUT_DIR="./assets"
OUTPUT="output.mp4"
AUDIO="voiceover.mp3"
DURATION=30  # total video duration in seconds

AUDIO_DURATION=$(ffprobe -v error -show_entries format=duration \
  -of default=noprint_wrappers=1:nokey=1 "$AUDIO")

ffmpeg -y \
  -loop 1 -t 10 -i "${INPUT_DIR}/slide1.jpg" \
  -loop 1 -t 10 -i "${INPUT_DIR}/slide2.jpg" \
  -loop 1 -t 10 -i "${INPUT_DIR}/slide3.jpg" \
  -i "$AUDIO" \
  -filter_complex "
    [0:v]scale=1080:1920:force_original_aspect_ratio=decrease,
      pad=1080:1920:(ow-iw)/2:(oh-ih)/2:black,
      drawtext=text='Did you know?':
        fontsize=72:fontcolor=white:
        x=(w-text_w)/2:y=200:
        fontfile=/System/Library/Fonts/Helvetica.ttc:
        enable='between(t,0,3)'[v0];
    [1:v]scale=1080:1920:force_original_aspect_ratio=decrease,
      pad=1080:1920:(ow-iw)/2:(oh-ih)/2:black,
      drawtext=text='This AI tool builds apps':
        fontsize=64:fontcolor=white:
        x=(w-text_w)/2:y=200:
        fontfile=/System/Library/Fonts/Helvetica.ttc[v1];
    [2:v]scale=1080:1920:force_original_aspect_ratio=decrease,
      pad=1080:1920:(ow-iw)/2:(oh-ih)/2:black,
      drawtext=text='Try it free - link in bio':
        fontsize=64:fontcolor=yellow:
        x=(w-text_w)/2:y=200:
        fontfile=/System/Library/Fonts/Helvetica.ttc[v2];
    [v0][v1][v2]concat=n=3:v=1:a=0[outv]
  " \
  -map "[outv]" -map 3:a \
  -c:v libx264 -preset fast -crf 23 \
  -c:a aac -b:a 128k \
  -shortest \
  -movflags +faststart \
  "$OUTPUT"

echo "Created: $OUTPUT"

Advanced: Stock Footage + Animated Captions + Audio

#!/bin/bash

FOOTAGE1="footage-0.mp4"
FOOTAGE2="footage-1.mp4"
FOOTAGE3="footage-2.mp4"
AUDIO="voiceover.mp3"
OUTPUT="final.mp4"

for i in 0 1 2; do
  ffmpeg -y -i "footage-${i}.mp4" \
    -vf "scale=1080:1920:force_original_aspect_ratio=increase,crop=1080:1920" \
    -c:v libx264 -preset fast -crf 23 \
    -an \
    "scaled-${i}.mp4"
done

for i in 0 1 2; do
  ffmpeg -y -i "scaled-${i}.mp4" -t 10 -c copy "trimmed-${i}.mp4"
done

echo "file 'trimmed-0.mp4'" > concat.txt
echo "file 'trimmed-1.mp4'" >> concat.txt
echo "file 'trimmed-2.mp4'" >> concat.txt

ffmpeg -y -f concat -safe 0 -i concat.txt \
  -c copy "combined-footage.mp4"

ffmpeg -y \
  -i "combined-footage.mp4" \
  -i "$AUDIO" \
  -filter_complex "
    [0:v]drawtext=text='This changes everything':
      fontsize=56:fontcolor=white:borderw=3:bordercolor=black:
      x=(w-text_w)/2:y=h-300:
      fontfile=/System/Library/Fonts/Helvetica.ttc:
      enable='between(t,0,5)',
    drawtext=text='Here\\'s how it works':
      fontsize=56:fontcolor=white:borderw=3:bordercolor=black:
      x=(w-text_w)/2:y=h-300:
      fontfile=/System/Library/Fonts/Helvetica.ttc:
      enable='between(t,5,15)',
    drawtext=text='Try it now - link in bio':
      fontsize=56:fontcolor=yellow:borderw=3:bordercolor=black:
      x=(w-text_w)/2:y=h-300:
      fontfile=/System/Library/Fonts/Helvetica.ttc:
      enable='between(t,15,30)'[outv]
  " \
  -map "[outv]" -map 1:a \
  -c:v libx264 -preset fast -crf 23 \
  -c:a aac -b:a 128k \
  -shortest \
  -movflags +faststart \
  "$OUTPUT"

rm -f scaled-*.mp4 trimmed-*.mp4 combined-footage.mp4 concat.txt
echo "Created: $OUTPUT"

TypeScript FFmpeg Wrapper

import { execSync, exec } from "node:child_process";
import { writeFile, unlink } from "node:fs/promises";
import { join } from "node:path";

interface TextOverlay {
  text: string;
  startTime: number;
  endTime: number;
  fontSize?: number;
  color?: string;
  position?: "top" | "center" | "bottom";
}

interface CompositionConfig {
  footageClips: string[]; // paths to video files
  audioPath: string; // path to voiceover
  overlays: TextOverlay[];
  outputPath: string;
  resolution?: { width: number; height: number };
}

function buildDrawTextFilter(overlays: TextOverlay[]): string {
  return overlays
    .map((o) => {
      const y =
        o.position === "top"
          ? "200"
          : o.position === "center"
            ? "(h-text_h)/2"
            : "h-300";
      const fontSize = o.fontSize ?? 56;
      const color = o.color ?? "white";
      const escaped = o.text.replace(/'/g, "\\'").replace(/:/g, "\\:");

      return `drawtext=text='${escaped}':fontsize=${fontSize}:fontcolor=${color}:borderw=3:bordercolor=black:x=(w-text_w)/2:y=${y}:fontfile=/System/Library/Fonts/Helvetica.ttc:enable='between(t,${o.startTime},${o.endTime})'`;
    })
    .join(",");
}

async function composeVideo(config: CompositionConfig): Promise<string> {
  const { width, height } = config.resolution ?? {
    width: 1080,
    height: 1920,
  };
  const tmpDir = "/tmp/compose";
  execSync(`mkdir -p ${tmpDir}`);

  // Scale and trim each footage clip
  const scaledClips: string[] = [];
  const clipDuration = 10; // seconds per clip

  for (let i = 0; i < config.footageClips.length; i++) {
    const scaled = join(tmpDir, `scaled-${i}.mp4`);
    execSync(
      `ffmpeg -y -i "${config.footageClips[i]}" ` +
        `-vf "scale=${width}:${height}:force_original_aspect_ratio=increase,crop=${width}:${height}" ` +
        `-t ${clipDuration} -c:v libx264 -preset fast -crf 23 -an "${scaled}"`,
      { stdio: "pipe" }
    );
    scaledClips.push(scaled);
  }

  // Create concat file
  const concatFile = join(tmpDir, "concat.txt");
  const concatContent = scaledClips
    .map((p) => `file '${p}'`)
    .join("\n");
  await writeFile(concatFile, concatContent);

  // Concatenate
  const combined = join(tmpDir, "combined.mp4");
  execSync(
    `ffmpeg -y -f concat -safe 0 -i "${concatFile}" -c copy "${combined}"`,
    { stdio: "pipe" }
  );

  // Add overlays + audio
  const drawText = buildDrawTextFilter(config.overlays);
  const filterComplex = drawText
    ? `-filter_complex "[0:v]${drawText}[outv]" -map "[outv]" -map 1:a`
    : `-map 0:v -map 1:a`;

  execSync(
    `ffmpeg -y -i "${combined}" -i "${config.audioPath}" ` +
      `${filterComplex} ` +
      `-c:v libx264 -preset fast -crf 23 ` +
      `-c:a aac -b:a 128k -shortest -movflags +faststart ` +
      `"${config.outputPath}"`,
    { stdio: "pipe" }
  );

  // Cleanup
  for (const f of scaledClips) await unlink(f).catch(() => {});
  await unlink(concatFile).catch(() => {});
  await unlink(combined).catch(() => {});

  return config.outputPath;
}

// Usage
await composeVideo({
  footageClips: [
    "/tmp/footage-0.mp4",
    "/tmp/footage-1.mp4",
    "/tmp/footage-2.mp4",
  ],
  audioPath: "/tmp/voiceover.mp3",
  overlays: [
    {
      text: "Did you know?",
      startTime: 0,
      endTime: 5,
      fontSize: 72,
      color: "white",
      position: "top",
    },
    {
      text: "This AI builds full apps",
      startTime: 5,
      endTime: 15,
      position: "bottom",
    },
    {
      text: "Try it free - link in bio",
      startTime: 15,
      endTime: 30,
      color: "yellow",
      position: "bottom",
    },
  ],
  outputPath: "/tmp/final-video.mp4",
});

Option 2: Remotion (React-Based, Developer-Friendly)

Remotion lets you compose videos using React components. If you’re a TypeScript developer, this is the most natural way to build complex video templates.

// src/UGCVideo.tsx
import { AbsoluteFill, Sequence, Video, Audio, Img } from "remotion";
import { useCurrentFrame, useVideoConfig, interpolate } from "remotion";

interface UGCVideoProps {
  hook: string;
  bodyPoints: string[];
  cta: string;
  footageUrls: string[];
  voiceoverUrl: string;
}

const AnimatedText: React.FC<{
  text: string;
  delay?: number;
}> = ({ text, delay = 0 }) => {
  const frame = useCurrentFrame();
  const { fps } = useVideoConfig();

  const opacity = interpolate(
    frame - delay * fps,
    [0, fps * 0.3],
    [0, 1],
    { extrapolateRight: "clamp" }
  );

  const translateY = interpolate(
    frame - delay * fps,
    [0, fps * 0.3],
    [30, 0],
    { extrapolateRight: "clamp" }
  );

  return (
    <div
      style={{
        opacity,
        transform: `translateY(${translateY}px)`,
        fontSize: 56,
        fontWeight: "bold",
        color: "white",
        textShadow: "2px 2px 8px rgba(0,0,0,0.8)",
        textAlign: "center",
        padding: "0 40px",
        lineHeight: 1.3,
      }}
    >
      {text}
    </div>
  );
};

export const UGCVideo: React.FC<UGCVideoProps> = ({
  hook,
  bodyPoints,
  cta,
  footageUrls,
  voiceoverUrl,
}) => {
  const { fps } = useVideoConfig();
  const segmentDuration = 10 * fps; // 10 seconds per segment

  return (
    <AbsoluteFill style={{ backgroundColor: "black" }}>
      {/* Background footage */}
      {footageUrls.map((url, i) => (
        <Sequence
          key={i}
          from={i * segmentDuration}
          durationInFrames={segmentDuration}
        >
          <Video
            src={url}
            style={{
              width: "100%",
              height: "100%",
              objectFit: "cover",
            }}
          />
        </Sequence>
      ))}

      {/* Text overlays */}
      <AbsoluteFill
        style={{
          justifyContent: "flex-end",
          paddingBottom: 300,
        }}
      >
        {/* Hook */}
        <Sequence from={0} durationInFrames={segmentDuration}>
          <AnimatedText text={hook} />
        </Sequence>

        {/* Body points */}
        {bodyPoints.map((point, i) => (
          <Sequence
            key={i}
            from={(i + 1) * segmentDuration}
            durationInFrames={segmentDuration}
          >
            <AnimatedText text={point} delay={0.2} />
          </Sequence>
        ))}

        {/* CTA */}
        <Sequence
          from={(bodyPoints.length + 1) * segmentDuration}
          durationInFrames={segmentDuration}
        >
          <AnimatedText text={cta} />
        </Sequence>
      </AbsoluteFill>

      {/* Voiceover audio */}
      <Audio src={voiceoverUrl} />
    </AbsoluteFill>
  );
};

// render.ts - Render the video programmatically
import { bundle } from "@remotion/bundler";
import { renderMedia, selectComposition } from "@remotion/renderer";
import path from "node:path";

async function renderUGCVideo(props: {
  hook: string;
  bodyPoints: string[];
  cta: string;
  footageUrls: string[];
  voiceoverUrl: string;
  outputPath: string;
}): Promise<string> {
  const bundled = await bundle({
    entryPoint: path.resolve("./src/index.ts"),
    webpackOverride: (config) => config,
  });

  const composition = await selectComposition({
    serveUrl: bundled,
    id: "UGCVideo",
    inputProps: {
      hook: props.hook,
      bodyPoints: props.bodyPoints,
      cta: props.cta,
      footageUrls: props.footageUrls,
      voiceoverUrl: props.voiceoverUrl,
    },
  });

  await renderMedia({
    composition,
    serveUrl: bundled,
    codec: "h264",
    outputLocation: props.outputPath,
    inputProps: {
      hook: props.hook,
      bodyPoints: props.bodyPoints,
      cta: props.cta,
      footageUrls: props.footageUrls,
      voiceoverUrl: props.voiceoverUrl,
    },
  });

  return props.outputPath;
}

Option 3: Creatomate (Cloud API, No Local Rendering)

Creatomate is a cloud video rendering API. You define templates with dynamic inputs, then trigger renders via API. Starting at $54/month for 2,000 credits (~550 videos at 720p, 15s).

interface CreatomateRenderRequest {
  template_id: string;
  modifications: Record<
    string,
    | string
    | { source: string; trim_start?: number; trim_duration?: number }
  >;
  webhook_url?: string;
}

async function renderWithCreatomate(
  templateId: string,
  modifications: Record<string, string>
): Promise<string> {
  const response = await fetch("https://api.creatomate.com/v1/renders", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.CREATOMATE_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify([
      {
        template_id: templateId,
        modifications,
      },
    ]),
  });

  const renders = await response.json();
  const renderId = renders[0].id;

  // Poll for completion
  while (true) {
    const statusRes = await fetch(
      `https://api.creatomate.com/v1/renders/${renderId}`,
      {
        headers: {
          Authorization: `Bearer ${process.env.CREATOMATE_API_KEY}`,
        },
      }
    );
    const status = await statusRes.json();

    if (status.status === "succeeded") {
      return status.url; // download URL for the rendered video
    }
    if (status.status === "failed") {
      throw new Error(`Render failed: ${status.error_message}`);
    }

    await new Promise((resolve) => setTimeout(resolve, 3000));
  }
}

// Usage with a pre-built template
const videoUrl = await renderWithCreatomate("your-template-id", {
  "hook-text": "Did you know?",
  "body-text": "This AI builds full-stack apps from a single prompt",
  "cta-text": "Try it free — link in bio",
  "background-video": "https://example.com/stock-footage.mp4",
  "voiceover-audio": "https://example.com/voiceover.mp3",
});

Composition Tool Comparison

Tool	Cost	Setup Time	Flexibility	Rendering Speed	Best For
FFmpeg	Free	2-4 hours	Maximum	Fast (local GPU)	High volume, full control
Remotion	Free (OSS)	4-8 hours	Very High	Medium (headless Chrome)	React developers, complex templates
Creatomate	$54-249/mo	1-2 hours	Medium	Fast (cloud)	Non-developers, quick start
Shotstack	$25-200/mo	1-2 hours	Medium	Fast (cloud)	Simple templates
Plainly	Custom	2-4 hours	High	Fast (cloud)	After Effects templates

Key insight: FFmpeg is free and fast but requires shell scripting knowledge. Remotion is the best option for TypeScript developers who want full control. Creatomate is for when you need results fast and don’t want to manage rendering infrastructure.

The final piece: automatically posting generated videos to TikTok, Instagram, and YouTube.

The Hard Truth About Auto-Posting

TikTok has no official posting API for individual creators. You need either:

Manual posting (copy video to phone, post)
Unofficial bots (against TOS, account risk)
TikTok for Business API (requires business account + approval)
Third-party tools like Buffer, Later, Hootsuite

Instagram allows posting via the Instagram Graph API, but only for business/creator accounts connected to a Facebook Page.

YouTube has the most accessible Data API v3 with full upload support.

Practical Auto-Posting Architecture

┌──────────────────────────────────────────────────────────┐
│                  POSTING PIPELINE                          │
├──────────────────────────────────────────────────────────┤
│                                                           │
│  Video Queue (filesystem / database / queue)              │
│  ┌─────────┐  ┌─────────┐  ┌─────────┐  ┌─────────┐    │
│  │video1.mp4│  │video2.mp4│  │video3.mp4│  │video4.mp4│   │
│  └────┬─────┘  └────┬─────┘  └────┬─────┘  └────┬─────┘  │
│       │              │              │              │        │
│       ▼              ▼              ▼              ▼        │
│  ┌──────────────────────────────────────────────────┐     │
│  │              Scheduler (cron / queue)              │     │
│  │  - Spaces posts 2-4 hours apart                   │     │
│  │  - Varies posting times (not exactly same time)    │     │
│  │  - Respects per-platform daily limits              │     │
│  └──────────────┬────────────────┬───────────────┘      │
│                 │                │                         │
│     ┌───────────▼──┐    ┌───────▼──────┐    ┌──────────┐ │
│     │   YouTube    │    │  Instagram   │    │  TikTok  │  │
│     │  Data API v3 │    │  Graph API   │    │  Manual  │  │
│     │  (automated) │    │  (automated) │    │  or Bot  │  │
│     └──────┬───────┘    └──────┬───────┘    └────┬─────┘  │
│            │                   │                  │         │
│            ▼                   ▼                  ▼         │
│  ┌──────────────────────────────────────────────────┐     │
│  │           Submission Queue                        │     │
│  │  - Captures post URLs                            │     │
│  │  - Submits to Content Rewards within 1 hour      │     │
│  │  - Tracks submission status                      │     │
│  └──────────────────────────────────────────────────┘     │
│                                                           │
└──────────────────────────────────────────────────────────┘

YouTube Upload Example

import { google } from "googleapis";
import { createReadStream } from "node:fs";

const oauth2Client = new google.auth.OAuth2(
  process.env.YOUTUBE_CLIENT_ID,
  process.env.YOUTUBE_CLIENT_SECRET,
  "http://localhost:3000/oauth/callback"
);

oauth2Client.setCredentials({
  refresh_token: process.env.YOUTUBE_REFRESH_TOKEN,
});

const youtube = google.youtube({ version: "v3", auth: oauth2Client });

interface UploadOptions {
  videoPath: string;
  title: string;
  description: string;
  tags: string[];
  categoryId?: string; // "22" = People & Blogs, "28" = Science & Tech
}

async function uploadToYouTube(options: UploadOptions): Promise<string> {
  const response = await youtube.videos.insert({
    part: ["snippet", "status"],
    requestBody: {
      snippet: {
        title: options.title.slice(0, 100),
        description: options.description,
        tags: options.tags,
        categoryId: options.categoryId ?? "28",
      },
      status: {
        privacyStatus: "public",
        selfDeclaredMadeForKids: false,
      },
    },
    media: {
      body: createReadStream(options.videoPath),
    },
  });

  const videoId = response.data.id;
  return `https://www.youtube.com/shorts/${videoId}`;
}

AI Clipping Agents

Several tools now offer end-to-end automation including auto-posting:

AutoClip — Uses Gemini 2.5 Flash to identify viral moments, auto-reframe to 9:16, add captions, and post directly to TikTok, YouTube Shorts, Instagram Reels, and X. Compresses the entire workflow into under 2 minutes.

AutoClips — Set up once and auto-post AI videos to 4 platforms daily. Claims 100% hands-free video automation.

OpenClaw — Open-source AI assistant (247K GitHub stars) with an autonomous content clipper that uses FFmpeg to trim videos and burn in captions automatically. Creators report producing 5-10x more content without increasing working hours.

OpusClip — AI-powered tool that turns long videos into viral short clips. Identifies high-retention moments, adds captions, and optimizes for each platform.

If you’re producing 10+ videos per day, cloud rendering costs add up. A local GPU pays for itself quickly.

GPU Requirements

Task	Minimum GPU	Recommended GPU	Notes
FFmpeg encoding	Any (CPU-only works)	NVIDIA GTX 1660+	NVENC hardware encoding
TTS (Piper)	CPU sufficient	Any	Runs real-time on CPU
TTS (Coqui XTTS-v2)	4GB VRAM	8GB+ VRAM	Voice cloning needs GPU
TTS (Bark)	8GB VRAM	12GB+ VRAM	Slow even on GPU
AI image generation	8GB VRAM	12GB+ VRAM	SDXL for custom images
Video upscaling	8GB VRAM	12GB+ VRAM	Real-ESRGAN

Recommended Setups

Budget build ($300-500 used):

NVIDIA RTX 3060 12GB (~$200 used)
Any desktop with PCIe slot
Handles: FFmpeg NVENC, Piper, Coqui XTTS-v2
Throughput: ~50 videos/hour

Performance build ($500-800 used):

NVIDIA RTX 3080 10GB (~~$350 used) or RTX 4070 12GB (~~$500)
Handles: Everything above + Bark, SDXL
Throughput: ~100 videos/hour

Mac alternative:

M1/M2/M3 Mac with 16GB+ unified memory
Handles: FFmpeg, Piper, some Coqui models
Throughput: ~30 videos/hour (Metal acceleration)

FFmpeg with NVIDIA GPU Encoding

ffmpeg -encoders 2>/dev/null | grep nvenc

ffmpeg -y \
  -i input.mp4 \
  -c:v h264_nvenc \
  -preset p4 \
  -cq 23 \
  -c:a aac -b:a 128k \
  -movflags +faststart \
  output.mp4

Cost Comparison: Local vs. Cloud

Metric	Local GPU (RTX 3080)	Creatomate	AWS MediaConvert
Upfront cost	$350 (used)	$0	$0
Monthly cost	~$10 electricity	$54-249	$50-200
Cost per video	~$0.002	~$0.10-0.50	~$0.05-0.20
100 videos/day	$6/month	$300-1,500/month	$150-600/month
Break-even	Month 1	Never	Never
Latency	5-30 seconds	30-120 seconds	30-60 seconds

Key insight: A used RTX 3080 pays for itself in the first month if you’re producing 50+ videos/day. The combination of FFmpeg NVENC encoding + Piper TTS + Pexels stock footage gives you a zero-marginal-cost video pipeline.

Content Rewards isn’t just for clippers. If you have a product, you can fund a campaign and get 200+ creators making content about your product for $1-5 per 1,000 views.

Why Brands Use It

Metric	Content Rewards	Facebook/Instagram Ads	Influencer Deal
CPM	$1-5	$25+	$10-100 (variable)
Content type	Organic UGC	Polished ad creative	Influencer post
Trust factor	High (looks real)	Low (labeled as ad)	Medium
Volume	200+ videos	1-5 ad variants	1-3 posts
Risk	Pay only for views	Pay for impressions	Pay upfront
Setup time	30 minutes	Hours (creative + targeting)	Days-weeks (outreach)

A startup with a $1,000 budget gets 500,000+ authentic views for the same cost as 40,000 impressions on Instagram Ads.

Setting Up a Brand Campaign

Step 1: Create a Whop account (whop.com)
Step 2: Navigate to Content Rewards for brands
Step 3: Configure your campaign:
  - Budget: $500 - $50,000+
  - Payout rate: $1-5 per 1K views
  - Max payout per clip: $500-3,000
  - Platforms: TikTok, Instagram, YouTube (pick which ones)
  - Content guidelines: What you want, what you don't want
  - Geographic restrictions: Where views should come from
  - Flat fee bonus: Optional per-submission bonus ($5-50)
Step 4: Fund the campaign via Stripe
Step 5: Creators discover and join your campaign
Step 6: Monitor dashboard for submissions, views, spend

Campaign Math

Example: SaaS product launch campaign

Budget: $5,000
Payout rate: $2/1K views
Max per clip: $1,000

Expected results:
- Total views: 2,500,000
- Creators: 100-300
- Videos produced: 300-1,000
- Cost per acquisition (if 0.5% click + 2% convert):
  2,500,000 views × 0.5% click = 12,500 clicks
  12,500 × 2% convert = 250 customers
  $5,000 / 250 = $20 CAC

Compare: Google Ads for SaaS typically $50-200 CAC

The Quality Control Problem

This is the cautionary side. When you open a campaign to the public, you get a mix of quality levels. Brand campaigns commonly report receiving 40+ low-quality submissions alongside a handful of excellent ones.

Common issues:

AI-generated content that violates your “no AI” policy
Content that misrepresents your product
Videos with no effort (single static image + stock music)
Content posted in wrong geographies
Same clip submitted to multiple campaigns with minimal changes

Mitigation strategies:

Set clear, specific guidelines (examples help)
Use manual approval before views start counting
Set a flat fee bonus to attract higher-quality creators
Require comments to be enabled (filters out bot accounts)
Geographic restrictions filter for target market views

Key insight: Content Rewards campaigns work best as a supplement to your marketing, not a replacement. The 80/20 rule applies: 80% of your views will come from 20% of the creators. Budget accordingly.

Here’s what it actually costs to produce content at each level of automation.

Manual Production

Per video:
- Research/ideation: 15 minutes
- Screen recording or filming: 10 minutes
- Editing in CapCut: 20 minutes
- Captions + text overlay: 10 minutes
- Posting to 3 platforms: 5 minutes
- Submitting to Content Rewards: 2 minutes
Total: ~62 minutes

Daily (5 videos): 5+ hours
Monthly (150 videos): 155 hours
Annual (1,800 videos): 1,860 hours

Cost: $0 (just your time)
Effective hourly rate at $500/month earnings: $3.22/hour

AI-Assisted Production

Per video:
- LLM generates script: 30 seconds ($0.003)
- TTS generates voiceover: 30 seconds ($0.008)
- Manual footage selection: 5 minutes
- FFmpeg composition: 1 minute (automated)
- Manual review + post: 5 minutes
- Submit to Content Rewards: 2 minutes
Total: ~14 minutes

Daily (10 videos): 2.3 hours
Monthly (300 videos): 70 hours
Annual (3,600 videos): 840 hours

Cost: ~$3.30/month (AI services)
Effective hourly rate at $1,500/month earnings: $21.38/hour

Fully Automated Production

Per video:
- Script generation: automated ($0.003)
- TTS: automated ($0.008)
- Stock footage fetch: automated ($0.00)
- Video composition: automated ($0.002)
- Auto-posting: automated ($0.00)
- Content Rewards submission: manual* ($0.00)
Total production time: ~30 seconds
Manual time: 2 minutes (submission only)

*Submission to Content Rewards currently requires
 manual URL input. This is the bottleneck.

Daily (20 videos): 40 minutes (submission only)
Monthly (600 videos): 20 hours
Annual (7,200 videos): 240 hours

Cost: ~$8/month (AI services) + $0-50/month (cloud if used)
Effective hourly rate at $3,000/month earnings: $150/hour

Full Cost Comparison Table

Category	Manual	AI-Assisted	Fully Automated
Videos/day	5	10	20
Videos/month	150	300	600
Time/day	5 hrs	2.3 hrs	40 min
Time/month	155 hrs	70 hrs	20 hrs
LLM cost/mo	$0	$1	$2
TTS cost/mo	$0	$2.40	$4.80
Stock footage	$0	$0	$0
Video rendering	$0	$0	$0 (local)
Total cost/mo	$0	$3.40	$6.80
Expected earnings	$500	$1,500	$3,000
Net monthly	$500	$1,497	$2,993
Effective $/hr	$3.22	$21.38	$149.65
Setup time	0 hrs	4-8 hrs	20-40 hrs
Technical skill	None	Basic	Intermediate

Key insight: The jump from manual to AI-assisted is the highest-ROI change. It cuts time by 55% and triples output. The jump from AI-assisted to fully automated requires more upfront engineering but reduces ongoing time by 71%.

Monetization Platform Comparison

Platform	Earnings per 1K Views	Follower Requirement	Content Ownership	Payout Speed	Barrier to Entry
Content Rewards	$1-5	None	Creator	Weekly (instant approval)	Very Low
TikTok Creator Rewards	$0.40-1.00	10K followers + 100K views/30d	Platform license	Monthly	High
YouTube AdSense	$3-15 (RPM)	1K subs + 4K watch hours	Creator	Monthly (21-day hold)	Very High
Instagram Reels Bonus	$0.07-0.50	10K followers	Platform license	Monthly	High
Freelance UGC	$150-2,000/video flat	Portfolio required	Negotiated	Net 30	Medium
Affiliate Marketing	Variable (CPA-based)	None (but need traffic)	Creator	Monthly	Medium

Clipping Platform Comparison

Platform	Model	Best For	Commission/Fee
Content Rewards (Whop)	Pay-per-view	UGC + brand campaigns	7% platform fee
Clip	Pay-per-view	Similar to Content Rewards	Varies
Topr	Brand-creator marketplace	Direct brand deals	Negotiated
ClipReward	Connect editors with creators	Editing services	Service fee
OpusClip	AI clipping tool	Content generation	Subscription
Clipping.net	Agency marketplace	Agency-scale operations	Varies

When to Use What

Scenario	Best Platform	Why
Zero followers, want income now	Content Rewards	No follower requirement
10K+ TikTok followers	Content Rewards + TikTok Creator Rewards	Stack both income streams
1K+ YouTube subscribers	Content Rewards + YouTube AdSense	Stack both income streams
Building a personal brand	YouTube long-form + Content Rewards for cash flow	YouTube builds equity, CR pays the bills
Running a SaaS product	Content Rewards (as brand)	Cheapest organic reach
Agency model	Content Rewards + direct clients	Scale via managing multiple clippers

Don’t	Do Instead	Why
Submit before posting	Post first, then submit within 1 hour	Views only count after submission
Use AI avatars on “no AI” campaigns	Read the brief, respect restrictions	Rejections waste your views
Post from brand-new accounts	Warm up accounts for 3-4 days first	New accounts get shadowbanned
Join campaigns with <20% budget remaining	Only join campaigns with 60%+ budget	Low-budget campaigns might not pay out
Post the same video to multiple campaigns	Create unique content per campaign	Platforms detect and suppress duplicates
Focus on one platform only	Cross-post to TikTok + IG + YouTube	3x the views from the same content effort
Chase viral hits	Focus on volume and consistency	100 videos at 5K views > waiting for 1 viral hit
Use copyrighted music	Use platform music libraries or royalty-free	Copyright strikes kill accounts
Skip captions	Always add captions	80%+ of social video is watched on mute
Post at random times	Post during peak hours (7-9am, 12-2pm, 7-10pm local)	Algorithm rewards early engagement
Ignore analytics	Track which formats/hooks/topics get views	Data-driven iteration beats guessing
Build a massive pipeline before testing	Start with 5-10 manual videos to learn what works	Automating the wrong format wastes engineering time
Use the most expensive AI tools first	Start with free/cheap options, upgrade when profitable	Piper + Pexels + FFmpeg = $0 production cost
Assume $45K/36hrs is normal	Budget for $200-500/month initially	Outlier results are not benchmarks

Phase 1: Manual Learning (Week 1-2)

Goal: Post 5-10 videos manually. Learn what gets views.

Actions:
- Join 3-5 campaigns with good budgets ($2+ CPM)
- Try all 5 content formats
- Post on TikTok + Instagram + YouTube
- Track views per format per platform
- Total investment: $0, ~10 hours

Expected outcome: $50-200 in earnings
Key learning: Which format works for which campaign type

Phase 2: AI-Assisted Production (Week 3-4)

Goal: 10x your output using LLM scripts and TTS voiceover.

Actions:
- Set up script generation (see Script Generator section)
- Set up TTS pipeline (OpenAI TTS or Piper)
- Batch-produce 10 videos/day
- Continue manual posting + submission
- Total investment: ~$5/month, ~8 hours setup

Expected outcome: $300-800/month
Key learning: Production bottleneck is now posting, not creating

Phase 3: Semi-Automated Pipeline (Month 2-3)

Goal: Automate everything except posting and submission.

Actions:
- Build FFmpeg composition pipeline
- Integrate Pexels API for stock footage
- Create 3-5 video templates (different formats)
- Set up cron job to produce videos overnight
- Manual posting in morning batch (30-60 min/day)
- Total investment: ~$10/month, 20-40 hours engineering

Expected outcome: $1,000-2,000/month
Key learning: Template quality matters more than volume

Phase 4: Full Automation (Month 3-6)

Goal: Minimize daily manual time to <30 minutes.

Actions:
- Add auto-posting for YouTube (API) and Instagram (Graph API)
- TikTok remains semi-manual (use scheduling tools)
- Automated campaign scanning for new high-budget campaigns
- Analytics dashboard tracking per-video ROI
- A/B testing frameworks for hooks and formats
- Total investment: ~$50/month (tools), 40-80 hours engineering

Expected outcome: $2,000-5,000/month
Key learning: The bottleneck shifts to campaign selection and content quality

Phase 5: Agency Scale (Month 6+)

Goal: Manage multiple accounts and campaigns.

Actions:
- Hire 1-2 VAs for posting/submission ($3-5/hr)
- Run 10+ campaigns simultaneously
- Build template library (20+ templates)
- Offer campaign management to brands (20-30% fee)
- Total investment: $500-1,000/month (VAs + tools)

Expected outcome: $5,000-20,000/month
Key learning: Operations management becomes the job, not content creation

Here’s everything wired together — a single TypeScript script that takes a campaign brief and produces a ready-to-post video.

import Anthropic from "@anthropic-ai/sdk";
import OpenAI from "openai";
import { execSync } from "node:child_process";
import { writeFile, mkdir, unlink } from "node:fs/promises";
import { join } from "node:path";

// ── Types ──────────────────────────────────────────────────

interface CampaignBrief {
  brandName: string;
  productDescription: string;
  targetAudience: string;
  keyBenefits: string[];
  restrictions: string[];
  tone: "casual" | "professional" | "excited" | "educational";
  payoutRate: number; // $/1K views
}

interface VideoScript {
  hook: string;
  body: string[];
  cta: string;
  searchTerms: string[];
  fullNarration: string;
}

interface PexelsVideo {
  id: number;
  video_files: Array<{
    quality: string;
    width: number;
    height: number;
    link: string;
  }>;
}

// ── Step 1: Generate Script ────────────────────────────────

async function generateScript(
  brief: CampaignBrief
): Promise<VideoScript> {
  const client = new Anthropic();

  const response = await client.messages.create({
    model: "claude-sonnet-4-20250514",
    max_tokens: 1000,
    messages: [
      {
        role: "user",
        content: `Generate ONE short-form video script (30 seconds) for ${brief.brandName}.
Product: ${brief.productDescription}
Audience: ${brief.targetAudience}
Benefits: ${brief.keyBenefits.join(", ")}
Restrictions: ${brief.restrictions.join(", ")}
Tone: ${brief.tone}

Return JSON: { hook, body: string[], cta, searchTerms: string[], fullNarration: string }
fullNarration = the complete spoken script (hook + body + cta as one flowing text).`,
      },
    ],
  });

  const text =
    response.content[0].type === "text" ? response.content[0].text : "";
  const match = text.match(/\{[\s\S]*\}/);
  if (!match) throw new Error("Failed to parse script");
  return JSON.parse(match[0]);
}

// ── Step 2: Generate Voiceover ─────────────────────────────

async function generateVoiceover(
  text: string,
  outputPath: string
): Promise<void> {
  const openai = new OpenAI();
  const response = await openai.audio.speech.create({
    model: "tts-1",
    voice: "nova",
    input: text,
    speed: 1.1,
    response_format: "mp3",
  });
  const buffer = Buffer.from(await response.arrayBuffer());
  await writeFile(outputPath, buffer);
}

// ── Step 3: Fetch Stock Footage ────────────────────────────

async function fetchFootage(
  searchTerms: string[],
  outputDir: string
): Promise<string[]> {
  const paths: string[] = [];

  for (let i = 0; i < Math.min(searchTerms.length, 3); i++) {
    const params = new URLSearchParams({
      query: searchTerms[i],
      per_page: "5",
      orientation: "portrait",
      min_duration: "5",
      max_duration: "15",
    });

    const res = await fetch(
      `https://api.pexels.com/videos/search?${params}`,
      { headers: { Authorization: process.env.PEXELS_API_KEY! } }
    );
    const data = await res.json();
    const videos: PexelsVideo[] = data.videos ?? [];

    if (videos.length > 0) {
      const video = videos[Math.floor(Math.random() * videos.length)];
      const file =
        video.video_files.find(
          (f) => f.quality === "hd" && f.height > f.width
        ) ?? video.video_files[0];

      if (file) {
        const videoRes = await fetch(file.link);
        const buf = Buffer.from(await videoRes.arrayBuffer());
        const path = join(outputDir, `footage-${i}.mp4`);
        await writeFile(path, buf);
        paths.push(path);
      }
    }
  }

  return paths;
}

// ── Step 4: Compose Video ──────────────────────────────────

function composeVideo(
  footagePaths: string[],
  audioPath: string,
  script: VideoScript,
  outputPath: string
): void {
  const tmpDir = "/tmp/compose-pipeline";
  execSync(`mkdir -p ${tmpDir}`);

  // Scale and trim clips
  const scaled: string[] = [];
  for (let i = 0; i < footagePaths.length; i++) {
    const out = join(tmpDir, `s-${i}.mp4`);
    execSync(
      `ffmpeg -y -i "${footagePaths[i]}" ` +
        `-vf "scale=1080:1920:force_original_aspect_ratio=increase,crop=1080:1920" ` +
        `-t 10 -c:v libx264 -preset fast -crf 23 -an "${out}"`,
      { stdio: "pipe" }
    );
    scaled.push(out);
  }

  // Concat
  const concatFile = join(tmpDir, "list.txt");
  const concatTxt = scaled.map((p) => `file '${p}'`).join("\n");
  execSync(`echo '${concatTxt}' > "${concatFile}"`);

  const combined = join(tmpDir, "combined.mp4");
  execSync(
    `ffmpeg -y -f concat -safe 0 -i "${concatFile}" -c copy "${combined}"`,
    { stdio: "pipe" }
  );

  // Add text overlays + audio
  const hookEsc = script.hook.replace(/'/g, "\\'").replace(/:/g, "\\:");
  const ctaEsc = script.cta.replace(/'/g, "\\'").replace(/:/g, "\\:");

  execSync(
    `ffmpeg -y -i "${combined}" -i "${audioPath}" ` +
      `-filter_complex "[0:v]` +
      `drawtext=text='${hookEsc}':fontsize=64:fontcolor=white:borderw=3:bordercolor=black:x=(w-text_w)/2:y=h-350:enable='between(t,0,5)',` +
      `drawtext=text='${ctaEsc}':fontsize=56:fontcolor=yellow:borderw=3:bordercolor=black:x=(w-text_w)/2:y=h-350:enable='between(t,20,30)'` +
      `[outv]" ` +
      `-map "[outv]" -map 1:a ` +
      `-c:v libx264 -preset fast -crf 23 -c:a aac -b:a 128k -shortest -movflags +faststart ` +
      `"${outputPath}"`,
    { stdio: "pipe" }
  );

  // Cleanup
  scaled.forEach((p) => execSync(`rm -f "${p}"`));
  execSync(`rm -f "${concatFile}" "${combined}"`);
}

// ── Main Pipeline ──────────────────────────────────────────

async function produceVideo(brief: CampaignBrief): Promise<string> {
  const workDir = "/tmp/pipeline-" + Date.now();
  await mkdir(workDir, { recursive: true });

  console.log("Step 1: Generating script...");
  const script = await generateScript(brief);
  console.log(`  Hook: "${script.hook}"`);
  console.log(`  Search terms: ${script.searchTerms.join(", ")}`);

  console.log("Step 2: Generating voiceover...");
  const audioPath = join(workDir, "voiceover.mp3");
  await generateVoiceover(script.fullNarration, audioPath);

  console.log("Step 3: Fetching stock footage...");
  const footagePaths = await fetchFootage(script.searchTerms, workDir);
  console.log(`  Found ${footagePaths.length} clips`);

  if (footagePaths.length === 0) {
    throw new Error("No footage found for search terms");
  }

  console.log("Step 4: Composing video...");
  const outputPath = join(workDir, "final.mp4");
  composeVideo(footagePaths, audioPath, script, outputPath);

  console.log(`Done! Video saved to: ${outputPath}`);
  return outputPath;
}

// ── Run ────────────────────────────────────────────────────

const videoPath = await produceVideo({
  brandName: "Lovable",
  productDescription: "AI-powered full-stack web app builder",
  targetAudience: "Indie hackers and non-technical founders",
  keyBenefits: [
    "Build complete web apps with natural language",
    "No coding required",
    "Deploy in minutes",
  ],
  restrictions: ["No AI voiceovers", "Must show real product usage"],
  tone: "excited",
  payoutRate: 2.0,
});

console.log(`\nReady to post: ${videoPath}`);
console.log("Next steps:");
console.log("1. Post to TikTok, Instagram Reels, YouTube Shorts");
console.log("2. Submit URLs to Content Rewards within 1 hour");
console.log("3. Track views and earnings on Whop dashboard");

Running the Pipeline

npm install @anthropic-ai/sdk openai
brew install ffmpeg  # or apt-get install ffmpeg

export ANTHROPIC_API_KEY="sk-..."
export OPENAI_API_KEY="sk-..."
export PEXELS_API_KEY="..."

npx tsx pipeline.ts

Cost Per Video (This Pipeline)

Component	Cost
Claude Sonnet (script)	$0.003
OpenAI TTS (voiceover)	$0.008
Pexels (footage)	$0.00
FFmpeg (rendering)	$0.00
Total per video	$0.011

At $0.011 per video and a $2/1K views payout, you break even at just 6 views per video. Everything above that is profit.

”Is this legal?”

Yes. Content Rewards is a legitimate marketplace. Brands explicitly authorize you to create and post content about their products. You’re not stealing content — you’re creating it under a paid campaign agreement.

”Do I need to show my face?”

No. Faceless formats (slideshows, screen recordings, “Did You Know” videos) perform well. Many top clippers never show their face.

”Can I use multiple accounts?”

The platform doesn’t explicitly ban it, but TikTok, Instagram, and YouTube all restrict operating multiple accounts from the same device. Use different devices or be prepared for account flags. This is a gray area — proceed at your own risk.

”What happens when a campaign runs out of budget?”

You stop earning from that campaign. Any views generated after the budget depletes aren’t paid. This is why you should check budget remaining before joining.

”Can I do this outside the US?”

Yes. Content Rewards is available globally. However, some campaigns restrict which geographic locations views come from. A campaign targeting US views won’t pay for views from non-US audiences, even if the creator is based elsewhere.

”Is this just reposting other people’s content?”

It can be, but the highest earners create original content. “Clipping” originally meant cutting highlights from long-form videos (with permission via the campaign). Today, many campaigns want original UGC — not just clips of existing content.

”How long do videos keep earning?”

Videos continue earning as long as they get views and the campaign budget hasn’t been depleted. A viral video can earn for weeks. Most videos earn 80% of their total views in the first 48 hours.

Content Rewards clipping is a real business model with real money flowing through it. But like any opportunity, the marketing around it is more optimistic than the median reality.

What’s real:

Brands are spending real money ($1K-$50K+ per campaign)
Creators are getting paid real money (average $1-5/1K views)
The barrier to entry is genuinely low (free to start, no followers needed)
AI automation can dramatically reduce production time and cost
Cross-platform posting multiplies earnings from the same effort

What’s overhyped:

The $45K/36hrs stories are extreme outliers, not benchmarks
Most clippers earn $50-200/month, not $10K/month
“Passive income” requires significant upfront work to build the pipeline
Campaign budgets drain fast — the best campaigns get crowded quickly
Platform algorithm changes can tank your views overnight

Who should try this:

Developers who can build automation (highest ROI from engineering skills)
Content marketers who understand hooks and engagement
People willing to iterate through 100+ videos before seeing consistent results
Side hustlers who want $500-2,000/month without a huge time commitment

Who should skip this:

People expecting overnight $10K/month results
People unwilling to learn video editing basics
People who want truly passive income (this requires ongoing work)
People who can earn more per hour doing something else

The automation angle is what makes this interesting for technical people. A non-technical clipper competes on creativity and hustle. A developer competes on scale and efficiency. Both can win, but the developer’s ceiling is higher because marginal cost approaches zero.

Official Platforms and Documentation

Content Rewards — The Marketplace for Virality — The main platform for performance-based UGC campaigns
Content Rewards Discovery — Browse Active Campaigns — Live campaign listings with budgets and payout rates
Content Rewards FAQs — Official help center and frequently asked questions
Whop Content Rewards Documentation — Official docs for brands setting up campaigns
Content Rewards Blog — Get Paid to Clip Videos — Official guide for new clippers
TikTok Creator Rewards Program — TikTok’s official monetization requirements and rates

Guides and Tutorials

The Complete Whop Clipping Guide (2025) — Most comprehensive clipping guide with tiered earnings data and strategy
What is Clipping? The Ultimate Guide — Whop Blog — Official Whop explanation of the clipping model
How to Use Content Rewards on Whop — Creator and Brand Guide — Step-by-step for both sides of the marketplace
Content Rewards: Grow Your Brand Without Wasting Money — Brand-focused guide with campaign examples (Hostage Tape case study)
How to Set Up Whop Content Rewards — Technical setup guide for brands
Whop Clipping 101: How to Make Money with Whop (2026) — Beginner guide with tool recommendations
How to Make Money Clipping Videos with Whop (2026) — Practical earnings walkthrough
How I Earned $2,500/Month with Whop Clipping — Creator case study

Creator Earnings and Case Studies

@jessieclipping — $45K in 36 hours (X/Twitter) — Viral earnings screenshot (15,489 bookmarks)
@jessieclipping — First month earnings journey (X/Twitter) — More realistic $340 first month data (688 bookmarks)
@reyaffrev — $50K+ total earnings at age 17 (X/Twitter) — Consistent high earner case study (762 bookmarks)
@reyaffrev — $1,600/week from reposting clips (X/Twitter) — Weekly earnings breakdown (188 bookmarks)
@alexxgrowth — 30-Day $10K Challenge (X/Twitter) — Clipping education and challenge format (760 bookmarks)
Content Rewards x Lovable Case Study — $10K budget, $2/1K views, campaign brief details
How to Make $10K/month With Whop Clipping — Medium — Detailed earnings walkthrough

AI UGC and Avatar Tools

HeyGen — AI Avatar Video Generator — Starting at $29/mo, 1,100+ avatars, API available
Arcads — AI UGC Ad Creator — Best-in-class avatar quality, $100/mo for 10 videos
Creatify — AI Video Ad Generator — 1,500+ avatars, batch mode, $39/mo
7 Best AI UGC Tools: Video Generators Ranked (2026) — Comprehensive tool comparison
Best AI UGC Generators 2026: 7 Tools Compared — Price-per-video analysis
Compare Pricing for UGC Video Production Tools — Side-by-side pricing breakdown

Text-to-Speech

ElevenLabs — AI Voice Generator — Best quality, $5/mo starter, ~$180/1M characters
ElevenLabs API Pricing — Detailed API cost breakdown
OpenAI Text-to-Speech API — $15/1M characters, best value for volume
Piper TTS — Open Source — Free, runs locally, 30+ languages
Coqui TTS — Open Source — Free, voice cloning via XTTS-v2
Best TTS APIs in 2026 Compared — 12 services compared
Open Source TTS Alternatives Compared — Piper, Coqui, Bark, and more

Stock Footage APIs

Pexels API Documentation — Free API, 200 req/hr, photos + videos
Pexels API Overview — Getting started guide
Pixabay API Documentation — Free API, photos + videos + illustrations

Video Composition and Rendering

FFmpeg — Official Site — The universal video processing toolkit
Remotion — Make Videos Programmatically with React — React-based video generation framework
Remotion GitHub Repository — Source code and documentation
Creatomate — Cloud Video Editing API — Programmatic video rendering, $54/mo starting
Creatomate Developer Docs — API integration guides
Creatomate Pricing — Credit-based pricing breakdown
7 Best Video Editing APIs (2026) — API comparison

Auto-Posting and Automation Tools

AutoClip — AI Clip Generator — Uses Gemini 2.5 Flash, auto-posts to 4 platforms
AutoClips — AI Video Automation — 100% hands-free posting pipeline
OpenClaw — Open Source AI Assistant — 247K GitHub stars, autonomous content clipper
OpusClip — AI Video Clipping — AI-powered viral clip extraction
OpenClaw + OpusClip Content Machine — End-to-end automation guide

Creator Earnings Comparisons

Which Social Platform Pays the Most (2026) — Cross-platform earnings comparison
Creator Earnings Reports 2026 — Comprehensive earnings data
YouTube CPM & RPM Rates 2026 — Average rates by niche and country
TikTok Creator Earnings Breakdown 2026 — TikTok-specific earnings analysis
UGC Rates: What Brands Actually Pay — Freelance UGC pricing data

Alternative Platforms

Clip — Create Videos and Earn Cash — Alternative to Content Rewards
Topr — Brand-Creator Marketplace — Direct brand-creator deals
ClipReward — Connect Creators with Video Editors — Editing services marketplace
Clipping.net — Agency-scale clipping marketplace
15 UGC Platforms That Pay Creators — Comprehensive platform list

Community and Discussion

Clipping & Content Rewards — Skool Business Ideas — Community discussion on the business model
Whop Clipping Masterplan — Whop’s official clipping community
Content Rewards on Whop Discover — Browse all Content Rewards communities