Hey, I’m Dora. I recorded a 90‑second explainer about vector databases and thought, “This is clean.” Then I checked the analytics: average view duration 31 seconds, a cliff at 0:07. Ouch. The first seven seconds were me warming up. Viewers didn’t stick around to see the good parts.
That little sting pushed me into a weekend rabbit hole on script structure for video. I tried different frameworks, rewrote my hooks, and even timed my beats with a kitchen timer (Dec 1–3, 2025). Here’s what actually changed my metrics and my sanity.
Script Structure Types for Better Video Writing

Key Frameworks for Clear Script Structure in Video
I used to improvise. It felt “creative,” but it also meant rambling. These are the structures that made my videos tighter and easier to watch:
- Three-Act (Setup → Confrontation → Resolution): Great for narratives, case studies, or feature walkthroughs that build tension. Keeps viewers moving toward a payoff.
- Problem–Agitate–Solve (PAS): For tutorials and tool reviews. You name the pain, make it real, then fix it. Fast and satisfying.
- AIDA (Attention–Interest–Desire–Action): Ideal for landing-page videos or product teasers. Works when you need a clean CTA.
- Hook–Story–Offer (HSO): Short-form and social-friendly. Hook hard, deliver one insight, then give a next step.
- Story Spine (Pixar-style: “Once upon a time…” → “Until finally…”): Surprisingly useful for case studies and customer stories.
Why structure matters: It reduces cognitive load. Viewers know where they are in the journey, which helps retention. YouTube literally rewards that: audience retention and relative retention are core signals for discovery (see YouTube’s docs on audience retention). When I switched to PAS for explainers, my average view duration jumped from 31s to 49s on a 1:20 video (Dec 2, 2025, sample size: 3 uploads, small but noticeable).

Choosing the Right Script Structure for Your Video Goals
- Quick Tutorial (under 2 min): PAS. Example: “Your data is messy (P). Here’s why that breaks SEO (A). Here’s the 60‑second cleanup flow (S).”
- Deep Dive (5–10 min): Three-Act. Setup context and stakes, walk through the conflict (trade-offs, gotchas), land the resolution with a checklist.
- Product Demo: AIDA. Grab attention with the outcome, spark interest with a mini-story, build desire with one killer feature, then a simple CTA.
- Shorts/Reels: HSO. Lead with a visual hook or bold line in the first 2 seconds. One insight. One offer.
- Case Study: Story Spine. It humanizes numbers without fluff.
If you’re unsure, ask: what must the viewer do or feel by the end? Choose the frame that best delivers that feeling or action with the least friction.
Scene Breakdown for Clear Script Structure in Video Content
I stopped writing “scripts” and started writing “scenes.” That tiny change made editing faster and my pacing more intentional.
How to Organize Scenes for Smooth Video Flow
Here’s the scene card template I used on Dec 3, 2025, for a 3‑minute tutorial:
- Scene 0: Cold Open (0:00–0:05)
Visual: fast before/after screen.
Line: “This messy CSV cost us 12% traffic. Here’s the 3‑minute fix.”
Note: no greeting yet.
- Scene 1: Promise (0:05–0:12)
Visual: clean dashboard.
Line: “By minute three, you’ll batch-fix schema errors.”
- Scene 2: Context (0:12–0:25)
Visual: one slide with the problem.
Line: one sentence of why this matters for search indexation.
- Scene 3–5: Steps (0:25–2:15)
Visual: screen capture + tight zooms.
Lines: one action per beat, verbs first. Add B‑roll notes and on‑screen text (“Press Shift+Cmd+L”).
- Scene 6: Proof (2:15–2:40)
Visual: before/after metrics.
Line: “Watch the warnings drop from 127 to 18.”
- Scene 7: Next Step (2:40–3:00)
Visual: end card.
Line: soft CTA: “Template link below.”
Tiny extras that helped:
- I tag audio mood per scene (calm, punchy) so music doesn’t fight narration.
- I pre-write lower-thirds and any key on-screen text. This avoids re-records.
- I mark “cut points” in the script. Editing becomes connect-the-dots.
Result: same day, my edit time dropped from 1h12 to 42 minutes, and the retention curve smoothed, fewer sharp dips after Scene 2.
Timing Guidelines for Polished Video Script Structure
If script structure is the map, timing is the speed limit. I track beats in seconds, not minutes.
- Hooks: 0–5s. Show the payoff or the pain immediately. Don’t greet: earn the next second first.
- Promise: by 0:08–0:12. One clear outcome.
- Context: 0:12–0:25. Use one slide, one sentence. If it needs more, drip it later.
- Steps/Demo: 60–70% of runtime. One action per beat, 6–12s each.
- Proof: 10–20% of runtime: graph, before/after, or live result.
- CTA: last 8–12s. Make it adjacent to the benefit you just showed.
Word math that actually helps:
- Speaking rate: ~140–160 words/minute conversational. If you’re aiming for 90 seconds, target 210–240 words of spoken lines. Leave buffer for breaths and cuts.
- On-screen text: people read slower than you think. Keep lower-thirds under 7 words when possible.
My mini test on Dec 5, 2025: I tightened hooks to 0–3s across two Shorts. Average watch percentage improved from 78% to 86%, and replays ticked up by ~11%. It’s not a lab study, but it felt real in the graph.
For Shorts (0:15–0:45):
- 0–2s: visual hook.
- 2–10s: one insight.
- 10–30s: quick proof or micro-demo.
- 30–40s: single next step.
Reference if you want to go deeper: YouTube’s guidance on retention and intros is solid starting material.

Examples of Effective Script Structure for Video Creators
Here are three structures I actually used, with sample lines you can steal and bend.
- PAS for a 2‑min SEO cleanup
- Problem (0:00–0:07): “Your sitemap looks fine, but search is skipping 18% of pages.”
- Agitate (0:07–0:15): quick clip of crawl anomalies: one sentence on lost clicks.
- Solve (0:15–1:40): three steps, one per beat. “Open… Filter… Fix…”
- Proof (1:40–1:55): “Warnings down from 127 to 18 in one run.”
- CTA (1:55–2:00): “Grab the checklist below.”
- Three-Act for a 6‑min case study
- Act I (0:00–0:45): protagonist + stakes. “We were stuck at 2.3% CTR.”
- Act II (0:45–4:30): the work: tests, failures, the small win at minute 3.
- Act III (4:30–6:00): results and what you can copy this week.
- AIDA for a product walkthrough
- Attention (0:00–0:05): before/after animation.
- Interest (0:05–0:25): “This trims your research time by ~37%.”
- Desire (0:25–1:30): two features framed by outcomes, not specs.
- Action (1:30–1:50): “Template link: free version works.”
Small tooling note (Dec 6, 2025): I drafted beats with an LLM, then rewrote the hook myself. The AI was great for listing steps, mediocre for the first line. If you try this, prompt for time-stamped beats (“give each step a 6–10s window”). Then read it aloud with a timer. Trim anything that sounds foggy.
If you want a nudge to start: pick one recent video that underperformed, re-script only the first 20 seconds using PAS + a 3‑second visual hook, and re-upload as a new cut. Track average view duration and the first 30‑second retention. When I did this on Dec 2, 2025, average view duration on that piece improved from 31s to 49s, and the first dip moved from 0:07 to ~0:18.
None of this is magic. It’s scaffolding. But wow, a little scaffolding makes the climb feel a lot less exhausting.
By the way, I’ve recently been using Crepal, an AI tool that speeds up my entire script-to-video workflow. It can turn a simple prompt or rough draft straight into a complete video — automatically structuring the script, breaking it into scenes, tightening the hook and pacing, then adding voiceover, subtitles, and background music.
If you try any of the frameworks above, send me your timing experiments. I’m curious what beats hold for your audience too.
Previous posts:






