Lyric Video File Formats Explained: MP4, MOV, WebM, and What to Use Where
Most musicians export lyric videos as "MP4" without thinking about it. That works 90% of the time. The other 10% — Spotify Canvas size limits, transparent backgrounds, ProRes for sync libraries — is where wrong-format choices break things.
Here's the plain-English guide to lyric video file formats, what each does, and when to use which.
The Quick Answer
For 95% of cases:
- MP4 (H.264) — TikTok, YouTube, Reels, Shorts, X, Pinterest, sync pitches, Spotify Canvas. Universal default.
- MOV (ProRes) — high-end sync delivery, archival masters. Big files but lossless.
- WebM (VP9) — web-embedded video where you don't want MP4's licensing overhead. Rare for musicians.
- GIF — short looping promo clips for X and Tumblr. Tiny resolution, smaller file.
- PNG sequence — exporting frames for further editing in After Effects or Final Cut. Niche.
- MP4 (H.265 / HEVC) — better compression than H.264 but spotty support. Avoid unless you have a specific reason.
MP4 (H.264) — The Default
MP4 with H.264 video codec and AAC audio codec is the universal video format. It plays on every device, every platform, every social network.
Specs that work everywhere:
- Container: MP4
- Video codec: H.264 (also called AVC)
- Video bitrate: 8-12 Mbps for 1080p, 35-45 Mbps for 4K
- Audio codec: AAC
- Audio bitrate: 192-256 kbps
- Frame rate: 30 fps (60 fps if your template benefits, but doubles file size)
In Epitrite's export settings, this is the default option labeled "MP4 (1080p)" or "MP4 (4K)".
Use MP4 (H.264) for:
- TikTok upload
- YouTube upload (long-form and Shorts)
- Instagram Reels
- Spotify Canvas
- X / Twitter
- Most sync agency submissions
- Direct file delivery to anyone who isn't a film professional
If someone asks for "an MP4," they mean this.
MOV (ProRes) — High-End Sync and Archival
MOV is Apple's container format. ProRes is the codec inside it. ProRes is essentially uncompressed video — it preserves every detail, color, and pixel exactly.
The trade-off:
- ProRes 422 file: ~1GB per minute of 1080p video
- ProRes 4444 file: ~3GB per minute of 1080p video (with alpha channel for transparency)
- vs MP4 H.264 1080p: ~50-100MB per minute
ProRes files are 10-30x larger than equivalent MP4. The benefit is they're lossless — they don't compress, so they won't degrade if re-encoded by a downstream system.
Use ProRes for:
- Sync pitches to professional music supervisors (some agencies require it)
- Archival masters (the file you keep forever)
- Handing off to a video editor for further work
- Color grading workflows where compression artifacts would hurt
For most independent artists shipping lyric videos to social platforms, ProRes is overkill. The platforms re-encode your upload to their own format anyway, so the extra fidelity gets lost.
WebM (VP9) — Web-Embedded Video
WebM is Google's open-source video format. It's used heavily on YouTube internally (your "MP4" YouTube serves is often actually WebM with VP9 codec).
Why WebM exists: H.264 has licensing fees for some commercial uses; WebM/VP9 is patent-free.
For musicians: rarely matters. You won't export WebM unless you're embedding video on your own website and want to avoid H.264 licensing complications.
If you're not building your own website with embedded video, ignore WebM.
GIF — Tiny Loops for X and Tumblr
GIF is an old image format (1987!) that supports animated frames. It's not really a video format — it's a series of images.
Trade-offs:
- File size: huge for what you get (MB for a 5-second loop)
- Resolution: typically capped at 480-720px for web delivery
- Color: 256 colors max — looks bad for any video with gradients or photographs
- Audio: none
Use GIF for:
- Short looping clips (3-6 seconds) for X / Twitter
- Tumblr lyric snippets
- Email signatures or in-message embedding
- Discord stickers / emoji uses
Don't use GIF for:
- Anything with audio (use MP4 silent)
- Anything that needs to be longer than 6 seconds
- Anything that needs full color or smooth gradients
In Epitrite, you'd export MP4 short, then convert to GIF in a separate tool (Photoshop, ezgif.com, ffmpeg).
PNG Sequence — For Further Editing
A PNG sequence is exactly what it sounds like: every frame of your video exported as a separate PNG image. So a 30-frame-per-second 60-second video becomes 1,800 PNG files.
Use PNG sequence for:
- Importing into After Effects for further compositing
- Importing into Final Cut Pro for additional editing layers
- Frame-perfect manual editing or color grading
- Archival of high-fidelity individual frames
Don't use PNG sequence for:
- Direct platform upload (no platform accepts it)
- Any general delivery (it's a working format, not a delivery format)
Niche, but useful if you're handing off to a video editor.
Transparent Backgrounds (Alpha Channel)
If you want your lyric video to play over something else (a background video, a website hero, an overlay on someone else's footage), you need transparency.
Formats that support transparency:
- MOV (ProRes 4444) — full transparency, professional grade, large files
- MOV (Animation codec) — full transparency, even larger files
- WebM (VP9 with alpha) — supports transparency, browser playback
- PNG sequence — every frame can be transparent
- MP4 (H.264) — does NOT support transparency. The "transparent" output will be black.
If you're exporting from Epitrite for use as an overlay (e.g., keying into a music video later), use ProRes 4444 MOV. The file is large but the transparency is preserved.
If you just want a transparent-looking output for visual style (lyrics floating with no background), Epitrite's transparent background export uses ProRes 4444 by default.
Spotify Canvas Specs
Spotify Canvas has tight specs that don't match social-platform defaults:
- Aspect ratio: 9:16 (1080×1920)
- Length: 3-8 seconds (must loop seamlessly)
- File format: MP4
- Codec: H.264
- Max file size: 8MB
- Audio: ignored (Canvas plays silently with track audio underneath)
The 8MB limit is the trickiest. For 8 seconds of 1080×1920 video, you need a video bitrate around 6-7 Mbps to fit. Epitrite's "Spotify Canvas" export preset handles this automatically.
If you upload a higher-bitrate file, Spotify rejects it.
YouTube Specs
YouTube accepts almost any reasonable format and re-encodes everything internally. But to avoid quality loss in YouTube's re-encode:
- Format: MP4
- Codec: H.264 (H.265 also accepted)
- Resolution: 1080p or 4K (4K gets better treatment in YouTube's encoder)
- Frame rate: 24, 30, or 60 fps
- Audio: AAC stereo at 384 kbps recommended (more than required)
- Bitrate: at least 8 Mbps for 1080p, 35 Mbps for 4K
Higher bitrates than required don't hurt — YouTube re-encodes anyway, but starting from a higher-quality source means the re-encoded output looks better.
TikTok and Reels Specs
TikTok and Reels are forgiving of format but unforgiving of length and aspect:
- Format: MP4
- Codec: H.264
- Resolution: 1080×1920 (9:16)
- Length: TikTok up to 10 minutes (most lyric videos: 30-90 seconds); Reels up to 90 seconds
- Frame rate: 30 fps (60 fps also works)
- File size: under 287MB for TikTok, under 4GB for Reels
- Audio: AAC stereo
Sync Library / Music Supervisor Delivery
Sync libraries usually request a specific format. Common asks:
- Master audio: WAV 44.1kHz / 16-bit (or 96kHz / 24-bit for high-end)
- Instrumental WAV: same specs as master
- Lyric video: MP4 H.264 1080p OR ProRes 422 MOV (some prefer one, some accept either)
- One-sheet: PDF
- Splits: PDF or signed agreement
If the agency doesn't specify, default to MP4 H.264 1080p for the lyric video.
When to Use 4K vs 1080p
- TikTok / Reels / Shorts: 1080p (4K gets compressed to 1080p anyway by the platforms)
- YouTube long-form: 4K if you have it (Epitrite Pro), 1080p otherwise
- Spotify Canvas: 1080p only (8MB limit makes 4K impossible)
- Sync delivery: 4K if requested, 1080p if not specified
- Personal archive: 4K if you have the source assets at that resolution
For lyric videos specifically, 4K is most valuable on YouTube long-form where viewers might fullscreen on TVs. On phones, 1080p is visually identical to 4K.
Common Questions
What's the difference between MP4 and MOV?
MP4 is a universal container format that plays everywhere. MOV is Apple's container format, often used with the ProRes codec for high-fidelity work. For most lyric video uses, MP4 is the right choice.
Can I upload MOV to TikTok or Instagram?
Yes, both accept MOV — but they re-encode to their own format. Better to upload MP4 directly to avoid the re-encoding loss.
What's H.264 vs H.265?
H.264 (AVC) is the universal video codec, supported everywhere. H.265 (HEVC) is newer, more efficient (smaller files for same quality), but not universally supported. Stick with H.264 unless you have a specific reason.
Why does Spotify Canvas have an 8MB limit?
Spotify wants Canvas to load instantly without buffering. The 8MB cap ensures Canvas loads under 1 second on most connections.
Can I export 60fps lyric videos?
Yes — most platforms accept 60fps. The trade-off is double the file size for marginal visual benefit on text-driven content. 30fps is the lyric video default.
Takeaway
For 95% of lyric video work: MP4 H.264 1080p. For Spotify Canvas: MP4 with 8MB cap. For sync delivery: MP4 or ProRes if requested. For transparency: ProRes 4444 MOV. For everything else: MP4.
Try Epitrite free — every export preset configured for the right platform automatically.