Stock photo vs video: which should you shoot to sell?
Sooner or later every stock contributor asks it: should I be shooting video too? Video has a reputation for paying more and facing less competition — and that reputation is mostly deserved. But there is a catch almost nobody mentions: video is much harder to get found, and that, not the camera work, is where most contributors lose the money. Here is how photo and video actually compare.
The case for video: it pays more, and fewer people do it
On a per-sale basis, video simply earns more. Adobe Stock pays contributors 35% on video versus 33% on photos and illustrations, and because clips license at much higher prices than single images, the gap in absolute dollars is far wider than those two numbers suggest. Shutterstock goes further and tracks your earnings level for video separately from images — so building a video portfolio is its own ladder, not diluted by your photo downloads.
Just as important: far fewer contributors shoot video than stills. The photo libraries are saturated; the footage libraries are comparatively thin. Less competition for the same buyer demand is exactly the kind of gap a serious contributor wants to exploit.
The case for sticking with photos
Video is not a free lunch. A photo is one decisive moment; a clip has to hold up for every frame of 5 to 60 seconds, with stable exposure, clean motion, and no distracting artifacts. Production is slower, files are heavier, and the technical bar is higher — Adobe requires at least 1080 pixels on both sides and clips between 5 and 60 seconds. If your strength is volume and fast turnaround, photos still let you build a large portfolio quickly.
The hidden difference: video is harder to get found
Here is the part that decides whether video is actually worth it for you. A photo's keywords describe a single frozen frame. A clip's keywords have to describe motion — and motion is exactly what you cannot see in any one frame. The most valuable, most-searched video terms are about movement and technique: zoom, panning, tracking shot, aerial view, time-lapse, slow motion. Adobe explicitly tells contributors to include those camera techniques in video metadata, because buyers search for them directly.
This is where most video keywording quietly fails. Contributors tag a clip the way they would tag a thumbnail — subject, setting, mood — and leave out the motion entirely. The clip is technically well-described and still invisible to anyone searching "aerial drone shot" or "slow motion."
How video metadata actually differs
- Same keyword rules, higher stakes.The first 10 keywords still carry the most search weight, your title's words should still appear in that top 10, and the cap is still 49 — see our guide to keyword order.
- Movement is a keyword category of its own. Zoom in, zoom out, panning, tilt, tracking shot, aerial view — add the technique when it is clearly there, and skip it when it is ambiguous (a wrong movement keyword is worse than none).
- Don't keyword the gear or the format. Camera specs and trademarks (a brand name, a camera model) do not belong in your titles or keywords.
The workflow tax — and how to remove it
The real reason contributors avoid video is not the shooting. It is the keywording: scrubbing through a clip, working out what moved, and writing a motion-aware keyword list for every file is slow, dull work — and it is the work that decides whether the clip ever sells.
That is the exact problem PixTagger was built around. Instead of guessing from one frame, it reads three frames in time order from every clip (start, middle, end), compares them to infer how the camera moved, and merges the results into one keyword list — already ordered strongest-first and capped at 49. The motion gets captured, and the dull part disappears.
The verdict
Ready to tag footage properly? See the stock video keyword generator — multi-frame analysis that catches the camera moves single-frame tools miss. Start free with 15 files on the house.