Tinkr
All tools
Designer·Live

AI Alt-Text Generator

Drop an image. Get accessible alt text following WCAG 2.1 — under 125 characters, no "image of" padding, with a longer description when the image warrants it. Useful for blog posts, e-commerce product photos, and design system documentation.

Drop an image here, or click to browse
PNG, JPG, WebP, GIF, AVIF — max 4 MB

Frequently asked questions

Why do images need alt text?
Alt text makes images accessible to screen readers (used by people with vision impairment), provides a fallback when images fail to load, and helps search engines understand image content. WCAG 2.1 considers alt text a legal accessibility requirement on most public websites.
How does this generator work?
Your image is sent to Google's Gemini vision model with a strict system prompt that follows WCAG best practices. The model returns a concise alt text under 125 characters, an optional longer description for complex images, and detects whether the image is decorative (and should use alt='') or contains readable text.
Is my image uploaded or stored?
The image is sent to Google's Gemini API for analysis but is not persisted by Tinkr. We don't log images or alt text. Only an IP-based rate-limit counter is kept. See our privacy policy. If your image contains sensitive PII, generate alt text on private copies first.
What's the maximum file size?
4 MB. Most photos are under that even at high resolution. If your image is larger, downscale it or convert to a more efficient format like AVIF or WebP. The vision model doesn't need pixel-perfect input — a 1024px wide version captures the same content for alt-text purposes.
Why under 125 characters?
Most screen readers stop reading alt text around 125 characters. Anything beyond is silently truncated for users who actually need accessibility, defeating the purpose. If an image needs more, use a longer description (returned separately) in surrounding text or in a `figcaption`.
What is a decorative image?
An image that adds no information beyond visual decoration — abstract patterns, divider lines, repeated icons, ornamental flourishes. These should use `alt=''` (empty string) so screen readers skip them entirely. Don't omit the alt attribute — that lets some screen readers fall back to reading the filename.
Should I always use the AI's suggestion verbatim?
Treat it as a starting point. The model can't know context that isn't visually verifiable — who the people are, what's significant about the moment, brand-specific terminology. Use the 'context' field to give it that information, then edit the output to match your voice and copy guidelines.
Can it transcribe text in screenshots?
Yes — the `containsText` field returns any readable text the vision model detects in the image (signs, labels, screenshot UI text). Useful when you want to provide a text alternative for an image that's primarily textual content. Watermarks and pure logo text are intentionally not transcribed.

Related tools