Image Caption
LLM-assisted screenshot captioning & renaming
Image Caption
A Python tool that uses vision models through Ollama to automatically generate captions or rename images based on their content. Perfect for organizing screenshots and images with AI-powered descriptions.
Features
Caption Generation: Automatically generates detailed text captions for images
Smart Renaming: Renames images with descriptive filenames based on their content
Screenshot Focus: Specifically designed to process screenshots, identifying applications, UI elements, and visible text
Error Handling: Robust retry mechanism with error logging
Flexible Configuration: Support for custom Ollama hosts and model selection
Batch Processing: Processes all matching images in a directory automatically
Last updated