Image Caption

LLM-assisted screenshot captioning & renaming

A Python tool that uses vision models served through Ollama to generate captions for images or rename them based on their content. It is intended for organizing screenshots and other images with AI-generated descriptions.
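As a rough illustration of what captioning an image through Ollama can look like, the sketch below posts a base64-encoded image to Ollama's `/api/generate` endpoint using only the standard library. This is not the tool's actual code: the function names, the `llava` model choice, and the prompt wording are assumptions made for the example, and the host shown is simply Ollama's default address.

```python
import base64
import json
import urllib.request
from pathlib import Path

DEFAULT_HOST = "http://localhost:11434"  # Ollama's default listen address
DEFAULT_MODEL = "llava"  # assumption: any vision-capable model pulled locally
DEFAULT_PROMPT = (  # assumption: illustrative prompt, not the tool's own
    "Describe this screenshot: the application, UI elements, and visible text."
)


def build_caption_request(image_bytes: bytes,
                          model: str = DEFAULT_MODEL,
                          prompt: str = DEFAULT_PROMPT) -> dict:
    """Build the JSON body for a non-streaming /api/generate call."""
    return {
        "model": model,
        "prompt": prompt,
        # Ollama expects images as a list of base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for one JSON object instead of a stream
    }


def caption_image(path: str,
                  host: str = DEFAULT_HOST,
                  model: str = DEFAULT_MODEL,
                  timeout: float = 120.0) -> str:
    """Send one image to an Ollama server and return the generated caption."""
    payload = build_caption_request(Path(path).read_bytes(), model=model)
    request = urllib.request.Request(
        f"{host.rstrip('/')}/api/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    # With "stream": False the generated text is in the "response" field.
    return body["response"].strip()
```

With an Ollama server running and a vision model pulled, `caption_image("screenshot.png")` would return the model's description as a string. Passing a different `host` or `model` corresponds to the custom host and model selection the tool advertises.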

Features

  • Caption Generation: Automatically generates detailed text captions for images

  • Smart Renaming: Renames images with descriptive filenames based on their content

  • Screenshot Focus: Specifically designed to process screenshots, identifying applications, UI elements, and visible text

  • Error Handling: Retries failed operations and logs errors

  • Flexible Configuration: Support for custom Ollama hosts and model selection

  • Batch Processing: Processes all matching images in a directory automatically
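
The renaming, retry, and batch features listed above could fit together roughly as follows. This is a minimal sketch under stated assumptions, not the project's implementation: `slugify`, `with_retries`, `rename_images`, the extension set, and the six-word filename limit are all hypothetical choices for illustration. The `describe` argument stands in for whatever function asks the vision model for a caption.

```python
import logging
import re
import time
from pathlib import Path
from typing import Callable

# Assumption: the set of extensions treated as "matching images".
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def slugify(caption: str, max_words: int = 6) -> str:
    """Turn a caption into a short, filesystem-friendly name."""
    words = re.findall(r"[a-z0-9]+", caption.lower())
    return "-".join(words[:max_words]) or "untitled"


def with_retries(func: Callable, attempts: int = 3, delay: float = 1.0) -> Callable:
    """Wrap func so failures are logged and retried up to `attempts` times."""
    def wrapper(*args, **kwargs):
        for attempt in range(1, attempts + 1):
            try:
                return func(*args, **kwargs)
            except Exception as exc:
                logging.warning("attempt %d/%d failed: %s", attempt, attempts, exc)
                if attempt == attempts:
                    raise  # give up after the final attempt
                time.sleep(delay)
    return wrapper


def rename_images(directory: Path, describe: Callable[[Path], str]) -> list[Path]:
    """Rename every matching image in `directory` using describe(path)."""
    renamed = []
    # sorted() snapshots the listing, so renamed files are not revisited.
    for path in sorted(directory.iterdir()):
        if not path.is_file() or path.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        stem = slugify(describe(path))
        suffix = path.suffix.lower()
        target = path.with_name(stem + suffix)
        counter = 2
        # Avoid overwriting another file that already has the same name.
        while target.exists() and target != path:
            target = path.with_name(f"{stem}-{counter}{suffix}")
            counter += 1
        path.rename(target)
        renamed.append(target)
    return renamed
```

A caller might combine the pieces as `rename_images(Path("screenshots"), with_retries(my_caption_function))`, where `my_caption_function` is a placeholder for any callable that takes a path and returns a caption. Wrapping only the caption call in the retry helper keeps transient model errors from aborting the whole batch early, while a file that still fails after the last attempt raises and stops the run.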
