> For the complete documentation index, see [llms.txt](https://docs.bdurham.dev/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.bdurham.dev/r-and-d/image-caption.md).

# Image Caption

## Image Caption

A Python tool that uses vision models through Ollama to automatically generate captions or rename images based on their content. Perfect for organizing screenshots and images with AI-powered descriptions.

[View on Github](https://github.com/thebriandurham/image-caption)

### Features

* **Caption Generation**: Automatically generates detailed text captions for images
* **Smart Renaming**: Renames images with descriptive filenames based on their content
* **Screenshot Focus**: Specifically designed to process screenshots, identifying applications, UI elements, and visible text
* **Error Handling**: Robust retry mechanism with error logging
* **Flexible Configuration**: Support for custom Ollama hosts and model selection
* **Batch Processing**: Processes all matching images in a directory automatically