Image analysis

4 Replies, 230 Views

I'm experimenting with Hermes (https://www.likera.com/forum/mybb/showth...p?tid=4606), and for some of my projects I need a detailed image analysis AND proper tagging.

It turned out, that it's VERY difficult to find an LLM which indeed understands what's pictured.

Post your experience, ideas and solutions here.
(This post was last modified: 26 May 2026, 17:18 by Like Ra.)
To my great surprise, only Claude and Gemini were able to determine what the girl is wearing on this image: https://www.likera.com/forum/mybb/Thread...1#pid86451

Neither ChatGPT, nor Grok, nor Gemma, nor Qwen, nor Mistral were able to understand the concept of a "single-glove"!

From the small local models, only Qwen was close. From small local models, only Qwen can describe bondage scenes quite detailed.

So far my recommendation for local "image description" models - https://ollama.com/lukey03/qwen3.5-9b-ab...ted-vision (it's uncensored)
(This post was last modified: 23 May 2026, 01:06 by Like Ra.)
(23 May 2026, 01:04 )Like Ra Wrote: So far my recommendation for local "image description" models - https://ollama.com/lukey03/qwen3.5-9b-ab...ted-vision (it's uncensored)

I've been using https://ollama.com/sorc/qwen3.5-instruct-heretic for image description and Stable-Diffusion img->prompt generation . I'll give lukey's a try. Thanks!
It's still a 9b Qwen3.5 version, so should be very similar. I also tried https://ollama.com/huihui_ai/Qwen3.6-abliterated:27b with only 30 layers offloaded to the GPU.
While slow, it's only a bit better, than Qwen3.5 9B, yet can go completely crazy.

https://ollama.com/huihui_ai/qwen3-vl-abliterated 8b is only 6.1GB, and should understand videos, but it's a bit less precise for images.
(This post was last modified: 26 May 2026, 00:12 by Like Ra.)
A hack: ask the model to check what it wrote about the image with the image itself. THAT result is much more precise.

Possibly Related Threads…
Thread Author Replies Views Last Post
  Is this [image] AI generated? FireDesire 17 2,967 16 Jan 2026, 13:08
Last Post: krypton85
  Text to image tutorial Bound Whore 6 2,336 23 Feb 2025, 20:18
Last Post: theo
  General Image AI thread Like Ra 10 3,005 15 May 2024, 14:41
Last Post: RedCattyLatex