Image analysis

1 Replies, 19 Views

I'm experimenting with Hermes, and for some of my projects I need a detailed image analysis AND proper tagging.

It turned out, that it's VERY difficult to find an LLM which indeed understands what's pictured.

Post your experience, ideas and solutions here.
To my great surprise, only Claude and Gemini were able to determine what the girl is wearing on this image: https://www.likera.com/forum/mybb/Thread...1#pid86451

Neither ChatGPT, nor Grok, nor Gemma, nor Qwen, nor Mistral were able to understand the concept of a "single-glove"!

From the small local models, only Qwen was close. From small local models, only Qwen can describe bondage scenes quite detailed.

So far my recommendation for local "image description" models - https://ollama.com/lukey03/qwen3.5-9b-ab...ted-vision (it's uncensored)
(This post was last modified: 6 hours ago by Like Ra.)

Possibly Related Threads…
Thread Author Replies Views Last Post
  Is this [image] AI generated? FireDesire 17 2,952 16 Jan 2026, 13:08
Last Post: krypton85
  Text to image tutorial Bound Whore 6 2,327 23 Feb 2025, 20:18
Last Post: theo
  General Image AI thread Like Ra 10 2,987 15 May 2024, 14:41
Last Post: RedCattyLatex