[arXiv Paper] [Project Page] [Github Repo] [Hugging Face Model]
This demo is powered by Gradio and uses OmniParserv2 to generate Set-of-Mark prompts.
The demo supports three modes: