Enterprise DNA
NON906/omniparser-autogui-mcp

by Various

Automatic operation of on-screen GUI.

Added 1 June 2026

Overview

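A Python MCP server that automates on-screen GUI operations by parsing what is visible on the screen. It uses OmniParser to detect and label interface elements in a screenshot, and exposes the resulting actions over MCP so that a connected model can operate the GUI. It is aimed at developers who need to script GUI workflows for applications that offer no native automation API.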
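
The parse-then-act idea described above can be illustrated with a minimal sketch. It assumes the parser yields labeled elements with bounding boxes expressed as fractions of the screen; the `Element` type and the `find_element` and `click_point` helpers are hypothetical names invented for this illustration and are not part of this project's API.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass
class Element:
    """Hypothetical parsed UI element (not this project's real data model).

    bbox is (x1, y1, x2, y2), each a fraction of the screen in [0, 1].
    """
    label: str
    bbox: Tuple[float, float, float, float]


def find_element(elements: Sequence[Element], text: str) -> Optional[Element]:
    """Return the first element whose label contains `text`, ignoring case."""
    wanted = text.lower()
    for element in elements:
        if wanted in element.label.lower():
            return element
    return None


def click_point(element: Element, screen_w: int, screen_h: int) -> Tuple[int, int]:
    """Convert the center of a fractional bounding box to pixel coordinates."""
    x1, y1, x2, y2 = element.bbox
    return round((x1 + x2) / 2 * screen_w), round((y1 + y2) / 2 * screen_h)


# Made-up parser output for a 1920x1080 screen.
elements = [
    Element("File", (0.0, 0.0, 0.125, 0.0625)),
    Element("Save", (0.25, 0.5, 0.75, 0.75)),
]
target = find_element(elements, "save")
point = click_point(target, 1920, 1080)
print(point)  # (960, 675)
```

An automation layer would then pass the resulting pixel coordinates to whatever mouse-control library it uses. Keeping the boxes as screen fractions until the last step is a design choice for this sketch only.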

Best for

Developers needing visual-based GUI automation for non-standard or legacy applications

Use cases

  • Automate repetitive desktop workflows
  • Control legacy or inaccessible GUI applications
  • Integrate GUI automation into larger MCP-based systems

Notes

67 stars on GitHub. Last updated 2025-03-02. Licensed MIT.

Pros

  • Open source and extensible in Python
  • Visual parsing approach works across different GUI frameworks
  • Integrates with MCP for model-controlled automation

Cons

  • Small community (67 GitHub stars), so documentation and third-party help are likely to be limited
  • May struggle with dynamic or non-standard UI elements
  • Requires careful setup for specific screen resolutions and layouts
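
One common way to cope with dynamic interfaces is to re-parse the screen until the expected element appears, and to give up after a timeout. The sketch below illustrates that polling pattern only. `wait_for_label` and its `parse_screen` callback are hypothetical names; the callback stands in for whatever function returns the text labels currently detected on screen, and nothing here reflects this project's actual interface.

```python
import time
from typing import Callable, Sequence


def wait_for_label(
    parse_screen: Callable[[], Sequence[str]],
    label: str,
    timeout: float = 5.0,
    interval: float = 0.5,
) -> bool:
    """Poll `parse_screen` until a label containing `label` shows up.

    Returns True once the label is seen, or False if `timeout` seconds
    pass without a match. Matching ignores case.
    """
    wanted = label.lower()
    deadline = time.monotonic() + timeout
    while True:
        if any(wanted in text.lower() for text in parse_screen()):
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


# Simulated screen states: the dialog buttons appear on the third parse.
frames = iter([["Loading..."], ["Loading..."], ["OK", "Cancel"]])
found = wait_for_label(lambda: next(frames), "ok", timeout=2.0, interval=0.01)
print(found)  # True

# A label that never appears times out.
missing = wait_for_label(lambda: ["Loading..."], "ok", timeout=0.05, interval=0.01)
print(missing)  # False
```

Because every poll implies a fresh screenshot and parse, the interval would in practice need tuning against how long a single parse takes.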

Indexed from awesome-mcp-servers-punkpeye and enriched against its public facts.
