OS Automation MCP Servers
Servers for controlling the desktop operating system: screenshots, window management, mouse/keyboard input injection, and system-level automation.
4 of 4 shown
sbuysse/gnome-desktop-mcp
github.comGNOME desktop automation for AI agents. 30 tools via D-Bus: screenshots, window management, mouse/keyboard injection, clipboard, workspaces, and system notifications. Works on any GNOME 45–49 Linux desktop.
dimpagk92/cellar
github.comHybrid computer-use runtime. Fuses accessibility tree + Chrome DevTools Protocol + vision into structured context with per-element confidence. 4 MCP tools (see/act/think/perceive). Continuous awareness engine (Cortex) with freshness + side-effect detection. Works offline with Ollama + local models.
Harusame64/desktop-touch-mcp
github.comWindows desktop automation for LLM agents with entity-based actions instead of coordinate-only clicking. Uses UIA, CDP, screenshots, keyboard/mouse/clipboard, and terminal control, plus entity leases, verified delivery, causal context, and interaction memory to reduce silent UI automation failures.
tinqiao-oss/clawtouch-mcp
github.comPhysical USB HID keyboard/mouse control via a Raspberry Pi Pico 2 running open-source firmware. Exposes move, click, drag, type, key combos, and scroll as MCP tools for any MCP client. Genuine physical HID input on the standard driver path, with a --mock mode for hardware-free trials. pip install clawtouch-mcp
Attribution
Data sourced from punkpeye/awesome-mcp-servers (MIT). Synced every 24 hours.