MCPMacControl

MCP server for Mouse, Keyboard, Screenshot, Window & Shell Control on macOS.

Enables AI to control Mac UI: capture screenshots, move/click mouse, type text, press keyboard shortcuts, scroll/resize/minimize windows, and run interactive shell sessions.

Install

Download (recommended)

Download MCPMacControl.app.zip from the latest release, unzip, and move to /Applications/.

Release builds are signed and notarized with an Apple Developer ID certificate, so macOS permissions persist across updates and no Gatekeeper warnings appear.

Build from source

git clone https://github.com/sstraus/mcpmaccontrol
cd mcpmaccontrol
make build
make install

Configure Claude Code

Add to ~/.claude.json:

{
  "mcpServers": {
    "mac-control": {
      "command": "/Applications/MCPMacControl.app/Contents/MacOS/mcpmaccontrol"
    }
  }
}

Then ask Claude: "Take a screenshot of Safari and click the search box"

Why an `.app` bundle?

Unlike most MCP servers, MCPMacControl needs macOS Screen Recording and Accessibility permissions. macOS grants these permissions per-application, not per-binary. When a plain binary runs from a terminal, macOS attributes the permission to the terminal app (iTerm, Terminal.app), not to the binary itself. This means:

Permissions don't persist when the binary is rebuilt
Every terminal app that launches the binary needs separate permission grants
There's no way for users to manage the binary's permissions in System Settings

Wrapping the binary in a signed .app bundle gives it its own identity in macOS privacy settings. The binary still communicates via stdio like any MCP server - the .app bundle is just how macOS identifies it for permission management.

That's why the command path is /Applications/MCPMacControl.app/Contents/MacOS/mcpmaccontrol instead of /usr/local/bin/mcpmaccontrol.

Permissions

On first launch, macOS will prompt for two permissions:

Permission	Purpose
Screen Recording	Screenshot capture
Accessibility	Mouse/keyboard control

Grant both in System Settings > Privacy & Security. The app registers itself automatically.

Features

Menu bar icon shows when MCP server is running
Single binary in a signed .app bundle
Auto-exit when Claude disconnects
Region capture to save tokens

Tools

Tool	Description
`help`	Built-in documentation - call first!
`list_windows`	Find windows by app name
`capture_window`	Screenshot window or region with click coordinates
`capture_screen`	Screenshot entire screen
`do`	Execute actions: click, type, key, scroll, focus, minimize, etc.
`shell`	PTY shell sessions: spawn, send_input, get_snapshot, resize, close, list
`processes`	List running processes with filtering for debugging

Built-in Help

The AI calls help() to learn the API:

help() → Overview and workflow
help("actions") → All action types for do()
help("shell") → Shell/PTY documentation
help("examples") → Usage examples

Workflow

1. help()                           → Learn the API
2. list_windows("Safari")           → Find window
3. capture_window("Safari")         → Get screenshot + bounds
4. [AI analyzes image]              → Find button at (400, 50)
5. do([{type:"click", app:"Safari", x:400, y:50}])

Region Capture (saves tokens!)

Capture only a portion of a window for smaller images:

capture_window("Safari", region_x: 0, region_y: 0, region_width: 400, region_height: 300)

This captures only the top-left 400x300 area. Click coordinates:

click_x = region_x + x_in_image
click_y = region_y + y_in_image

The `do` Tool

Execute one or more actions in sequence:

do({"actions": [
  {"type": "click", "app": "Safari", "x": 400, "y": 50},
  {"type": "type", "text": "anthropic.com"},
  {"type": "key", "key": "enter"}
]})

Action Types

Action	Required	Optional	Description
`click`	app, x, y	button, double	Click at position
`move`	app, x, y		Move mouse
`type`	text		Type text string
`key`	key	modifiers	Press key (supports `"cmd+shift+g"` compound syntax)
`scroll`	app, x, y, delta_y/delta_x		Scroll in window
`wait`	ms		Pause (milliseconds)
`focus`	app		Bring window to front
`minimize`	app		Minimize to dock
`restore`	app		Restore from dock
`close`	app		Close window
`resize`	app, width, height		Resize window

Examples

Click:

{"type": "click", "app": "Safari", "x": 100, "y": 50}
{"type": "click", "app": "Finder", "x": 200, "y": 100, "button": "right"}
{"type": "click", "app": "Finder", "x": 200, "y": 100, "double": true}

Keyboard:

{"type": "type", "text": "Hello World"}
{"type": "key", "key": "enter"}
{"type": "key", "key": "cmd+v"}
{"type": "key", "key": "cmd+shift+g"}
{"type": "key", "key": "v", "modifiers": ["cmd"]}

Scroll:

{"type": "scroll", "app": "Safari", "x": 400, "y": 300, "delta_y": -100}

Window Control:

{"type": "focus", "app": "Safari"}
{"type": "minimize", "app": "Finder"}
{"type": "restore", "app": "Finder"}
{"type": "resize", "app": "Safari", "width": 1024, "height": 768}
{"type": "close", "app": "TextEdit"}

Shell Sessions

Run interactive terminal sessions with full PTY and terminal emulation (supports TUI apps like vim, htop):

1. shell(action: "spawn")                                          → Get session ID
2. shell(action: "send_input", session_id: "ID", input: "ls")     → Send text
3. shell(action: "send_input", session_id: "ID", special_key: "enter")
4. shell(action: "get_snapshot", session_id: "ID", format: "ansi") → Get screen
5. shell(action: "close", session_id: "ID")                        → Cleanup

Action	Parameters	Description
`spawn`	command?, cwd?, cols?, rows?	Start session (default: /bin/bash)
`send_input`	session_id, input?, special_key?, wait_ms?	Send text or keys
`get_snapshot`	session_id, format?	Get screen state (text or ansi)
`resize`	session_id, cols?, rows?	Resize terminal
`list`		List active sessions
`close`	session_id	Close session

Headless Mode

Run without menu bar (for CI/SSH):

{
  "mcpServers": {
    "mac-control": {
      "command": "/Applications/MCPMacControl.app/Contents/MacOS/mcpmaccontrol",
      "env": {
        "MCPMACCONTROL_HEADLESS": "1"
      }
    }
  }
}

Image Optimization

Screenshots are optimized for AI vision:

WebP lossy format (quality 15 — tested on complex UIs, fully readable at ~3x smaller than q75)
Scaled to window point dimensions (coordinates match clicks)
Use region capture to further reduce size and tokens

Development

make build          # Build signed .app bundle
make test           # Run tests
make install        # Install to /Applications
make clean          # Remove build artifacts
make verify-sign    # Check code signature

To sign with your own Apple Developer certificate:

SIGN_IDENTITY="Developer ID Application: YOUR NAME (TEAMID)" make build

Requirements

macOS 12+
Go 1.24+ (for building)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github/workflows		.github/workflows
bundle		bundle
internal		internal
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
MANUAL_TEST_SHELL.md		MANUAL_TEST_SHELL.md
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MCPMacControl

Install

Download (recommended)

Build from source

Configure Claude Code

Why an `.app` bundle?

Permissions

Features

Tools

Built-in Help

Workflow

Region Capture (saves tokens!)

The `do` Tool

Action Types

Examples

Shell Sessions

Headless Mode

Image Optimization

Development

Requirements

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

License

sstraus/McpMacControl

Folders and files

Latest commit

History

Repository files navigation

MCPMacControl

Install

Download (recommended)

Build from source

Configure Claude Code

Why an .app bundle?

Permissions

Features

Tools

Built-in Help

Workflow

Region Capture (saves tokens!)

The do Tool

Action Types

Examples

Shell Sessions

Headless Mode

Image Optimization

Development

Requirements

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Why an `.app` bundle?

The `do` Tool

Packages