AutoControl
AutoControl is a cross-platform Python GUI automation framework providing mouse control, keyboard input, image recognition, screen capture, action scripting, and report generation – all through a unified API that works on Windows, macOS, and Linux (X11).
Note
AutoControl supports Linux Wayland via CLI bridges (wtype + ydotool + grim). See Running AutoControl in CI / Cloud for compositor and installation notes. Set JE_AUTOCONTROL_LINUX_DISPLAY_SERVER=x11 to force the XWayland fallback.
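As a minimal sketch, the environment variable named in the note could be set from Python before the library is loaded. Two things here are assumptions rather than documented behavior: that the variable is read when the package picks its Linux backend (so it has to be set before the import), and that the package is imported as je_auto_control. Check the Installation page for the actual import name.

```python
import os

# Force the XWayland fallback described in the note above.
# Assumption: the variable is consulted when the Linux backend is selected,
# so it is set before the library is imported.
os.environ["JE_AUTOCONTROL_LINUX_DISPLAY_SERVER"] = "x11"

# Assumed import name; uncomment in an environment where the package is installed.
# import je_auto_control
```

Exporting the variable in the shell or in the CI job definition has the same effect and avoids depending on import order.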
Getting Started
User Guide (English)
- English Documentation
- Installation
- Mouse Control
- Keyboard Control
- Screen Operations
- Image Recognition
- Recording & Playback
- Keywords & Executor
- Report Generation
- Callback Executor
- Scheduler
- Socket Server (Remote API)
- MCP Server (Use AutoControl from Claude)
- Critical Exit
- Command-Line Interface
- Project Management
- New Features (2026-04)
- New Features (2026-05)
- New Features (2026-06) — QA Layer
- OCR backends
- Observability — metrics and traces
- Operations & Admin Layer
- USB Passthrough — Phase 2 Design
- USB Passthrough — Phase 2e Security Review Checklist
- USB Passthrough — Operator Guide