An MCP server that lets an LLM take full control of your device by providing a screen automation toolkit for controlling and interacting with graphical user interfaces.

Mtehabsim/ScreenPilot

MseeP.ai Security Assessment Badge

ScreenPilot

An MCP server that lets an LLM take full control of your device by providing a screen automation toolkit for controlling and interacting with graphical user interfaces. Useful for automation, education, and fun.

Main Features

  • 📷 Screen capture and analysis
  • 🖱️ Mouse control (clicking, positioning)
  • ⌨️ Keyboard input (typing, key presses, hotkeys)

Watch the demo:

Screen.Pilot.mp4

Installation

  1. Install Python 3.12.
  2. Clone the repository:
    git clone https://github.com/Mtehabsim/ScreenPilot.git
  3. Create a virtual environment:
    python -m venv venv
  4. Activate the environment (Windows command shown):
    venv\Scripts\activate
  5. Install the required packages:
    pip install -r requirements.txt
  6. Open Claude AI Desktop.
  7. Go to File → Settings → Developer → Edit Config.
  8. Open the config file and paste the following:
    {
        "mcpServers": {
            "device-controll": {
                "command": "pathToEnv\\venv\\Scripts\\python.exe",
                "args": [
                    "pathToProject\\ScreenPilot\\main.py"
                ]
            }
        }
    }
  9. Replace the placeholder paths, keeping each backslash doubled as JSON requires:
    • "pathToEnv\\venv\\Scripts\\python.exe" → the full path to the python.exe inside your virtual environment
    • "pathToProject\\ScreenPilot\\main.py" → the full path to your main.py file
  10. Save the config file.
  11. In Claude AI Desktop, go to File → Exit to quit the application completely.
  12. Reopen Claude AI Desktop. ScreenPilot is now available.
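
Getting the doubled backslashes right by hand is error-prone, so the config entry can also be generated. The following is a minimal sketch, not part of ScreenPilot: the helper name `build_config` and the example paths are assumptions for illustration, and only the JSON structure and the `device-controll` key come from the steps above.

```python
import json
from pathlib import PureWindowsPath


def build_config(env_dir: str, project_dir: str) -> dict:
    """Build the mcpServers entry from a venv folder and the project folder.

    Hypothetical helper: it only mirrors the JSON structure shown above.
    """
    # PureWindowsPath joins with backslashes regardless of the host OS.
    python_exe = PureWindowsPath(env_dir) / "Scripts" / "python.exe"
    main_py = PureWindowsPath(project_dir) / "main.py"
    return {
        "mcpServers": {
            "device-controll": {
                "command": str(python_exe),
                "args": [str(main_py)],
            }
        }
    }


if __name__ == "__main__":
    # Example locations only; substitute your own paths.
    config = build_config(
        r"C:\Users\me\ScreenPilot\venv",
        r"C:\Users\me\ScreenPilot",
    )
    # json.dumps escapes each backslash, producing text that is safe to paste.
    print(json.dumps(config, indent=4))
```

Paste the printed text into the config file in place of the template. If the file already contains other servers, merge the `device-controll` entry into the existing `mcpServers` object instead of overwriting it.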

Available Tools

  • Screen Capture: Take screenshots and get screen information
  • Mouse Control: Move the mouse and perform clicks
  • Keyboard Actions: Type text, press keys, and use hotkey combinations
  • Scrolling: Scroll in different directions and to specific positions
  • Element Detection: Check if elements exist on screen and wait for them to appear
  • Action Sequences: Perform multiple actions in sequence
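
To make the idea of an action sequence concrete, here is a minimal sketch of running a list of steps in order. It does not reproduce ScreenPilot's actual tool names or parameters: `run_sequence`, the `click` and `type_text` handlers, and the step format are all hypothetical, and the handlers are stubs that only record what they were asked to do instead of touching the mouse or keyboard.

```python
from typing import Any, Callable

Action = dict[str, Any]


def run_sequence(
    actions: list[Action],
    handlers: dict[str, Callable[..., Any]],
) -> list[Any]:
    """Run each step in order and collect the handler results.

    Each step is a dict with an "action" name plus keyword arguments
    (a hypothetical format chosen for this sketch).
    """
    results = []
    for step in actions:
        params = dict(step)  # copy so the caller's dict is left untouched
        name = params.pop("action")
        if name not in handlers:
            raise ValueError(f"unknown action: {name}")
        results.append(handlers[name](**params))
    return results


# Stub handlers: they record calls instead of driving real input devices.
log: list[str] = []


def click(x: int, y: int) -> str:
    log.append(f"click {x},{y}")
    return "clicked"


def type_text(text: str) -> str:
    log.append(f"type {text}")
    return "typed"


HANDLERS = {"click": click, "type_text": type_text}

if __name__ == "__main__":
    steps = [
        {"action": "click", "x": 10, "y": 20},
        {"action": "type_text", "text": "hello"},
    ]
    print(run_sequence(steps, HANDLERS))
    print(log)
```

Unknown action names raise an error before any later step runs, so a malformed sequence stops at the first bad step.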

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
