KEMBAR78
GitHub - TurixAI/TuriX-CUA at mac_mcp
Skip to content

TurixAI/TuriX-CUA

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TuriX logo

TuriX · Desktop Actions, Driven by AI

Talk to your computer, watch it work.

📞 Contact & Community

Join our Discord community for support, discussions, and updates:

Join our Discord

Or contact us with email: contact@turix.ai

TuriX lets your powerful AI models take real, hands‑on actions directly on your desktop. It ships with a state‑of‑the‑art computer‑use agent (passes > 68 % of our internal OSWorld‑style test set) yet stays 100 % open‑source and cost‑free for personal & research use.

Prefer your own model? Change in config.json and go.

Table of Contents


🖼️ Demos

MacOS Demo

Book a flight, hotel and uber.

TuriX macOS demo - booking

Search iPhone price, create Pages document, and send to contact

TuriX macOS demo - iPhone price search and document sharing

Generate a bar-chart in the numbers file sent by boss in discord and insert it to the right place of my powerpoint, and reply my boss.

TuriX macOS demo - excel graph to powerpoint

Windows Demo

Search video content in youtube and like it

TuriX Windows demo - video search and sharing

MCP with Claude Demo

Claude search for AI news, and call TuriX with MCP, write down the research result to a pages document and send it to contact

TuriX MCP demo - news search and sharing


✨ Key Features

Capability What it means
SOTA default model Outperforms previous open‑source agents (e.g. UI‑TARS) on success rate and speed on Mac
No app‑specific APIs If a human can click it, TuriX can too—WhatsApp, Excel, Outlook, in‑house tools…
Hot‑swappable "brains" Replace the VLM policy without touching code (config.json)
MCP‑ready Hook up Claude for Desktop or any agent via the Model Context Protocol (MCP)

📊 Model Performance

Our agent achieves state-of-the-art performance on desktop automation tasks:

TuriX performance

For more details, check our report.

🚀 Quick‑Start (macOS 15)

We never collect data—install, grant permissions, and hack away.

0. Windows Users: Switch to the windows branch for Windows-specific setup and installation instructions.

git checkout windows

1. Download the App

For easier usage, download the app

Or follow the manual setup below:

2. Create a Python 3.12 Environment

Firstly Clone the repository and run:

conda create -n turix_env python=3.12
conda activate turix_env        # requires conda ≥ 22.9
pip install -r requirements.txt

3. Grant macOS Permissions

3.1 Accessibility

  1. Open System Settings â–¸ Privacy & Security â–¸ Accessibility
  2. Click +, then add Terminal and Visual Studio Code ANY IDE you use
  3. If the agent still fails, also add /usr/bin/python3

3.2 Safari Automation

  1. Safari ▸ Settings ▸ Advanced → enable Show features for web developers
  2. In the new Develop menu, enable
    • Allow Remote Automation
    • Allow JavaScript from Apple Events
Trigger the Permission Dialogs (run once per shell)
# macOS Terminal
osascript -e 'tell application "Safari" \
to do JavaScript "alert(\"Triggering accessibility request\")" in document 1'

# VS Code integrated terminal (repeat to grant VS Code)
osascript -e 'tell application "Safari" \
to do JavaScript "alert(\"Triggering accessibility request\")" in document 1'

Click "Allow" on every dialog so the agent can drive Safari.

4. Configure & Run

4.1 Edit Task Configuration

Important

Task Configuration is Critical: The quality of your task instructions directly impacts success rate. Clear, specific prompts lead to better automation results.

Edit task in examples/config.json:

{
    "agent": {
         "task": "open system settings, switch to Dark Mode"
    }
}

4.2 Edit API Configuration

Get API now with $20 credit from our official web page. Login to our website and the key is at the bottom.

Edit API in examples/config.json:

"llm": {
      "provider": "turix",
      "api_key": "YOUR_API_KEY",
      "base_url": "YOUR_BASE_URL"
   }

4.3 Configure Custom Models (Optional)

If you want to use other models not defined by the build_llm function in the main.py, you need to first define it, then setup the config.

main.py:

if provider == "name_you_want":
        return ChatOpenAI(
            model="gpt-4.1-mini", api_key=api_key, temperature=0.3
        )

Switch between ChatOpenAI, ChatGoogleGenerativeAI and ChatAnthropic base on your llm. Also change the model name.

4.4 Start the Agent

python examples/main.py

Enjoy hands‑free computing 🎉

5. MCP Support

TuriX support Model Context Protocol. You can use Claude for Desktop to call TuriX as a MCP Server with the following configuration(assume you have turix_env conda env):

{
  "mcpServers": {
    "mcp_agent": {
      "command": "/ABSOLUTE/PATH/TO/YOUR/CONDA/ENV/bin/python",
      "args": [
        "/ABSOLUTE/PATH/TO/FOLDER/TuriX-CUA/src/mcp/mcp_server.py"
      ],
      "env": {
        "LLM_PROVIDER": "turix",
        "LLM_MODEL":   "turix-model",
        "LLM_TEMP":    "0",
        "OPENAI_API_KEY":  "YOUR_KEY_HERE",
        "OPENAI_API_BASE": "YOUR_BASE_URL",
        "MCP_TIMEOUT_SEC": "600"
      }
    }
  }
}

You can also setup your llm model by change the env in the json file. MCP_TIMEOUT_SEC is the time limit, if the task running time larger than the setting value, the task will be terminated.

If you wanna terminate the agent while running, press cmd+shift+2.

🤝 Contributing

We welcome contributions! Please read our Contributing Guide to get started.

Quick links:

For bug reports and feature requests, please open an issue.

🗺️ Roadmap

Quarter Feature Description
2025 Q3 âś… Windows Support Cross-platform compatibility bringing TuriX automation to Windows environments (Now Available)
2025 Q3 âś… Enhanced MCP Integration Deeper Model Context Protocol support for seamless third-party agent connectivity (Now Available)
2025 Q4 âś… Next-Gen AI Model Significantly improved reasoning and task execution capabilities
2025 Q4 Planner Understands user intent and makes step-by-step plans to complete tasks
2025 Q4 Workflow Automation Record, edit, and replay complex multi-step automation sequences
2026 Q1 Offline Model Option Fully local inference for maximum privacy and zero API dependency
2026 Q1 Persistent Memory Learn user preferences and maintain task history across sessions
2026 Q2 Learning by Demonstration Train the agent by showing it your preferred methods and workflows
2026 Q2 Windows-Optimized Model Native Windows model architecture for superior performance on Microsoft platforms