Skip to content

AAI ProtocolAgent App Interface

From GUI to AAI: Let Agents directly invoke application capabilities

What is AAI?

AAI (Agent App Interface) is an open protocol that enables AI Agents to directly invoke application capabilities -- bypassing the slow, fragile approach of "watching screens and simulating clicks."

Traditional:  Agent -> [Screenshot] -> [OCR] -> [Click] -> GUI -> App  (seconds)
AAI:          Agent -> [MCP] -> AAI Gateway -> [IPC/API] -> App         (milliseconds)

Applications provide two independent interfaces:

  • GUI -- for humans (visual, intuitive)
  • AAI -- for Agents (structured, programmable, parallel)

Both access the same core business logic. Neither interferes with the other.

How It Works

  1. Apps describe their capabilities in aai.json (placed in ~/.aai/<appId>/aai.json)
  2. AAI Gateway discovers and loads these descriptors
  3. Agents connect to the Gateway via standard MCP (stdio)
  4. Gateway translates MCP requests into platform-native automation calls

Quick Example

json
{
  "mcpServers": {
    "aai": {
      "command": "aai-gateway",
      "args": ["--mcp"]
    }
  }
}

Then ask your Agent:

"Send an email to alice@example.com with subject 'Hello' using Mail"

The Agent calls the Gateway, which executes the operation via native IPC in milliseconds.

Released under the Apache 2.0 License.