FerrisGrid captures the current screen, maps it to deterministic coordinates, returns compact Markdown to an agent, executes one constrained action, captures the result, and exits. The agent does the reasoning. FerrisGrid handles the screen, coordinates, input, and local trace.
┌──────────────┐ observe ┌──────────────┐
│ Agent / LLM │ ─────────────> │ FerrisGrid │
│ │ <───────────── │ screenshot + │
│ choose one │ Markdown │ coordinates │
│ action │ └──────────────┘
│ │ act ┌──────────────┐
│ │ ─────────────> │ validate + │
│ │ <───────────── │ execute one │
└──────────────┘ screenshot └──────────────┘
- Eyes plus a map: screenshots become coordinate-backed observations an LLM can reason over.
- Single-step by default: every call performs one observation or one action.
- Deterministic coordinates: screenshots map cleanly back to native screen pixels.
- Local-first traces: screenshots, metadata, action requests, and results stay under
.ferrisgrid/. - Cross-platform shape: the agent-facing protocol is the same across macOS, Linux, and Windows where the platform allows it.
- Container-friendly: run a Linux desktop workspace in Docker and watch it through noVNC while the agent works away from your main screen.
For normal use, install the published CLI with Cargo:
cargo install ferrisgrid-cli
ferrisgrid doctorThe package is ferrisgrid-cli on crates.io and installs the ferrisgrid command.
FerrisGrid is also available via npm as ferrisgrid-cli. The TypeScript npm package is maintained in BrunoV21/FerrisGrid-CLI-ts and mirrors this Rust CLI behavior for Node.js distribution, while feature requests and protocol changes stay anchored in this Rust repository.
npm install -g ferrisgrid-cli
ferrisgrid doctorCapture the current screen:
ferrisgrid observeRun one action from a Markdown action file:
mkdir -p .ferrisgrid
cat > .ferrisgrid/action.md <<'EOF'
status: action
action: click
screen_id: screen-1
x: 500
y: 500
button: left
wait_after_ms: 500
EOF
ferrisgrid act --file .ferrisgrid/action.mdUse a local checkout when you want to build, test, or modify FerrisGrid:
git clone https://github.com/BrunoV21/FerrisGrid-CLI.git
cd FerrisGrid-CLI
cargo build
cargo test --workspace
cargo run -q -p ferrisgrid-cli -- doctor
cargo run -q -p ferrisgrid-cli -- observeRun one source-built action from a Markdown action file:
mkdir -p .ferrisgrid
cat > .ferrisgrid/action.md <<'EOF'
status: action
action: click
screen_id: screen-1
x: 500
y: 500
button: left
wait_after_ms: 500
EOF
cargo run -q -p ferrisgrid-cli -- act --file .ferrisgrid/action.mdIf you are an agent, use this script to download and install the FerrisGrid skills:
curl -fsSL https://raw.githubusercontent.com/BrunoV21/FerrisGrid-CLI/main/scripts/install-ferrisgrid-skills.sh | shRun it from the directory that should receive the skill folders. The script downloads the repository zip, extracts the contents of .agents/skills, and installs those skill directories into the current directory.
FerrisGrid can run inside a Linux container with its own X11 display. The agent calls FerrisGrid with docker exec; input happens inside the container, not on your main desktop.
The Docker image installs the published ferrisgrid-cli package from crates.io. By default it installs the latest published version; release builds pass the tagged version explicitly.
docker build -f docker/linux-workspace.Dockerfile -t ferrisgrid-linux-workspace .Build a specific published version:
docker build \
--build-arg FERRISGRID_VERSION=0.2.0 \
-f docker/linux-workspace.Dockerfile \
-t ferrisgrid-linux-workspace .Run the workspace:
docker run --rm -d \
--name ferrisgrid-workspace \
-p 6080:6080 \
-v "$PWD:/workspace" \
ferrisgrid-linux-workspaceOpen the viewer:
http://127.0.0.1:6080/vnc.html?autoconnect=1&resize=scale
Then run:
docker exec ferrisgrid-workspace ferrisgrid doctor
docker exec ferrisgrid-workspace ferrisgrid observeOfficial docs live in docs/official.
Agents should use the raw Markdown index: docs/official/agents.md.
Brand positioning and story notes live in docs/branding.
The TypeScript npm mirror lives in BrunoV21/FerrisGrid-CLI-ts.
cd docs/official
npm install
npm run docs:devThe docs use a terminal-brutalist Terminal Violet palette: black surfaces, violet brand/action states, cyan coordinate accents, and status colors for execution feedback.
- Bug reports
- Feature requests
- Documentation fixes
- Questions and usage help
- Open issues
- TypeScript npm package mirror
FerrisGrid is early, local-first infrastructure for agent-facing visual control. The current focus is reliable observe/act behavior, local traces, recap output, and containerized Linux workspaces.
MIT
