xAI Launches /goal in Grok Build, Adding Long-Running Autonomous Execution With Built-In Verification for Multi-Step Coding Tasks

xAI Launches /goal in Grok Build, Adding Long-Running Autonomous Execution With Built-In Verification for Multi-Step Coding Tasks


xAI shipped a new mode called /goal inside Grok Build, its terminal coding agent. The feature targets long-running, autonomous task execution. You hand the agent a larger implementation task, then step back.

Most coding sessions require back-and-forth execution and verification. You prompt, the agent acts, and you verify each step. /goal changes that loop. The agent keeps working until a task is completed and verified. Verification can mean reviewing code, inspecting webpages, or executing scripts.

The mode packages that autonomy behind one command and a visible progress checklist.

TL;DR

  • /goal runs long, autonomous tasks inside Grok Build’s terminal agent.
  • It plans an approach, builds a checklist, executes, and verifies until complete.
  • Verification can review code, inspect webpages, or execute scripts.
  • status, pause, resume, and clear steer the live run.
  • Access needs a SuperGrok or X Premium Plus subscription.

What is /goal

/goal is a mode within Grok Build, not a separate product. Grok Build is xAI’s coding agent and CLI for software engineering. A CLI is a command-line tool that runs in your terminal. It works against your local codebase, reading files and running commands.

Grok Build already ships several agent features. It has a plan mode that blocks edits until you approve a plan. It delegates larger work to specialized subagents that run in parallel. It reads your AGENTS.md, plugins, hooks, skills, and MCP servers out of the box. MCP is a protocol that connects agents to external tools and data.

Access requires a SuperGrok or X Premium Plus subscription. You install the CLI with one command:

curl -fsSL https://x.ai/cli/install.sh | bash

You then sign in with your account and invoke /goal. The mode is built for handoff. You describe one objective, and the agent owns the multi-step work.

Why Long-Running Execution Matters

An agent is given a goal, not just a single prompt. It plans steps, executes them, observes results, and replans when reality diverges. That is the observe–plan–act loop. The trade-off is observability: long runs produce many actions you must monitor. /goal answers that with a progress checklist and steering commands.

How /goal Works

You set a goal in a single line. You type it inside the Grok Build session, not your system shell. For example:

/goal Migrate the auth module to the new API

Grok Build plans an approach for that objective. It breaks the work into a progress checklist. It then starts executing the checklist items one by one.

You are not locked out during the run. You can keep sharing additional instructions as it works. When the goal is finished, the panel flips to Complete. Every checklist item then shows as checked.

The Verification Step

The detail worth noting is verification. /goal does not stop at writing code. It continues until the task is completed and verified.

Verification, per xAI, can take three forms. The agent may review the code it produced. It may inspect webpages to confirm behavior. It may execute scripts to test the result.

This matters for autonomous runs. An agent that only edits files may report success before the change works. Building verification into the run means the agent tests its own output before it finishes.

Interactive Demo




Source link