ARK Agent CLI

A terminal-based chat application for exploring knowledge graphs through AI agents. Part of the ARK (Adaptive Retriever of Knowledge) project.

What's Included

ARK Agent CLI automatically discovers any knowledge graph placed in the data/ directory and creates a dedicated AI agent for it. Out of the box, it ships with three biomedical knowledge graphs:

| Graph | Description |
| --- | --- |
| PrimeKG | A precision medicine-oriented knowledge graph that provides a holistic view of diseases, drugs, genes, and their relationships. |
| AfriMedKG | A knowledge graph built from the AfriMed-QA pan-African, multi-specialty medical Q&A benchmark. |
| OptimusKG | A modern multimodal knowledge graph for precision medicine with rich metadata. |

You can add your own graphs, biomedical or otherwise, without changing any code. See Adding Your Own Knowledge Graph below.

Getting Started

Prerequisites

Bun >= 1.0
Node.js >= 18
pnpm >= 10

Installation

Clone the repository:
```
git clone https://github.com/mims-harvard/ark-agent-cli.git
cd ark-agent-cli
```
The knowledge graph files (.parquet in data/) are tracked with Git LFS. To download them after cloning, install Git LFS and pull the large files:
```
git lfs install
git lfs pull
```
Install dependencies:
```
pnpm install
```
Configure your API key:
```
cp .env.example .env
```
Open .env and add your API key. The CLI currently uses Anthropic as its LLM provider:
```
ANTHROPIC_API_KEY=your_key_here
```
Start the CLI:
```
pnpm cli
```
You can also compile it into a standalone binary:
```
pnpm build
./build/ark-agent-cli
```

Quick Example

Once the CLI is running, select an agent and ask a question in plain language:

What genes are associated with Alzheimer's disease?

Find the relationship between metformin and breast cancer.

The agent will search the knowledge graph, traverse relationships, and synthesize an answer while citing the specific nodes and edges it used.

Adding Your Own Knowledge Graph

Adding a new graph takes four steps and requires no code changes.

Step 1: Create a folder

Create a new directory inside data/ with a short, lowercase name. The folder name becomes the agent's internal ID:

data/mykg/

Step 2: Write `graph.json`

Create a graph.json file inside your folder with the following fields:

{
  "id": 4,
  "name": "MyGraph",
  "description": "A custom knowledge graph for my research domain.",
  "color": "#e06c75",
  "order": 4
}

Here's what each field does:

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `id` | `number` | Yes | A unique numeric identifier. Pick any number not already used by another graph. |
| `name` | `string` | Yes | The display name shown in the UI when selecting an agent. |
| `description` | `string` | Yes | A short description of the graph. The AI agent uses this to understand when the graph is relevant. |
| `color` | `string` | Yes | A hex color for the agent in the UI (e.g. `"#e06c75"`). |
| `order` | `number` | Yes | Controls display order in the agent list. Lower numbers appear first. |
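
Before starting the CLI, it can help to sanity-check a graph.json against the field table above. The following is a minimal sketch, not the CLI's own validation (which may differ): `validateGraphMetadata` is a hypothetical helper, the list of already-used IDs is made up for illustration, and the 6-digit hex check is an assumption based on the `"#e06c75"` example.

```typescript
// Hypothetical helper: checks a parsed graph.json against the documented fields.
// Returns a list of human-readable problems; an empty list means it looks valid.
function validateGraphMetadata(raw: unknown, usedIds: number[] = []): string[] {
  if (typeof raw !== "object" || raw === null || Array.isArray(raw)) {
    return ["graph.json must contain a JSON object"];
  }
  const meta = raw as Record<string, unknown>;
  const problems: string[] = [];

  // `id` and `order` are documented as numbers.
  for (const field of ["id", "order"]) {
    const value = meta[field];
    if (typeof value !== "number" || !Number.isFinite(value)) {
      problems.push(`"${field}" must be a number`);
    }
  }

  // `name`, `description`, and `color` are documented as strings.
  for (const field of ["name", "description", "color"]) {
    const value = meta[field];
    if (typeof value !== "string" || value.trim() === "") {
      problems.push(`"${field}" must be a non-empty string`);
    }
  }

  // Assumption: colors are 6-digit hex values like the "#e06c75" example.
  const color = meta["color"];
  if (typeof color === "string" && color.trim() !== "" && !/^#[0-9a-fA-F]{6}$/.test(color)) {
    problems.push(`"color" must be a hex color such as "#e06c75"`);
  }

  // `id` must not collide with another graph's id.
  const id = meta["id"];
  if (typeof id === "number" && usedIds.includes(id)) {
    problems.push(`"id" ${id} is already used by another graph`);
  }
  return problems;
}

const exampleMetadata: unknown = JSON.parse(`{
  "id": 4,
  "name": "MyGraph",
  "description": "A custom knowledge graph for my research domain.",
  "color": "#e06c75",
  "order": 4
}`);

// The used-ID list [1, 2, 3] is illustrative only.
console.log(validateGraphMetadata(exampleMetadata, [1, 2, 3]));
```

An empty array means every required field is present with the expected type. Pass the IDs of the graphs already in your data/ directory as the second argument to catch collisions.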

Step 3: Prepare your parquet files

Place two Apache Parquet files in the same folder:

nodes.parquet (one row per node):

| Column | Type | Description |
| --- | --- | --- |
| `id` | string | Unique node identifier |
| `name` | string | Human-readable name |
| `type` | string | Node type or category (e.g. `"gene"`, `"disease"`, `"drug"`) |
| `properties` | string | A JSON string with any additional properties |

edges.parquet (one row per edge):

| Column | Type | Description |
| --- | --- | --- |
| `from` | string | Source node ID |
| `to` | string | Target node ID |
| `type` | string | Relationship type (e.g. `"interacts_with"`, `"treats"`) |
| `properties` | string | A JSON string with any additional properties |
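
The two schemas above can be expressed as row types and checked for consistency before you write the parquet files. This is a rough sketch under stated assumptions: the type names `NodeRow` and `EdgeRow` and the helpers `parseProperties` and `checkGraph` are hypothetical, and it assumes `properties` holds a JSON object (the README only says "a JSON string"). Writing the rows to parquet is left to whatever tool you already use.

```typescript
// Row shapes mirroring the documented columns; the type names are illustrative.
interface NodeRow { id: string; name: string; type: string; properties: string; }
interface EdgeRow { from: string; to: string; type: string; properties: string; }

// Parse the JSON-encoded `properties` column.
// Assumption: a valid value is a JSON object; anything else yields null.
function parseProperties(raw: string): Record<string, unknown> | null {
  try {
    const value: unknown = JSON.parse(raw);
    return typeof value === "object" && value !== null && !Array.isArray(value)
      ? (value as Record<string, unknown>)
      : null;
  } catch {
    return null;
  }
}

// Report duplicate node IDs, edges that reference unknown nodes,
// and `properties` values that do not parse as a JSON object.
function checkGraph(nodes: NodeRow[], edges: EdgeRow[]): string[] {
  const problems: string[] = [];
  const ids = new Set<string>();
  for (const node of nodes) {
    if (ids.has(node.id)) problems.push(`duplicate node id: ${node.id}`);
    ids.add(node.id);
    if (parseProperties(node.properties) === null) {
      problems.push(`node ${node.id}: properties is not a JSON object`);
    }
  }
  for (const edge of edges) {
    const label = `edge ${edge.from} -> ${edge.to} (${edge.type})`;
    if (!ids.has(edge.from)) problems.push(`${label}: unknown source node`);
    if (!ids.has(edge.to)) problems.push(`${label}: unknown target node`);
    if (parseProperties(edge.properties) === null) {
      problems.push(`${label}: properties is not a JSON object`);
    }
  }
  return problems;
}

// Toy rows for illustration only; "n3" is deliberately missing from the node list.
const sampleNodes: NodeRow[] = [
  { id: "n1", name: "Example gene", type: "gene", properties: '{"source":"demo"}' },
  { id: "n2", name: "Example disease", type: "disease", properties: "{}" },
];
const sampleEdges: EdgeRow[] = [
  { from: "n1", to: "n2", type: "associated_with", properties: "{}" },
  { from: "n1", to: "n3", type: "interacts_with", properties: "{}" },
];
const sampleProblems = checkGraph(sampleNodes, sampleEdges);
console.log(sampleProblems); // reports the dangling edge to "n3"
```

Catching dangling edges at this stage is cheaper than debugging an agent that cannot resolve a neighbor later.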

Step 4: Run the CLI

That's it. Start (or restart) the CLI and your new agent will appear automatically:

pnpm cli
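
To make the discovery rule concrete, here is a minimal sketch of how a scan of data/ could work. It is not the CLI's actual implementation: `discoverGraphs` and `DiscoveredGraph` are hypothetical names, and the rule that a folder counts only when graph.json, nodes.parquet, and edges.parquet are all present is an assumption drawn from the steps above.

```typescript
import { existsSync, mkdirSync, mkdtempSync, readFileSync, readdirSync, rmSync, writeFileSync } from "node:fs";
import { tmpdir } from "node:os";
import { join } from "node:path";

// Assumption: a folder is a usable graph only if all three files exist.
const REQUIRED_FILES = ["graph.json", "nodes.parquet", "edges.parquet"];

interface DiscoveredGraph { folder: string; name: string; order: number; }

// Scan each subdirectory of `dataDir` and return the complete graphs,
// sorted so that lower `order` values come first.
function discoverGraphs(dataDir: string): DiscoveredGraph[] {
  const found: DiscoveredGraph[] = [];
  for (const entry of readdirSync(dataDir, { withFileTypes: true })) {
    if (!entry.isDirectory()) continue;
    const dir = join(dataDir, entry.name);
    if (!REQUIRED_FILES.every((file) => existsSync(join(dir, file)))) continue;
    const meta = JSON.parse(readFileSync(join(dir, "graph.json"), "utf8"));
    found.push({ folder: entry.name, name: String(meta.name), order: Number(meta.order) });
  }
  return found.sort((a, b) => a.order - b.order);
}

// Demo in a throwaway directory: "mykg" is complete, "draft" lacks edges.parquet.
// The parquet files are empty placeholders; only their presence is checked here.
const demoDir = mkdtempSync(join(tmpdir(), "ark-data-"));
const demoLayout: [string, string[]][] = [
  ["mykg", REQUIRED_FILES],
  ["draft", ["graph.json", "nodes.parquet"]],
];
for (const [folder, files] of demoLayout) {
  mkdirSync(join(demoDir, folder));
  for (const file of files) {
    const body = file === "graph.json" ? JSON.stringify({ name: folder, order: 4 }) : "";
    writeFileSync(join(demoDir, folder, file), body);
  }
}
const discoveredFolders = discoverGraphs(demoDir).map((graph) => graph.folder);
console.log(discoveredFolders); // only "mykg" has all three files
rmSync(demoDir, { recursive: true, force: true });
```

If your new agent does not appear after a restart, a check like this is a quick way to confirm that the folder is complete and that graph.json parses.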

Available Tools

Each AI agent has access to the following tools for exploring its knowledge graph. You don't call these directly. The agent decides which tools to use based on your question.

| Tool | Description | Parameters |
| --- | --- | --- |
| `listAvailableGraphs` | List all available knowledge graphs | None |
| `findNodesByName` | Search for nodes by name (partial match) | `name: string` |
| `getNodeDetails` | Get detailed information about a specific node | `nodeId: string` |
| `getNeighborsByNodeId` | Get all neighbors of a node, optionally filtered by edge type | `nodeId: string`, `edgeType?: string` |
| `searchInSurroundings` | Search within 1 or 2 hops of a node with optional filters | `nodeId`, `query?`, `nodeType?`, `edgeType?`, `k?: "1"\|"2"` |
| `findPaths` | Find all length-2 paths between two nodes | `sourceNodeId`, `destinationNodeId` |
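
To illustrate what "length-2 paths" means for `findPaths`, the sketch below enumerates two-hop paths over an in-memory edge list. It is not the tool's implementation (the README states that data is queried via DuckDB): `findLengthTwoPaths` is a hypothetical function, the node IDs are toy values, and treating edges as directed is an assumption the README does not confirm.

```typescript
interface Edge { from: string; to: string; type: string; }
interface TwoHopPath { via: string; firstEdge: string; secondEdge: string; }

// Enumerate every path source -> via -> destination.
// Assumption: edges are followed in their stored direction only.
function findLengthTwoPaths(edges: Edge[], source: string, destination: string): TwoHopPath[] {
  // Index outgoing edges by their source node for quick lookup.
  const outgoing = new Map<string, Edge[]>();
  for (const edge of edges) {
    const list = outgoing.get(edge.from) ?? [];
    list.push(edge);
    outgoing.set(edge.from, list);
  }

  const paths: TwoHopPath[] = [];
  for (const first of outgoing.get(source) ?? []) {
    for (const second of outgoing.get(first.to) ?? []) {
      if (second.to === destination) {
        paths.push({ via: first.to, firstEdge: first.type, secondEdge: second.type });
      }
    }
  }
  return paths;
}

// Toy graph with made-up IDs; it does not come from any shipped knowledge graph.
const toyEdges: Edge[] = [
  { from: "drug:A", to: "gene:X", type: "targets" },
  { from: "gene:X", to: "disease:B", type: "associated_with" },
  { from: "drug:A", to: "gene:Y", type: "targets" },
  { from: "gene:Y", to: "disease:B", type: "associated_with" },
  { from: "drug:A", to: "disease:B", type: "treats" }, // direct edge, so not a length-2 path
  { from: "gene:Y", to: "gene:Z", type: "interacts_with" },
];
const twoHopPaths = findLengthTwoPaths(toyEdges, "drug:A", "disease:B");
console.log(twoHopPaths.map((path) => path.via)); // intermediate nodes: gene:X and gene:Y
```

The intermediate node and the two edge types are what an agent would cite when explaining an indirect relationship, such as a drug linked to a disease through a shared gene.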

Development

Scripts

| Script | Description |
| --- | --- |
| `pnpm cli` | Run in development mode with hot reload |
| `pnpm build` | Compile to a standalone binary |
| `pnpm check-types` | Run TypeScript type checking |
| `pnpm clean` | Remove build artifacts |

Custom Tool Renderers

Tool renderers provide rich visualization of tool outputs in the terminal. See src/tool-renderers/get-node-details-tool.tsx for an example of how to build a custom renderer for a graph tool.

Technology Stack

Runtime: Bun
Language: TypeScript
UI: React 19 with @ai-tui/core
LLM: Vercel AI SDK (currently configured for Anthropic Claude)
Data: Local parquet files queried via DuckDB
Validation: Zod

Citation

If you use ARK in your research, please cite our paper:

@misc{polonuer2026autonomousknowledgegraphexploration,
      title={Autonomous Knowledge Graph Exploration with Adaptive Breadth-Depth Retrieval}, 
      author={Joaquín Polonuer and Lucas Vittor and Iñaki Arango and Ayush Noori and David A. Clifton and Luciano Del Corro and Marinka Zitnik},
      year={2026},
      eprint={2601.13969},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2601.13969}, 
}

Related Projects

ARK (Main Repository) — Core ARK implementation and benchmarks
PrimeKG — Precision Medicine Knowledge Graph
Zitnik Lab — Harvard Medical School research group

License

This project is licensed under the MIT License — see the LICENSE file for details.

Made with ❤️ at Zitnik Lab, Harvard Medical School
