## Context
QMD (github.com/tobi/qmd, 10.7K stars, by Tobi Lütke, Shopify's CEO) is a local-first CLI search engine for markdown knowledge bases. It combines BM25, vector search, and local LLM reranking via node-llama-cpp GGUF models, and is MCP-native.
QMD is the most relevant comparison for BM because:
- Same philosophy: local-first, markdown files, on-device
- Same search techniques: BM25 + vector + hybrid
- Same ecosystem: MCP tools for Claude Code/Cursor
- But fundamentally different architecture: flat document search (QMD) vs knowledge graph with semantic relations (BM)
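Both projects describe "hybrid" BM25 + vector search. One common way to fuse the two rankings is Reciprocal Rank Fusion (RRF); neither project's docs confirm this exact method, so treat the sketch below as illustrative only (doc IDs and the `k=60` constant are conventional placeholders):

```python
# Illustrative sketch of hybrid-search rank fusion via Reciprocal Rank
# Fusion (RRF). Not necessarily what QMD or BM actually implement.
def rrf_fuse(rankings, k=60):
    """rankings: list of ranked doc-ID lists, best first."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1/(k + rank) for every doc it ranks.
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical rankings from a BM25 pass and a vector-similarity pass.
bm25_ranking = ["doc_a", "doc_b", "doc_c"]
vector_ranking = ["doc_b", "doc_a", "doc_d"]
print(rrf_fuse([bm25_ranking, vector_ranking]))
```

Documents ranked highly by both passes float to the top, which is the behavior a hybrid benchmark is implicitly testing.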
## Benchmark Design
### Retrieval metrics (existing benchmark)
- Ingest LoCoMo conversations into QMD collections
- Run the same queries through `qmd query` (hybrid + reranking mode)
- Measure R@5, R@10, MRR against the same ground truth
- Compare: BM hybrid search vs QMD hybrid+reranking
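The metrics above are standard; a minimal sketch of how they would be computed per query (doc IDs are hypothetical):

```python
# Recall@k and MRR for one query, given a ranked list of retrieved doc IDs
# and a ground-truth set of relevant doc IDs (as in the LoCoMo benchmark).
def recall_at_k(retrieved, relevant, k):
    """Fraction of relevant docs that appear in the top-k results."""
    hits = len(set(retrieved[:k]) & set(relevant))
    return hits / len(relevant)

def reciprocal_rank(retrieved, relevant):
    """1/rank of the first relevant doc, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(retrieved, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0

retrieved = ["d3", "d7", "d1", "d9", "d2"]
relevant = {"d1", "d2"}
print(recall_at_k(retrieved, relevant, 5))   # 1.0 (both relevant docs in top 5)
print(reciprocal_rank(retrieved, relevant))  # 1/3 (first hit at rank 3)
```

MRR for the benchmark is then the mean of `reciprocal_rank` over all queries.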
### LLM-as-Judge (once #9 lands)
- Same eval: retrieve via BM MCP tools vs QMD MCP tools
- Same eval LLM, same judge, same questions
- Direct answer accuracy comparison
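The comparison loop could look roughly like this. `retrieve`, `answer_with_context`, and `judge` are hypothetical stand-ins for the real MCP and LLM calls, which issue #9 will define; the point is only that both systems share the same questions, answering LLM, and judge:

```python
# Hedged sketch of the LLM-as-judge comparison: answer each question once per
# retrieval system, grade both answers with the same judge, tally accuracy.
def compare_systems(questions, retrievers, answer_with_context, judge):
    """questions: dicts with 'question' and 'gold' keys.
    retrievers: {system_name: fn(question) -> context} (e.g. BM vs QMD MCP tools).
    answer_with_context: fn(question, context) -> answer (the shared eval LLM).
    judge: fn(question, gold, answer) -> bool (the shared judge LLM)."""
    scores = {name: 0 for name in retrievers}
    for q in questions:
        for name, retrieve in retrievers.items():
            context = retrieve(q["question"])
            answer = answer_with_context(q["question"], context)
            if judge(q["question"], q["gold"], answer):
                scores[name] += 1
    return scores
```

Holding the eval LLM and judge constant isolates retrieval quality, which is the variable this benchmark is after.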
## What we expect to learn
- Multi-hop: BM should win — our knowledge graph connects concepts across documents. QMD does flat retrieval.
- Single-hop: QMD may win — their local LLM reranker adds precision for direct fact lookup.
- Open domain: Interesting — QMD's context tree feature vs our semantic relations.
- Temporal: Both probably weak here (neither has temporal-specific indexing yet).
## What we learn either way
- If QMD's reranking beats our hybrid search → validates basic-memory#618 (add a reranking step to BM's search pipeline via a local cross-encoder)
- If BM's knowledge graph beats QMD on multi-hop → proves the value of semantic relations over flat search
- If results are close → the differentiator is UX, graph, and bidirectional human+AI access, not raw retrieval
## Installation

```shell
npm install -g @tobilu/qmd
```

QMD MCP tools: `qmd_search`, `qmd_vector_search`, `qmd_deep_search`, `qmd_get`, `qmd_multi_get`
## Notes
- Be respectful. Tobi has a massive audience. A fair comparison that acknowledges QMD's strengths (reranking, simplicity, speed) while showing BM's advantages (knowledge graph, relations, bidirectional access) is the right tone.
- Publish methodology and results openly.
## Related
- Benchmark: Add LLM-as-Judge evaluation (GPT-4.1) for LoCoMo #9 (LLM-as-Judge)
- Benchmark: Adopt Backboard's LoCoMo methodology for reproducible comparison #8 (methodology)
- Add reranking step to search pipeline (local cross-encoder) basic-memory#618 (local reranking — QMD already has this)
- Benchmark: Test with multiple eval LLMs to isolate memory quality from model capability #7 (multi-eval LLM comparison)
Milestone: v0.19.0