vault
ResearchVault is a local orchestration engine that stores research state in a local SQLite Vault and ingests external sources. It reads RESEARCHVAULT_DB, uses BRAVE_API_KEY, performs outgoing HTTP calls (e.g. https://api.search.brave.com/res/v1/web/search, https://grokipedia-api.com/page/{slug}), and includes CLI invocations such as python -m scripts.vault.
ResearchVault 🦞
The local-first orchestration engine for high-velocity AI research.
ResearchVault is a specialized state manager and orchestration framework for OpenClaw agents. It allows agents to handle complex, multi-step investigations by persisting state, instrumentation, and insights into a local SQLite "Vault."
Following the Inference-Speed Development philosophy, Vault is built CLI-first to close the loop between research planning and autonomous execution.
✨ Core Features
- The Vault (SQLite): A persistent, local ledger stored in
~/.researchvault/(configurable viaRESEARCHVAULT_DB). 100% private. - Normalized Evidence Core: Scalable storage for
artifacts,findings, andlinks(graph-ready). - Divergent Reasoning: First-class
branches+hypothesesso agents can explore competing explanations without contaminating the main line. - Cross-Artifact Synthesis: Local embeddings + similarity links (
links) to surface connections across findings and artifacts. - Active Verification Protocol: Auto-generated
verification_missionsfor low-confidence or#unverifiedfindings. - MCP Server: Expose the Vault over MCP for cross-agent access (
vault mcp/python -m scripts.mcp_server). - Watchdog Mode: Background scuttler that ingests URLs and runs query watches into your Vault.
- Unified Ingestion Engine: Modular connectors for automated research capture.
- Instrumentation 2.0: Every research event tracks Confidence (0.0-1.0), Source, and Tags.
- Multi-Source Support:
- X (Twitter): High-signal real-time data via
bird. - Reddit: Structured community discussions and top-comment trees.
- Grokipedia: Direct knowledge-base ingestion via API.
- YouTube: Metadata-only extraction (titles/descriptions) without API keys.
- X (Twitter): High-signal real-time data via
- Suspicion Protocol 2.0: Hardened logic for low-trust sources. Moltbook scans are forced to low-confidence (
0.55) and tagged#unverified. - Semantic Cache: Integrated deduplication for queries and artifacts.
- SSRF Safety: Robust URL validation blocks internal network probes and private IP ranges.
- Hardened Logic: Versioned database migrations and a comprehensive
pytestsuite.
🚀 Workflows
1. Project Management
Initialize a project, set objectives, and assign priority levels.
uv run python scripts/vault.py init --id "metal-v1" --name "Suomi Metal" --objective "Rising underground bands" --priority 5
2. Multi-Source Ingestion
Use the unified scuttle command to ingest data from any supported source (Reddit, YouTube, Grokipedia, Web).
uv run python scripts/vault.py scuttle "https://www.youtube.com/watch?v=..." --id "metal-v1"
3. Divergent Reasoning (Branches + Hypotheses)
uv run python scripts/vault.py branch create --id "metal-v1" --name "alt-hypothesis" --hypothesis "The trend is label-driven, not organic."
uv run python scripts/vault.py hypothesis add --id "metal-v1" --branch "alt-hypothesis" --statement "Streaming growth is driven by playlist placement." --conf 0.55
uv run python scripts/vault.py insight --id "metal-v1" --add --branch "alt-hypothesis" --title "Counter-signal" --content "..." --tags "unverified"
4. Cross-Artifact Synthesis
uv run python scripts/vault.py synthesize --id "metal-v1" --threshold 0.78 --top-k 5
5. Verification Missions
uv run python scripts/vault.py verify plan --id "metal-v1" --threshold 0.7 --max 20
uv run python scripts/vault.py verify list --id "metal-v1" --status open
uv run python scripts/vault.py verify run --id "metal-v1" --limit 5
6. Watchdog Mode
uv run python scripts/vault.py watch add --id "metal-v1" --type url --target "https://example.com" --interval 3600 --tags "seed"
uv run python scripts/vault.py watch add --id "metal-v1" --type query --target "rising underground metal bands finland" --interval 21600
uv run python scripts/vault.py watchdog --once --limit 10
7. MCP Server
uv run python scripts/vault.py mcp --transport stdio
# or:
uv run python -m scripts.mcp_server
8. Export & Reporting
Ship research summaries to Markdown or JSON for external use or agent review.
uv run python scripts/vault.py export --id "metal-v1" --format markdown --output summary.md
9. Verification & Testing
Run the integrated test suite to verify system integrity.
uv run pytest
10. Monitoring
View sorted project lists, high-level summaries, and detailed event logs.
uv run python scripts/vault.py list
uv run python scripts/vault.py summary --id "metal-v1"
uv run python scripts/vault.py status --id "metal-v1"
🛠️ Development & Environment
ResearchVault is formalized using uv for dependency management and Python 3.13 stability.
- Core Architecture: Modular design separating Interface (
vault.py), Logic (core.py), and Storage (db.py). - Oracle Loops: Complex refactors use high-reasoning sub-agents.
- Main-Line Evolution: Atomic improvements are committed directly to
main.
This project is 100% developed by AI agents (OpenClaw / Google Antigravity / OpenAI Codex), carefully orchestrated and reviewed by Luka Raivisto.