Research Claw Framework Evolution Team
open source & actively building

Exploring the Frontier of
AI Agents

We explore what AI agents can — and can't — do. From multi-agent collaboration to autonomous self-evolution, we research agent capabilities, build practical tools, and push the boundaries — all in the open.

Application

Research Claw

A self-hosted AI assistant for academic research — manages your papers, searches literature, tracks deadlines, and answers you on the channels you already use.

Writing & Compilation — Chat-driven LaTeX editing, one-command compile with auto-diagnosis, built-in venue skills (NeurIPS, ICML, ICLR, ACL, CVPR…).
🔄 Overleaf & Git — Bidirectional sync, every AI edit auto-committed, instant rollback.
🎭 Multi-Agent Collaboration — Sub-agents in isolated sandboxes; /task decomposes goals into a DAG and runs in parallel.
🔍 Literature Search — arXiv, PubMed, OpenAlex with full-text PDF reading.
📡 Research Radar — Daily scans, weekly digests, deadline alerts — pushed to Telegram, Feishu, Email, or any Apprise channel.
🧠 Memory & Context — Project-level memory across sessions with automated summarization.
💬 Access Anywhere — Web UI, CLI, Feishu, Telegram, QQ, DingTalk — no public IP required.
View on GitHub →

Mobile demo (English) · Executed by GLM-5
Papers generated: LLM-Based Autonomous Multi-Agent Systems Survey · Hierarchical Memory Sharing in MAS

Framework

nano_agent_team

Multi-agent collaboration built on the Blackboard Model. No database, no message queue — agents share state through structured Markdown files on the file system, version-controlled and human-readable.

Design Principles
📋 Protocol as Data — Use self-describing Markdown files as communication protocols instead of hard-coded APIs.
📂 Files as Database — Purely file-system based, no extra database required.
🌀 Dynamic Self-Organization — The Watchdog agent analyzes tasks, plans the blackboard structure, and spawns/coordinates Worker agents.
👁 Human-in-the-Loop — Monitor and intervene in agent execution through the TUI.
View on GitHub →
Query
Build a terminal-based 'Developer Survival Simulator' — a text adventure game where the player is a junior dev surviving their first week at a startup. Include: 1) An event engine with 20+ random events 2) A stat system tracking Energy, Code Quality, Boss Satisfaction, and Sanity 3) A decision system where the player picks from 2-3 options per event 4) A Rich-powered terminal UI with live stat bars, event log, and ASCII art 5) A game-over/victory condition (survive 5 days or get fired) 6) Unit tests for the event engine and stat system.
Generated Result → Python · Rich TUI · 203 Tests · Generated by DeepSeek via nano_agent_team
Experiment · Alpha

Self-Evolving Agents

Can agents improve themselves? We run unattended evolution sessions where agents analyze gaps in their own framework, propose features, implement and test them autonomously.

1 Analyze — Scan the codebase for gaps, missing features, or improvement opportunities.
2 Implement — Write code, add tests, create new tools or middleware components.
3 Validate — Run the full test suite. Only passing rounds get merged.
View on GitHub →
Session 20260311
Mar 11, 2026 — 5 rounds, all PASS
5/5 Pass

Agent autonomously identified missing data analysis and code introspection capabilities, then built and integrated 5 new tools β€” from arXiv search wiring to AST-based code analysis and token cost tracking.

DataAnalysisTool ArxivSearchTool CodeAnalysisTool MarkdownExportTool TokenUsageTracker
Session 20260315
Mar 15, 2026 — 9 rounds
9/9 Pass

Focused on observability and self-healing β€” the agent added cross-session memory, code health analysis, self-reflection middleware, a live status dashboard, and failure diagnosis & replay tools.

ExperienceMemory CodeHealthAnalyzer SelfReflection StatusDashboard SelfDiagnosis AgentMonitor SessionReplay DiagnosisTool
More sessions on the way — experiments are ongoing.
We also livestream evolution sessions on Bilibili — watch agents evolve in real-time.

The People Behind It

A small group curious about what agents can really do.