Cognaptus DataHub Monitor

GitHub Resources from arXiv Digests

A monitored reference page for GitHub repositories surfaced from arXiv-paper digests, rendered from a machine-generated local data file.

Updated: 2026-06-16 19:06:35 +0800

Source: zl_agentr litxr digest extraction

GitHub repositories linked from arXiv-paper digests

Tracked Repositories: 115

Unique Papers: 47

Core Fields: paper title, ref_id, GitHub URL

Refresh Mode: local data file written by backend tasks

How To Use This Page

Use it as a lightweight index of implementation assets surfaced from research digests. The page stays defensive: required fields remain visible even when optional metadata is missing.
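The defensive behavior described above can be sketched roughly as follows. This is an illustration only, not the page's actual renderer: the key names `paper_title`, `github_url`, `description`, `authors`, and `tags`, the `(missing)` placeholder, and the function `render_entry` are all assumptions made for the example; only `ref_id` is a field name taken from the page itself.

```python
from typing import Any, Mapping

# Assumed key names for the three core fields; only "ref_id" appears verbatim on the page.
REQUIRED_FIELDS = ("paper_title", "ref_id", "github_url")
# Hypothetical optional metadata keys.
OPTIONAL_FIELDS = ("description", "authors", "tags")


def render_entry(record: Mapping[str, Any]) -> str:
    """Render one repository record as plain text.

    Required fields always produce a line, with a placeholder when the value
    is absent or blank. Optional fields are skipped when they are empty.
    """
    lines = []
    for field in REQUIRED_FIELDS:
        value = record.get(field)
        text = str(value).strip() if value is not None else ""
        lines.append(f"{field}: {text or '(missing)'}")
    for field in OPTIONAL_FIELDS:
        value = record.get(field)
        if not value:
            continue  # optional metadata is simply omitted
        if isinstance(value, (list, tuple)):
            value = ", ".join(str(v) for v in value)
        lines.append(f"{field}: {value}")
    return "\n".join(lines)
```

Under these assumptions, a record that lacks `ref_id` still renders a `ref_id: (missing)` line, so a gap in the data file stays visible instead of silently dropping the entry.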

This page is designed as a refreshable reference surface rather than a hand-maintained article.

The goal is simple: when paper-digest workflows identify linked GitHub repositories, keep them visible in one place with enough context to scan quickly and revisit later.

Repository Index

Search by paper title, arXiv reference, GitHub repository, author, or tags when those fields are available.
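A search that tolerates missing fields, as the sentence above implies, might look like the sketch below. It is a minimal illustration rather than the page's real search code: the key names and the `search` and `_as_text` helpers are hypothetical, and case-insensitive substring matching is an assumed strategy.

```python
from typing import Any, Iterable, List, Mapping

# Assumed key names for the searchable fields named on the page.
SEARCH_FIELDS = ("paper_title", "ref_id", "github_url", "authors", "tags")


def _as_text(value: Any) -> str:
    """Flatten a field value to text; absent values become an empty string."""
    if value is None:
        return ""
    if isinstance(value, (list, tuple, set)):
        return " ".join(str(v) for v in value)
    return str(value)


def search(records: Iterable[Mapping[str, Any]], query: str) -> List[Mapping[str, Any]]:
    """Return records whose available fields contain the query, ignoring case."""
    needle = query.strip().casefold()
    if not needle:
        return list(records)  # a blank query returns everything
    hits = []
    for record in records:
        haystack = " ".join(_as_text(record.get(f)) for f in SEARCH_FIELDS).casefold()
        if needle in haystack:
            hits.append(record)
    return hits
```

Because every lookup goes through `record.get`, a record without `authors` or `tags` is still searchable on the fields it does have.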

An Awesome Data Agents repository linked directly in the paper header, likely used to collect or organize data-agent resources associated with the survey.

HKUSTDial/awesome-data-agents
Open GitHub

JoyAgent is discussed as a proto-L3 system that begins to address predefined-toolset limitations through tool evolution and multi-level thinking.

jd-opensource/joyagent-jdgenie system
Open GitHub

GitHub repository established by the authors as the project page associated with the survey on embodied learning for object-centric robotic manipulation.

RayYoh/OCRM_survey
Open GitHub

GPT Engineer, a software-development agent implementation cited in the engineering application survey and open-source project discussion.

AntonOsika/gpt-engineer implementation
Open GitHub

GPT Researcher, an experimental application that uses LLMs for research-question development, web crawling, source summarization, and aggregation.

assafelovic/gpt-researcher implementation
Open GitHub

AI Legion, an LLM-agent implementation cited in the survey's open-source library and reference set.

eumemic/ai-legion implementation
Open GitHub

LoopGPT, an LLM-agent implementation cited in the survey's open-source library and reference set.

farizrahman4u/loopgpt implementation
Open GitHub

AGiXT, an agent framework implementation cited in the survey as a dynamic AI automation platform.

Josh-XT/AGiXT framework
Open GitHub

DemoGPT, a software-development agent repository cited in the engineering application survey and open-source project discussion.

melih-unsal/DemoGPT implementation
Open GitHub

MiniAGI, an LLM-agent implementation cited in the survey's open-source library and reference set.

muellerberndt/mini-agi implementation
Open GitHub

AgentVerse, a multi-agent collaboration framework referenced among surveyed agent systems and open-source libraries.

OpenBMB/AgentVerse framework
Open GitHub

AgentGPT, an LLM-based autonomous-agent system cited in the survey's open-source library and reference set.

reworkd/AgentGPT implementation
Open GitHub

Auto-GPT, an autonomous LLM-agent implementation included in the construction taxonomy and open-source library discussion.

Significant-Gravitas/Auto-GPT implementation
Open GitHub

SmolModels, a developer-style agent repository cited as a software-engineering application artifact.

smol-ai/developer implementation
Open GitHub

WorkGPT, a workflow-oriented LLM-agent framework cited as similar to AutoGPT and LangChain.

team-openpm/workgpt implementation
Open GitHub

SuperAGI, an autonomous-agent framework cited in the survey's open-source library and reference set.

TransformerOptimus/SuperAGI implementation
Open GitHub

XLang, an LLM-agent/tool-use framework cited as supporting executable language grounding and interaction with databases, web applications, and physical robots.

xlang-ai/xlang implementation
Open GitHub

Repository for the AGENT KB cross-framework agent memory system introduced and evaluated in the paper.

OPPO-PersonalAI/Agent-KB framework
Open GitHub

Repository for the Agent Mentor / Agent Analytics open-source observability and analytics platform for agentic AI applications.

AgentToolkit/agent-mentor framework
Open GitHub

GitHub path identified by the paper as the code corresponding to the analytics pipeline used for semantic feature analysis.

AgentToolkit/agent-mentor implementation
Open GitHub

Repository for the Agent-as-a-Judge project and DevAI-related evaluation artifacts.

metauto-ai/agent-as-a-judge framework
Open GitHub

GPT-Pilot is one of the three open-source code-generation agentic systems benchmarked in the paper.

Pythagora-io/gpt-pilot framework
Open GitHub

Google's Agent-to-Agent Protocol repository, referenced as the source for A2A, one of the modern agent communication protocols compared in the paper.

google/A2A implementation
Open GitHub

GitHub Gist containing the Claude Code implementation prompt for the case summarization by file name microservice.

https:/
Open GitHub

GitHub Gist containing the Case Summarization by Given Case Name Workflow pitch generated by the Planning Agent.

https:/
Open GitHub

BabyAGI is used as a representative agentic framework showing how LLMs can be embedded in feedback loops to plan, act, adapt, and manage or prioritize subtasks.

yoheinakajima/babyagi framework
Open GitHub

Repository stated by the paper as the public code and data release for AirQA.

OpenDFM/AirQA dataset
Open GitHub

Repository reported by the authors as containing the code and data for Ask an Expert / BBMHReasoning experiments.

QZx7/BBMHReasoning implementation
Open GitHub

Author-referenced sample dataset of synthetic identification-document images covering five document types.

meetsandesh/identification_document_dataset dataset
Open GitHub

Author-referenced repository for generating synthetic document images used in the document identification and information extraction experiment.

meetsandesh/synthetic_document_generator dataset
Open GitHub

The paper's AutoGen framework repository for building LLM applications via multi-agent conversations.

microsoft/autogen framework
Open GitHub

Repository for the ADAS codebase introduced by the paper, including the Meta Agent Search implementation and experimental framework.

ShengranHu/ADAS framework
Open GitHub

AgentGPT is analysed as a general-purpose autonomous LLM-powered multi-agent system with user-guided alignment in selected aspects such as decomposition, agent generation, and resource utilization.

reworkd/AgentGPT framework
Open GitHub

Auto-GPT is analysed as a general-purpose autonomous LLM-powered multi-agent system with autonomous goal decomposition, task action management, and resource utilization.

Significant-Gravitas/Auto-GPT framework
Open GitHub

SuperAGI is analysed as a general-purpose autonomous LLM-powered multi-agent system with some user-guided alignment options for agent-related and resource-related aspects.

TransformerOptimus/SuperAGI framework
Open GitHub

BabyAGI is analysed as a general-purpose autonomous LLM-powered multi-agent system with a profile similar to Auto-GPT across many assessed aspects.

yoheinakajima/babyagi framework
Open GitHub

An aggregated dataset of chess opening names and move sequences used by the paper to create opening-position concept datasets.

lichess-org/chess-openings dataset
Open GitHub

A GitHub repository listed by the paper as an accompanying curated resource for papers on code as agent harness.

YennNing/Awesome-Code-as-Agent-Harness-Papers
Open GitHub

Repository for the code, human-subject study materials, results, and supplementary materials associated with the Persona framework and AAAI 2025 paper.

YODA-Lab/Persona framework
Open GitHub

Semantic routing package for routing inputs by embedding or intent similarity.

aurelio-labs/semantic-router package
Open GitHub

Framework using repeated generations, verification prompts, and confidence estimates to decide whether to escalate to larger models.

automix-llm/automix framework
Open GitHub

AWS multi-agent orchestration framework that includes prompt-based routing or agent selection patterns.

awslabs/multi-agent-orchestrator framework
Open GitHub

Implementation associated with routing prompts to pre-trained experts after fine-tuned meta-model categorisation.

godcherry/ExpertTokenRouting implementation
Open GitHub

Implementation associated with deciding whether a query requires a complex prompting strategy.

imagination-research/sot implementation
Open GitHub

Iterative multi-agent code generation system using execution success as a routing signal.

JieyuZ2/EcoAssistant system
Open GitHub

LLM routing implementation associated with assessing model adequacy through multiple responses and ground-truth comparison.

kvadityasrivatsa/llm-routing implementation
Open GitHub

Routing-agent implementation using synthetic data and small classifiers for classification-based routing.

lamini-ai/llm-routing-agent implementation
Open GitHub

Orchestrator implementation using decoder-only LLM representations for routing or model selection.

Leeroo-AI/leeroo_orchestrator system
Open GitHub

Framework for serving and evaluating routers that choose between LLMs using preference-oriented routing strategies.

lm-sys/RouteLLM framework
Open GitHub

Task-planning framework in which an LLM selects among models or tools based on descriptions and user tasks.

microsoft/JARVIS framework
Open GitHub

Implementation assessing consistency across reasoning representations for cascade-style routing.

MurongYue/LLM_MoT_cascade implementation
Open GitHub

OpenAI multi-agent orchestration framework discussed as an example of prompt-based routing practice.

openai/swarm framework
Open GitHub

Fine-tuned model framework for API call generation, discussed as treating routing as a code generation problem.

ShishirPatil/gorilla implementation
Open GitHub

Framework for reducing LLM application cost using LLM cascades and related strategies.

stanford-futuredata/Frugalgpt framework
Open GitHub

Adaptive RAG framework that routes among no retrieval, single-step retrieval, and multi-step retrieval paths according to query complexity.

starsuzi/Adaptive-RAG framework
Open GitHub

Code and data for a multi-LLM routing benchmark and evaluation framework.

withmartian/routerbench dataset
Open GitHub

Repository for EmbedLLM materials, described by the paper as containing the dataset, code, and embedder for further research and application.

richardzhuang0412/EmbedLLM dataset
Open GitHub

Stated repository for the modular Python prototype implementing the neuro-symbolic ontology-based LLM validation pipeline.

ruslanmv/Neuro-symbolic-interaction system
Open GitHub

Repository associated with the paper's contamination-detection work for LLM evaluation. The paper links it as code and data; the currently visible README describes a lightweight tool for identifying and analysing potential contamination without access to LLM training data.

liyucheng09/Contamination_Detector dataset
Open GitHub

Repository containing code and technical details for the Multi-Agent Scoring System for essay assessment.

AzizovDilshod/Multi-agent-System-for-Essay-Assessment system
Open GitHub

Repository containing the benchmark data, task files, representative logs, and evaluation scripts for AutoGen, MetaGPT, and TaskWeaver.

lurf21/Agent_Evaluation_Framework dataset
Open GitHub

Repository named after the paper 'From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications' and described on GitHub as the paper's code repository.

jiangfeibo/ComAgent framework
Open GitHub

BeeAI is described as the experimental platform central to IBM's ACP, supporting local-first orchestration, agent discovery, REST endpoints, SDKs, telemetry, and multi-agent execution.

i-am-bee/beeai-framework framework
Open GitHub

The MCP servers repository is cited as an ecosystem of reference and integration servers for file management, databases, Google Drive, Git, GitHub, GitLab, Slack, Google Maps, image generators, and search APIs.

modelcontextprotocol/servers implementation
Open GitHub

OpenAI Swarm is reviewed as a lightweight, stateless abstraction for multi-agent systems with agent definitions, dynamic handoffs, context management, direct function calling, streaming, and backend flexibility.

openai/swarm framework
Open GitHub

TheAgentCompany repository contains sandboxed work environments, task directories, evaluators, task instructions, and supporting files for many task instances listed in the paper's appendix task table.

TheAgentCompany/TheAgentCompany dataset
Open GitHub

GitHub directory linked by the paper for the deterministic prediction task source code, including Pauli string multiplication, divide-and-conquer, letter replacement, and addition-related files.

EdenCodeInc/PyCliffordMCP benchmark
Open GitHub

Repository for the FastAPI control server, callback-augmented interactive trainer, React/TypeScript dashboard, examples, and LLM-based tuning demonstration.

yuntian-group/interactive-training framework
Open GitHub

Open-source CAMEL framework for autonomous cooperation among communicative agents using inception prompting and role-play.

camel-ai/camel framework
Open GitHub

Open-source multi-agent collaborative framework associated with MetaGPT, discussed as a representative framework that embeds human workflow processes and SOPs into language-agent collaboration.

geekan/MetaGPT framework
Open GitHub

Open-source AutoGen framework for creating LLM applications using customizable agents that can be programmed through natural language and code.

microsoft/autogen framework
Open GitHub

Author-maintained repository for tracking LLM-based multi-agent papers and organizing them into streams such as frameworks, orchestration and efficiency, problem solving, world simulation, datasets, and benchmarks.

taichengguo/LLM_MultiAgents_Survey_Papers
Open GitHub

CAMEL is an open-source multi-agent framework for role-playing and agent collaboration.

camel-ai/camel framework
Open GitHub

Code repository for improving factuality and reasoning through multi-agent debate.

composable-models/llm_multiagent_debate framework
Open GitHub

MetaGPT is a multi-agent framework that models a software company using role assignments and SOP-style workflows.

FoundationAgents/MetaGPT framework
Open GitHub

AutoAgents generates different roles for GPTs to form a collaborative entity for complex tasks.

Link-AGI/AutoAgents framework
Open GitHub

Microsoft AutoGen, a framework for building multi-agent AI applications.

microsoft/autogen framework
Open GitHub

Code repository for Solo Performance Prompting / multi-persona self-collaboration.

MikeWangWZHL/Solo-Performance-Prompting framework
Open GitHub

AgentVerse provides task-solving and simulation frameworks for multiple LLM-based agents.

OpenBMB/AgentVerse framework
Open GitHub

ChatDev implements LLM-powered multi-agent collaboration for software development.

OpenBMB/ChatDev framework
Open GitHub

Repository connected to AI Scientist-generated papers reported as having passed peer review at an ICLR workshop.

SakanaAI/AI-Scientist-ICLR2025-Workshop-Experiment
Open GitHub

Code repository for MAD, a multi-agent debate framework using large language models.

Skytliang/Multi-Agents-Debate framework
Open GitHub

Sibyl System repository included in the selected AutoGen application sample.

Ag2S1/Sibyl-System system
Open GitHub

AutoTx repository for planning and executing on-chain transactions.

agentcoinorg/AutoTx
Open GitHub

GPT-Academic repository for LLM-assisted academic reading, writing, translation, and code/project analysis workflows.

binary-husky/gpt_academic
Open GitHub

Composio platform/repository evaluated as a flexible agent application or platform with multiple autonomy-related configurations.

ComposioHQ/composio
Open GitHub

h2oGPT repository for private local GPT-style chat and document interaction.

h2oai/h2ogpt
Open GitHub

GraphRag_Ollama repository combining AutoGen, GraphRAG, Ollama, and related tooling.

karthikvenkatesan-eaton/Autogen_GraphRAG_Ollama
Open GitHub

Langflow platform for building and deploying AI-powered agents and workflows.

langflow-ai/langflow
Open GitHub

Letta platform for stateful agents with advanced memory.

letta-ai/letta
Open GitHub

AutoGen open-source framework for building AI agent systems using language models, multi-agent conversations, and tool use.

microsoft/autogen framework
Open GitHub

AutoGen Studio application within the AutoGen repository.

microsoft/autogen system
Open GitHub

Dream Team repository for building a team of AI agents with AutoGen.

yanivvak/dream-team
Open GitHub

GitHub source for the multiple-choice TruthfulQA variant used in the model-level ranking experiment.

manyoso/haltt4llm dataset
Open GitHub

CHALE repository used as a hallucination-evaluation dataset with non-hallucinated, half-hallucinated, and hallucinated answer categories.

weijiaheng/CHALE dataset
Open GitHub

Repository path for Agent Spec runtime adapters that translate Agent Spec components into framework-specific equivalents for popular agentic frameworks.

oracle/agent-spec implementation
Open GitHub

WayFlow is presented as the paper's reference runtime for executing Agent Spec components, including native support for Agent Spec Agents and Flows.

oracle/wayflow framework
Open GitHub

Library of ADMM applications for sparse and low-rank optimization used to test NewADMM.

canyilu/LibADMM package
Open GitHub

Huawei Cloud VM-placement traces used in the cloud resource scheduling case study.

huaweicloud/VM-placement-dataset dataset
Open GitHub

Repository for the ICLR 2023 ReAct prompting paper, including data, prompts, HotpotQA, FEVER, ALFWorld, and WebShop notebooks, plus Wikipedia environment wrappers.

ysymyth/ReAct benchmark
Open GitHub

Repository for serving, training, and evaluating LLM routers, including router types corresponding to the paper such as matrix factorization, similarity-weighted ranking, BERT, causal LLM, and random routing.

lm-sys/routellm framework
Open GitHub

Repository reported by the paper for the AtomicTranslation code used in the language-to-logic translation experiments.

KrisAesoey/AtomicTranslation dataset
Open GitHub

A curated repository associated with the paper that organizes efficient architecture papers according to the survey's categories.

weigao266/Awesome-Efficient-Arch
Open GitHub

Repository for the BESSTIE sentiment and sarcasm classification benchmark for varieties of English.

unswnlp/BESSTIE dataset
Open GitHub

Repository for the InstruSum instruction-controllable summarization dataset referenced and used as an evaluation target in the paper.

yale-nlp/InstruSum dataset
Open GitHub

Companion repository that organizes works on LLM-agent evaluation according to the survey's structure and tracks papers, benchmarks, methodologies, and frameworks.

Asaf-Yehudai/LLM-Agent-Evaluation-Survey
Open GitHub

Framework for evaluating and optimizing agents and models in container environments, discussed as part of emerging standardized cross-environment agent evaluation.

harbor-framework/harbor framework
Open GitHub

LangChain AgentEvals package for evaluating agent trajectories, including trajectory matching and graph-based evaluation.

langchain-ai/agentevals framework
Open GitHub

HAL harness for centralized and reproducible evaluation across agent benchmarks.

princeton-pli/hal-harness
Open GitHub

Repository for SWE-agent, the LM-based agent system that attempts to fix GitHub issues using an agent-computer interface and configurable tools.

SWE-agent/SWE-agent framework
Open GitHub

Repository linked by the paper for LLM uncertainty decomposition, with folders and scripts related to input uncertainty, decoding uncertainty, model uncertainty, data, models, utilities, and uncertainty scoring.

aditya-taparia/LLM-Uncertainty implementation
Open GitHub

A GitHub repository collecting papers related to LLM-based agents, linked by the survey as a related-papers resource.

WooooDyy/LLM-Agent-Paper-List
Open GitHub

Repository containing code for Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning, including simulation pipeline, scripts, installation instructions, and training/evaluation commands.

UWRobotLearning/RISE implementation
Open GitHub

Repository identified as the code for 'When Routing Collapses: On the Degenerate Convergence of LLM Routers'.

AIGNLAI/EquiRouter implementation
Open GitHub

Public repository containing the cost-aware LLM routing system, training/data-preprocessing components, evaluation and serving pipeline, router tests, and documentation.

SalesforceAIResearch/xRouter framework
Open GitHub
