A curated collection of papers I’m reading, planning to read, or have completed. This table helps me track my progress and whether I’ve created detailed cards for each paper.

Status Paper Title Card Created Notes
Completed BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining ✅ Yes High priority - synthetic data scaling
Reading Memento: Fine-tuning LLM Agents without Fine-tuning LLMs ✅ Yes Continual learning for agents
Completed PickLLM: Context-Aware RL-Assisted Large Language Model Routing ✅ Yes RL for LLM selection - interesting approach
Completed Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning ✅ Yes Multi-step routing with RL - excellent framework
Reading Deep Graph Anomaly Detection: A Survey and New Perspectives ✅ Yes Comprehensive survey - good reference
Completed Deep Think with Confidence ✅ Yes Confidence-based reasoning filtering - practical
Not Started gpt-oss-120b & gpt-oss-20b Model Card ❌ No Model card for new open-weight reasoning models from OpenAI with agentic capabilities.
Not Started R-Zero: Self-Evolving Reasoning LLM from Zero Data ❌ No Fully autonomous framework where Challenger/Solver models co-evolve to generate their own training curriculum from scratch.
Not Started Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning ❌ No An agentic GraphRAG that models retrieval as a multi-turn interaction and optimizes it with end-to-end RL.
Not Started A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence ❌ No The first systematic survey of self-evolving agents, organized by what, when, and how agents evolve.