🔬 Deep Dives
2026-04-12 · 4 pts · 3 comments · ⭐ 6.4k today
⭐ 6.4k GitHub stars today · ⭐ 61.4k total stars
The agent that grows with you
2026-04-12 · 2 pts · ⭐ 837 today
⭐ 837 GitHub stars today · ⭐ 17.0k total stars
"DeepTutor: Agent-Native Personalized Learning Assistant"
2026-04-09 · 139 pts · cs.CV · cs.AI · Wenbo Hu, Xin Chen…
📄 New in cs.CV, cs.AI
Group Relative Policy Optimization (GRPO) has emerged as the de facto Reinforcement Learning (RL) objective driving recent advancements in Multimodal Large Language Models. However, extending this success to open-source multimodal generalist models remains heavily constrained by two primary challenges: the extreme variance in reward topologies across diverse visual tasks, and the inherent difficulty of balancing fine-grained perception with multi-step reasoning capabilities. Leveraging the enhanced training stability provided by G$^2$RPO, we introduce two task-level shaping mechanisms to seamlessly balance perception and reasoning.
2026-04-12 · 11 pts · 8 comments · ⭐ 45 today
⭐ 45 stars today on GitHub · ⭐ 60.1k total stars
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
2026-04-12 · 6 pts · 5 comments · ⭐ 71 today
⭐ 71 stars today on GitHub · ⭐ 10.0k total stars
A collection of projects showcasing RAG, agents, workflows, and other AI use cases
⚡ Quick Signals
Python / Agent
Dmax / Decoding
Appeals / Court
Eleventy / End
Found / Mythos
Class / Feed