Daily AI Digest — 2026-04-12

🔬 Deep Dives

GitHub NousResearch / hermes-agent

2026-04-12 · 4 pts · 3 comments · ⭐ 6.4k today

⭐ 6.4k GitHub stars today · ⭐ 61.4k total stars

The agent that grows with you

GitHub HKUDS / DeepTutor

2026-04-12 · 2 pts · ⭐ 837 today

⭐ 837 GitHub stars today · ⭐ 17.0k total stars

DeepTutor: Agent-Native Personalized Learning Assistant

arXiv OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks

2026-04-09 · 139 pts · cs.CV · cs.AI · Wenbo Hu, Xin Chen…

📄 New in cs.CV, cs.AI

Group Relative Policy Optimization (GRPO) has emerged as the de facto Reinforcement Learning (RL) objective driving recent advancements in Multimodal Large Language Models. However, extending this success to open-source multimodal generalist models remains heavily constrained by two primary challenges: the extreme variance in reward topologies across diverse visual tasks, and the inherent difficulty of balancing fine-grained perception with multi-step reasoning capabilities. Leveraging the enhanced training stability provided by G$^2$RPO, we introduce two task-level shaping mechanisms to seamlessly balance perception and reasoning.

GitHub pathwaycom / llm-app

2026-04-12 · 11 pts · 8 comments · ⭐ 45 today

⭐ 45 stars today on GitHub · ⭐ 60.1k total stars

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳 Docker-friendly. ⚡ Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

GitHub Arindam200 / awesome-ai-apps

2026-04-12 · 6 pts · 5 comments · ⭐ 71 today

⭐ 71 stars today on GitHub · ⭐ 10.0k total stars

A collection of projects showcasing RAG, agents, workflows, and other AI use cases

⚡ Quick Signals

Python / Agent

arXiv SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds 77 pts · 📄 New in cs.RO, cs.AI

arXiv When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models 34 pts · 📄 New in cs.CV

HF HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents 391 pts

HF SkillClaw: Let Skills Evolve Collectively with Agentic Evolver 374 pts

GitHub agentscope-ai / agentscope 2 pts · ⭐ 67 stars today on GitHub · ⭐ 23.4k total stars

arXiv ViVa: A Video-Generative Value Model for Robot Reinforcement Learning 22 pts · 📄 New in cs.RO, cs.AI

arXiv ClawBench: Can AI Agents Complete Everyday Online Tasks? 16 pts · 📄 New in cs.CL, cs.AI

arXiv Act Wisely: Cultivating Meta-Cognitive Tool Use in Agentic Multimodal Models 6 pts · 📄 New in cs.CV, cs.AI

HN I run multiple $10K MRR companies on a $20/month tech stack 🔥 49 pts on Hacker News

arXiv MolmoWeb: Open Visual Web Agent and Open Data for the Open Web 1 pt · 📄 New in cs.CV

Dmax / Decoding

HF DMax: Aggressive Parallel Decoding for dLLMs 81 pts

Appeals / Court

HN US appeals court declares 158-year-old home distilling ban unconstitutional 103 pts · 💬 Active HN discussion (69 comments)

Eleventy / End

HN The End of Eleventy 141 pts · 💬 Active HN discussion (88 comments)

Found / Mythos

HN Small models also found the vulnerabilities that Mythos found 1.0k pts · 💬 Major HN discussion (278 comments) · 🔥 Trendi…

Class / Feed

RSS Mastering Claude AI: The Guide You Need to Transform the Way You Work 🆕 New article
