Daily AI Digest — 2026-04-14

🔬 Deep Dives

GitHub NousResearch / hermes-agent

2026-04-14 · 4 pts · 3 comments · ⭐ 11.3k stars today · ⭐ 80.8k total stars

The agent that grows with you

GitHub microsoft / agent-lightning

2026-04-14 · 98 pts · 14 comments · ⭐ 55 stars today · ⭐ 16.8k total stars

The absolute trainer to light up AI agents.

GitHub HandsOnLLM / Hands-On-Large-Language-Models

2026-04-14 · 147 pts · 17 comments · ⭐ 37 stars today · ⭐ 25.1k total stars

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

arXiv General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks

2026-04-13 · 5 pts · cs.CL · cs.AI · Junlin Liu, Shengnan An…

📄 New in cs.CL, cs.AI

We introduce General365, a benchmark specifically designed to assess general reasoning in LLMs. By restricting background knowledge to a K-12 level, General365 explicitly decouples reasoning from specialized expertise. We envision General365 as a catalyst for advancing LLM reasoning beyond domain-specific tasks toward robust, general-purpose real-world scenarios.

arXiv Solving Physics Olympiad via Reinforcement Learning on Physics Simulators

2026-04-13 · 4 pts · cs.LG · cs.AI · Mihir Prabhudesai, Aryan Satpathy…

📄 New in cs.LG, cs.AI

Sciences such as physics lack large-scale QA datasets to effectively train reasoning-capable models. We generate random scenes in physics engines, create synthetic question-answer pairs from simulated interactions, and train LLMs using reinforcement learning on this synthetic data. The results demonstrate that physics simulators can act as scalable data generators, enabling LLMs to acquire deep physical reasoning skills beyond the limitations of internet-scale QA data.
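
The loop in this abstract (random scene, simulated outcome, question-answer pair, scalar reward) can be sketched in a few lines of Python. This is a toy illustration under loud assumptions, not the paper's code: the "scene" is a single 2D projectile, a hand-written Euler integrator stands in for a real physics engine, the names `simulate_projectile`, `make_qa_pair`, and `reward` are hypothetical, and the 2% tolerance is arbitrary. No RL training step is shown; the reward function only indicates what such a step might score.

```python
import math
import random

G = 9.81  # gravitational acceleration in m/s^2


def simulate_projectile(speed, angle_deg, dt=1e-4):
    """Step a point mass launched from flat ground until it lands.

    Stand-in for a physics engine (assumption). Returns (flight_time, range).
    """
    angle = math.radians(angle_deg)
    x, y = 0.0, 0.0
    vx, vy = speed * math.cos(angle), speed * math.sin(angle)
    t = 0.0
    while True:
        # Semi-implicit Euler: update velocity first, then position.
        vy -= G * dt
        x += vx * dt
        y += vy * dt
        t += dt
        if y <= 0.0:
            return t, x


def make_qa_pair(rng):
    """Sample a random scene and turn the simulated outcome into a QA pair."""
    speed = rng.uniform(5.0, 30.0)       # launch speed in m/s
    angle_deg = rng.uniform(20.0, 70.0)  # launch angle in degrees
    _, distance = simulate_projectile(speed, angle_deg)
    question = (
        f"A ball is launched from flat ground at {speed:.1f} m/s, "
        f"{angle_deg:.1f} degrees above the horizontal. Ignoring air "
        f"resistance, how far away does it land (in metres)?"
    )
    return {
        "question": question,
        "answer": round(distance, 2),
        "speed": speed,
        "angle_deg": angle_deg,
    }


def reward(predicted, truth, rel_tol=0.02):
    """Binary verifiable reward: 1.0 if the prediction is within rel_tol of the truth."""
    return 1.0 if abs(predicted - truth) <= rel_tol * abs(truth) else 0.0


if __name__ == "__main__":
    rng = random.Random(0)
    qa = make_qa_pair(rng)
    print(qa["question"])
    print("simulated answer:", qa["answer"])
```

Because the answer comes from the simulator rather than from scraped text, the generator can be called as many times as needed, which is the scalability argument the abstract makes.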

⚡ Quick Signals

Claude / Fly

GitHub lyogavin / airllm 2 pts · ⭐ 263 stars today · ⭐ 15.9k total stars

arXiv SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context 1 pt · 📄 New in cs.AI, cs.CL

GitHub rasbt / LLMs-from-scratch 1 pt · ⭐ 84 stars today · ⭐ 90.7k total stars

arXiv OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation 📄 New in cs.CV

arXiv Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind 📄 New in cs.CL, cs.AI

HF SPASM: Stable Persona-driven Agent Simulation for Multi-turn Dialogue Generation 194 pts · 💬 Active HN discussion (56 comments)

arXiv ADD for Multi-Bit Image Watermarking 📄 New in stat.ML, cs.AI

HF Eliciting Medical Reasoning with Knowledge-enhanced Data Synthesis: A Semi-Supervised Reinforcement Learning Approach 81 pts

HN Can Claude Fly a Plane? 69 pts · 💬 Active HN discussion (58 comments)

arXiv Who Handles Orientation? Investigating Invariance in Feature Matching 📄 New in cs.CV

Spam / Policy

HN A new spam policy for "back button hijacking" 290 pts · 💬 Major HN discussion (167 comments)

DaVinci / Resolve

HN DaVinci Resolve releases Photo Editor 472 pts · 💬 Major HN discussion (111 comments)

Stacked / PRs

HN GitHub Stacked PRs 689 pts · 💬 Major HN discussion (361 comments) · 🔥 Trendi…

Continuous / Flow

HF Continuous Adversarial Flow Models 38 pts

Review / Class

RSS I Built an AI Code Review Pipeline — One Command for Review + Auto-Fix + Tests + Report 🆕 New article
