🔬 Deep Dives
Embodied / Python
2026-04-11 · 4 pts · 3 comments · ⭐ 7.7k today
⭐ 7.7k GitHub stars today · ⭐ 54.1k total stars
The agent that grows with you
2026-04-11 · 16 pts · 2 comments · ⭐ 292 today
⭐ 292 GitHub stars today · ⭐ 61.0k total stars
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.
2026-04-09 · 139 pts · cs.CV · cs.AI · Wenbo Hu, Xin Chen…
📄 New in cs.CV, cs.AI
Group Relative Policy Optimization (GRPO) has emerged as the de facto Reinforcement Learning (RL) objective driving recent advancements in Multimodal Large Language Models. However, extending this success to open-source multimodal generalist models remains heavily constrained by two primary challenges: the extreme variance in reward topologies across diverse visual tasks, and the inherent difficulty of balancing fine-grained perception with multi-step reasoning capabilities. Leveraging the enhanced training stability provided by G$^2$RPO, we introduce two task-level shaping mechanisms to seamlessly balance perception and reasoning.
2026-04-11 · 2 pts · ⭐ 1.4k today
⭐ 1.4k GitHub stars today · ⭐ 16.2k total stars
"DeepTutor: Agent-Native Personalized Learning Assistant"
Machine / Rasbt
2026-04-11 · 228 pts · 124 comments · ⭐ 7 today
💬 Major HN discussion (124 comments)
Code Repository for Machine Learning with PyTorch and Scikit-Learn
⚡ Quick Signals
Embodied / Python
Artemis / Safely
Filing / Corners
Dmax / Decoding
Installing / Firefox