🔬 Deep Dives
Llms / Dive
2026-04-15 · 4 pts · 3 comments · ⭐ 8.3k today
⭐ 8.3k GitHub stars today · ⭐ 86.8k total stars
The agent that grows with you
2026-04-15 · 147 pts · 17 comments · ⭐ 35 today
⭐ 35 stars today on GitHub · ⭐ 25.1k total stars
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
2026-04-14 · 11 pts · cs.CL · Guoxin Chen, Jie Chen…
📄 New in cs.CL
We introduce AiScientist, a system for autonomous long-horizon engineering for ML research built on a simple principle: strong long-horizon performance requires both structured orchestration and durable state continuity. Ablation studies further show that File-as-Bus protocol is a key driver of performance, reducing PaperBench by 6.41 points and MLE-Bench Lite by 31.82 points when removed. These results suggest that long-horizon ML research engineering is a systems problem of coordinating specialized work over durable project state, rather than a purely local reasoning problem.
2026-04-15 · 15 pts · 2 comments · ⭐ 162 today
⭐ 162 GitHub stars today · ⭐ 76.7k total stars
A high-throughput and memory-efficient inference and serving engine for LLMs
Judge / Response
2026-04-14 · 177 pts · cs.CL · Liran Ringel, Yaniv Romano
📄 New in cs.CL
Speculative decoding accelerates autoregressive language models by using a lightweight drafter to propose multiple future tokens, which the target model then verifies in parallel. DFlash shows that a block diffusion drafter can generate an entire draft block in a single forward pass and achieve state-of-the-art speculative decoding performance, outperforming strong autoregressive drafters such as EAGLE-3. We introduce DDTree (Diffusion Draft Tree), a method that constructs a draft tree directly from the per-position distributions of a block diffusion drafter.
⚡ Quick Signals
Llms / Dive
Judge / Response
Stop / Flock
HN
Stop Flock
624 pts · 💬 Major HN discussion (161 comments) · 🔥 Trendi…
Amazon / Acquire
Fixing / Year
Action / Lary