skip to main content
π°
NewsNook
nesting hacker news in a more meaningful way
β‘
menu
(β/)
LLM / page 34
newer
older
your nooks
add yours
Why LLM-as-judge fails for code evaluation. Here's what works.
Meltdown: LLM Client Made in Python and Tk
Show HN: Nexa-gauge β Cache/cost-aware graph-based eval for LLM and RAG
Show HN: [Video] Tribute to LLM releases in April 2026
Humanity is self-deprecating
How Do You Know If a Skill Is Any Good? LLM-as-Judge Scoring
The Very American, Intense World of High-School Debate
Everything Vault β a local-first Markdown knowledge system for LLMs
Show HN: Openloom β Turn Loom links into transcripts and frames an LLM can watch
The Big Book of LLMs
Show HN: UltraCompress β first mathematically lossless 5-bit LLM compression
Sparser, Faster, Lighter Transformer Language Models
AMΓLIA and the future of European Portuguese LLMs
Can LLMs model real-world systems in TLA+?