NewsNook

nesting hacker news in a more meaningful way

Why LLM-as-judge fails for code evaluation. Here's what works.

Meltdown: LLM Client Made in Python and Tk

Show HN: Nexa-gauge – Cache/cost-aware graph-based eval for LLM and RAG

Show HN: [Video] Tribute to LLM releases in April 2026

Humanity is self-deprecating

How Do You Know If a Skill Is Any Good? LLM-as-Judge Scoring

The Very American, Intense World of High-School Debate

Everything Vault – a local-first Markdown knowledge system for LLMs

Show HN: Openloom – Turn Loom links into transcripts and frames an LLM can watch

The Big Book of LLMs

Show HN: UltraCompress – first mathematically lossless 5-bit LLM compression

Sparser, Faster, Lighter Transformer Language Models

AMÁLIA and the future of European Portuguese LLMs

Can LLMs model real-world systems in TLA+?