Decision-making under uncertainty, local inference acceleration, and the gap between technical capability and deployment realities.
Source: arXiv cs.CL | URL: https://arxiv.org/abs/2604.03216
Standard metrics don't capture the decision cost of overconfident errors. BAS introduces an asymmetric penalty that prioritises avoiding overconfident mistakes — which is what actually matters when models need to abstain from answering.
Why it matters: Bridges the gap between accurate confidence reporting and useful for actual decisions. Even frontier models remain prone to severe overconfidence despite decent calibration scores.
Source: HackerNews (Show HN) | URL: https://github.com/arman-bd/guppylm
Educational projects that make complex systems comprehensible are undervalued. A tiny, working implementation you can actually trace through fills the gap between Ive read about attention" and "I understand whats happening.
Source: HackerNews | URL: https://sschueller.github.io/posts/the-free-market-lie/
Infrastructure policy matters more than free market dynamics for deployment outcomes. Switzerland's approach: municipal fibre as public infrastructure, not profit-maximising asset.
Source: HackerNews | URLs: Apple App Store | Parlor
On-device inference crossed a threshold — Gemma 4 running natively on iPhone and real-time audio/video input with voice output on consumer hardware. Privacy, latency, and capability converge.
Source: arXiv cs.LG | URL: https://arxiv.org/abs/2604.03226
Federated systems can be designed to tolerate adversarial majorities, not just random noise. Important for any scenario where you can't control client integrity but still want collaborative learning.