Hacker News | JonathanFly's favorites
1. Almost anything you give sustained attention to will begin to loop on itself (henrikkarlsson.xyz)
771 points by jger15 8 days ago | 223 comments
2. VibeVoice: A Frontier Open-Source Text-to-Speech Model (microsoft.github.io)
448 points by lastdong 9 days ago | 170 comments
3. The decline of high-tech manufacturing in the United States (waldrn.com)
124 points by giuliomagnifico 25 days ago | 192 comments
4. FFmpeg 8.0 adds Whisper support (ffmpeg.org)
1033 points by rilawa 30 days ago | 325 comments
5. Training language models to be warm and empathetic makes them less reliable (arxiv.org)
358 points by Cynddl 31 days ago | 375 comments
6. Diffusion language models are super data learners (jinjieni.notion.site)
218 points by babelfish 33 days ago | 16 comments
7. Open music foundation models for full-song generation (map-yue.github.io)
121 points by selvan 39 days ago | 74 comments
8. Show HN: KVoiceWalk – Voice cloning for Kokoro TTS using random walk algorithms (github.com/robviren)
13 points by robviren 3 months ago
9. The unreasonable effectiveness of an LLM agent loop with tool use (sketch.dev)
447 points by crawshaw 4 months ago | 320 comments
10. [dupe] Embeddings are underrated (2024) (technicalwriting.dev)
484 points by jxmorris12 4 months ago | 150 comments
11. ACE-Step: A step towards music generation foundation model (github.com/ace-step)
109 points by wertyk 4 months ago | 52 comments
12. Dummy's Guide to Modern LLM Sampling (rentry.co)
228 points by nkko 4 months ago | 37 comments
13. Llasa: Llama-Based Speech Synthesis (llasatts.github.io)
168 points by CalmStorm 4 months ago | 22 comments
14. Ask HN: Share your AI prompt that stumps every model
440 points by owendarko 4 months ago | 633 comments
15. DeepMind releases Lyria 2 music generation model (deepmind.google)
300 points by velcrobeg 4 months ago | 426 comments
16. Show HN: Dia, an open-weights TTS model for generating realistic dialogue (github.com/nari-labs)
652 points by toebee 4 months ago | 190 comments
17. Packing Input Frame Context in Next-Frame Prediction Models for Video Generation (lllyasviel.github.io)
270 points by GaggiX 4 months ago | 27 comments
18. AudioX: Diffusion Transformer for Anything-to-Audio Generation (zeyuet.github.io)
148 points by gnabgib 5 months ago | 19 comments
19. Orpheus-3B – Emotive TTS by Canopy Labs (canopylabs.ai)
186 points by Zetaphor 5 months ago | 39 comments
20. The year I didn't survive (bessstillman.substack.com)
949 points by LaurenSerino 7 months ago | 262 comments
21. Building a personal, private AI computer on a budget (ewintr.nl)
384 points by thm 7 months ago | 231 comments
22. S1: A $6 R1 competitor? (timkellogg.me)
851 points by tkellogg 7 months ago | 416 comments
23. DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL (arxiv.org)
1351 points by gradus_ad 7 months ago | 1056 comments
24. DeepSeek-R1 (github.com/deepseek-ai)
1843 points by meetpateltech 7 months ago | 663 comments
25. Titans: Learning to Memorize at Test Time (arxiv.org)
161 points by bicepjai 7 months ago | 35 comments
26. Entropy of a Large Language Model output (nikkin.dev)
161 points by woodglyst 8 months ago | 63 comments
27. AI founders will learn the bitter lesson (lukaspetersson.com)
337 points by gsky 8 months ago | 263 comments
28. Can LLMs write better code if you keep asking them to “write better code”? (minimaxir.com)
812 points by rcarmo 8 months ago | 439 comments
29. OpenAI O3 breakthrough high score on ARC-AGI-PUB (arcprize.org)
1724 points by maurycy 8 months ago | 1755 comments
30. Veo 2: Our video generation model (deepmind.google)
587 points by mvoodarla 9 months ago | 327 comments