1. | | You could have designed state of the art positional encoding (fleetwood.dev) |
|
216 points by Philpax 86 days ago | 46 comments
|
2. | | Show HN: Tips.io – A Tailwind playground with AI, page management, and theming (tips.io) |
|
291 points by TIPSIO 85 days ago | 62 comments
|
3. | | Transfusion: Predict the next token and diffuse images with one multimodal model (arxiv.org) |
|
122 points by fzliu 5 months ago | 10 comments
|
4. | | Diffusion Is Spectral Autoregression (sander.ai) |
|
223 points by ackbar03 5 months ago | 62 comments
|
5. | | Show HN: LLM-aided OCR – Correcting Tesseract OCR errors with LLMs (github.com/dicklesworthstone) |
|
479 points by eigenvalue 6 months ago | 172 comments
|
6. | | Diffusion Forcing: Next-Token Prediction Meets Full-Sequence Diffusion (boyuan.space) |
|
214 points by magoghm 7 months ago | 12 comments
|
7. | | The Geometry of Categorical and Hierarchical Concepts in Large Language Models (arxiv.org) |
|
123 points by Anon84 8 months ago | 15 comments
|
8. | | Scalable MatMul-Free Language Modeling (arxiv.org) |
|
205 points by lykahb 8 months ago | 30 comments
|
9. | | LoRA Learns Less and Forgets Less (arxiv.org) |
|
177 points by wolecki 9 months ago | 60 comments
|
10. | | Zilog Z80 CPU – Modern, free and open source silicon clone (github.com/rejunity) |
|
362 points by jnord 9 months ago | 71 comments
|
11. | | Our humble attempt at “how much data do you need to fine-tune” (barryzhang.substack.com) |
|
149 points by gnahzby on Sept 24, 2023 | 37 comments
|
12. | | Echoes of Electromagnetism Found in Number Theory (quantamagazine.org) |
|
178 points by EA-3167 on Oct 12, 2023 | 95 comments
|
13. | | LLM Leaderboard (lmsys.org) |
|
41 points by olalonde on Sept 22, 2023 | 17 comments
|
14. | | RAG is more than just embedding search (jxnl.github.io) |
|
151 points by jxnlco on Sept 21, 2023 | 58 comments
|
15. | | Show HN: Build AI DAGs with Memory; Run and Validate LLM Tools in Containers (github.com/griptape-ai) |
|
48 points by vasinov on April 21, 2023 | 11 comments
|
16. | | How does Linux NAT a ping? (devnonsense.com) |
|
328 points by willdaly on Sept 10, 2023 | 105 comments
|
17. | | Show HN: RAGstack – private ChatGPT for enterprise VPCs, built with Llama 2 (github.com/psychic-api) |
|
84 points by ayanb9440 on July 20, 2023 | 30 comments
|
18. | | Functions are vectors (thenumb.at) |
|
432 points by TheNumbat on July 29, 2023 | 120 comments
|
19. | | So you want to build your own open source chatbot (hacks.mozilla.org) |
|
328 points by edo-codes on July 29, 2023 | 122 comments
|
20. | | LlamaIndex: Unleash the power of LLMs over your data (llamaindex.ai) |
|
208 points by danboarder on July 8, 2023 | 55 comments
|
21. | | How long can open-source LLMs truly promise on context length? (lmsys.org) |
|
189 points by dacheng2 on June 29, 2023 | 62 comments
|
22. | | The Secret Sauce behind 100K context window in LLMs: all tricks in one place (gopenai.com) |
|
474 points by T-A on June 17, 2023 | 99 comments
|
23. | | Dust XP1 switches to GPT-3.5-turbo, is now free to use (dust.tt) |
|
99 points by ukuina on March 8, 2023 | 50 comments
|
24. | | The Missing Semester of Your CS Education (csail.mit.edu) |
|
1024 points by saikatsg on Feb 25, 2023 | 336 comments
|
25. | | Modeling Starlink Capacity (mikepuchol.com) |
|
234 points by walterbell on Oct 8, 2022 | 82 comments
|
26. | | Ask HN: Help me pick a front-end framework |
|
141 points by bjackman on Sept 11, 2022 | 176 comments
|
27. | | Reddit's favorite products in one place (looria.com) |
|
839 points by mooreds on Sept 8, 2022 | 253 comments
|
28. | | Ask HN: Self-hosted open source IP security cameras? |
|
80 points by DietaryNonsense on March 3, 2022 | 43 comments
|
29. | | My self-hosting infrastructure, fully automated (github.com/khuedoan) |
|
716 points by rmbryan on Jan 21, 2022 | 222 comments
|
30. | | Ask HN: Let's build an HN uBlacklist to improve our Google search results? |
|
374 points by sanketpatrikar on Jan 4, 2022 | 267 comments
|
|
|
More |