Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
SpaceManNabs's favorites
login
submissions
|
comments
1.
Training LLMs to Reason in a Continuous Latent Space
(
arxiv.org
)
283 points
by
omarsar
26 days ago
|
114 comments
2.
Grifters, believers, grinders, and coasters
(
seangoedecke.com
)
232 points
by
rbanffy
32 days ago
|
118 comments
3.
Genie 2: A large-scale foundation world model
(
deepmind.google
)
1247 points
by
meetpateltech
32 days ago
|
410 comments
4.
Fraud, so much fraud
(
science.org
)
1528 points
by
nabla9
3 months ago
|
823 comments
5.
Synchronizing Pong to music with constrained optimization
(
victortao.substack.com
)
321 points
by
platers
4 months ago
|
37 comments
6.
Revisiting the Classics: Jensen's Inequality (2023)
(
francisbach.com
)
89 points
by
cpp_frog
4 months ago
|
8 comments
7.
Transformers in music recommendation
(
research.google
)
211 points
by
panarky
4 months ago
|
125 comments
8.
Seven basic rules for causal inference
(
pedermisager.org
)
218 points
by
RafelMri
4 months ago
|
67 comments
9.
Are Emergent Abilities in Large Language Models Just In-Context Learning?
(
arxiv.org
)
27 points
by
croes
4 months ago
10.
Does Reasoning Emerge? Probabilities of Causation in Large Language Models
(
arxiv.org
)
165 points
by
belter
4 months ago
|
192 comments
11.
Grokked Transformers Are Implicit Reasoners
(
arxiv.org
)
239 points
by
jasondavies
7 months ago
|
61 comments
12.
Show HN: I built a game to help you learn neural network architectures
(
sabrina.dev
)
321 points
by
sabrina_ramonov
7 months ago
|
62 comments
13.
TimesFM: Time Series Foundation Model for time-series forecasting
(
github.com/google-research
)
317 points
by
yeldarb
8 months ago
|
118 comments
14.
Deep Reinforcement Learning: Zero to Hero
(
github.com/alessiodm
)
535 points
by
alessiodm
8 months ago
|
47 comments
15.
Kolmogorov-Arnold Networks
(
github.com/kindxiaoming
)
568 points
by
sumo43
8 months ago
|
142 comments
16.
A Visual Guide to Vision Transformers
(
mdturp.ch
)
237 points
by
md2rp
8 months ago
|
26 comments
17.
Histograms for Probability Density Estimation: A Primer
(
vvanirudh.github.io
)
27 points
by
vvanirudh
9 months ago
|
12 comments
18.
Implementation of Google's Griffin Architecture – RNN LLM
(
github.com/google-deepmind
)
218 points
by
milliondreams
9 months ago
|
38 comments
19.
A generalist AI agent for 3D virtual environments
(
deepmind.google
)
559 points
by
nuz
9 months ago
|
310 comments
20.
Diffusion models from scratch, from a new theoretical perspective
(
chenyang.co
)
379 points
by
jxmorris12
10 months ago
|
40 comments
21.
What Is a Schur Decomposition? (2022)
(
nhigham.com
)
90 points
by
vector_spaces
10 months ago
|
27 comments
22.
Mamba: The Easy Way
(
jackcook.com
)
279 points
by
jackcook
10 months ago
|
60 comments
23.
Gemma: New Open Models
(
blog.google
)
1129 points
by
meetpateltech
10 months ago
|
509 comments
24.
The killer app of Gemini Pro 1.5 is using video as an input
(
simonwillison.net
)
1136 points
by
simonw
10 months ago
|
484 comments
25.
Stable Diffusion 3
(
stability.ai
)
983 points
by
reqo
10 months ago
|
693 comments
26.
Training LLMs to generate text with citations via fine-grained rewards
(
arxiv.org
)
170 points
by
PaulHoule
10 months ago
|
34 comments
27.
Sora: Creating video from text
(
openai.com
)
3647 points
by
davidbarker
10 months ago
|
2231 comments
28.
Higher Order Derivatives of Transforms
(
nosferalatu.com
)
87 points
by
nosferalatu123
11 months ago
|
24 comments
29.
LLM in a Flash: Efficient LLM Inference with Limited Memory
(
huggingface.co
)
252 points
by
ghshephard
on Dec 20, 2023
|
53 comments
30.
VideoPoet A large language model for zero-shot video generation
(
research.google
)
126 points
by
fchyan
on Dec 19, 2023
|
40 comments
More
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: