Tiny Shakespeare, of the good old char-RNN fame (github.com/karpathy)
4 points by Bluestein 67 days ago | past

Karpathy/Nano-Llama31 (github.com/karpathy)
74 points by tim_sw 3 months ago | past | 1 comment

Nano-Llama31 (github.com/karpathy)
3 points by yeldarb 3 months ago | past

Karpathy: Let's reproduce GPT-2 (1.6B): one 8XH100 node 24h $672 in llm.c (github.com/karpathy)
182 points by alecco 4 months ago | past | 58 comments

GitHub – Karpathy/LLM101n: LLM101n: Let's Build a Storyteller (github.com/karpathy)
61 points by bilsbie 4 months ago | past | 7 comments

NanoGPT: The simplest, fastest repository for training medium-sized GPTs (github.com/karpathy)
114 points by ulrischa 5 months ago | past | 21 comments

karpathy/build-nanogpt: Video + code lecture on building nanoGPT from scratch (github.com/karpathy)
9 points by codewiz 5 months ago | past | 3 comments

Reproducing GPT-2 (124M) in llm.c in 90 minutes for $20 (github.com/karpathy)
3 points by georgehill 5 months ago | past

Reproducing GPT-2 in llm.c (github.com/karpathy)
618 points by tosh 5 months ago | past | 117 comments

Llm.c State of the Union (github.com/karpathy)
1 point by neeleshs 6 months ago | past

Layernorm (github.com/karpathy)
3 points by sva_ 7 months ago | past

Full forward pass of GPT-2 in one file of pure CUDA (github.com/karpathy)
63 points by tosh 7 months ago | past | 4 comments

Llm.c – LLM training in simple, pure C/CUDA (github.com/karpathy)
1050 points by tosh 7 months ago | past | 169 comments

Karpathy: SVM vs. K-NN on Embeddings (github.com/karpathy)
1 point by skanderbm 8 months ago | past

Code for the Byte Pair Encoding algorithm, commonly used in LLM tokenization (github.com/karpathy)
81 points by magoghm 8 months ago | past | 31 comments

Karpathy removes llama licence from llama2.c (github.com/karpathy)
3 points by orwellg1984 on July 26, 2023 | past

Llama2.c: Inference llama 2 in one file of pure C (github.com/karpathy)
707 points by anjneymidha on July 23, 2023 | past | 165 comments

KNN vs. SVM (github.com/karpathy)
3 points by tosh on April 15, 2023 | past

Neural Networks: Zero to Hero (github.com/karpathy)
1 point by greenSunglass on Jan 24, 2023 | past

NanoGPT (github.com/karpathy)
1532 points by trekhleb on Jan 11, 2023 | past | 320 comments

The simplest, fastest repository for training and fine-tuning medium-sized GPTs (github.com/karpathy)
2 points by Terretta on Jan 10, 2023 | past

nanoGPT: The simplest repository for training medium-sized GPTs (github.com/karpathy)
3 points by isoprophlex on Jan 3, 2023 | past | 1 comment

Micrograd: A Tiny Autograd Engine (github.com/karpathy)
3 points by memorable on Nov 4, 2022 | past

Neural Networks: Zero to Hero (github.com/karpathy)
3 points by gzer0 on Sept 12, 2022 | past

An autoregressive character-level language model for making more things (github.com/karpathy)
2 points by phsilva on Sept 8, 2022 | past | 1 comment

MinGPT: Minimal PyTorch re-implementation of GPT (github.com/karpathy)
223 points by memorable on Sept 6, 2022 | past | 24 comments

ArXiv-sanity lite: get recommendations of similar papers (github.com/karpathy)
3 points by ofou on March 28, 2022 | past

Pure Python from-scratch zero-dependency implementation of Bitcoin (github.com/karpathy)
2 points by ofou on June 22, 2021 | past

Karpathy's MinGPT (github.com/karpathy)
374 points by aliabd on Aug 17, 2020 | past | 102 comments

A tiny scalar-valued autograd engine (github.com/karpathy)
2 points by mooreds on April 26, 2020 | past