Pull requests: karpathy/llm.c

Pull requests list

Overlap computation and communication V2
#574 opened Jun 9, 2024 by ngc92
Dataloader - introducing randomness
#573 opened Jun 7, 2024 by gordicaleksa
Consolidate memory
#565 opened Jun 7, 2024 by ngc92
Fix the compiler warnings and errors
#561 opened Jun 6, 2024 by lancerts
Utilities for cuda streams + disk IO
#556 opened Jun 5, 2024 by ngc92 Draft
added reading checkpoint files
#554 opened Jun 5, 2024 by morphpiece
Add master weights to resume state
#522 opened Jun 2, 2024 by gordicaleksa
add edu fineweb support, with 10B and 100B version
#517 opened Jun 2, 2024 by eliebak
Added packed layernorm_forward
#513 opened Jun 2, 2024 by ChrisDryden
adding wsd schedule with (1-sqrt) decay
#508 opened Jun 1, 2024 by eliebak
Add DockerFile
#501 opened May 30, 2024 by banyan-god
Realtime training visualization using wandb
#489 opened May 29, 2024 by chinthysl
train_gpt2.c: Add gpt2_write_to_checkpoint method
#467 opened May 26, 2024 by faxe1008
.gitignore: ignore more for windows devs
#466 opened May 26, 2024 by nietras