589689.xyz

llm.c-0e69e3a

  • Indexed: 2024-08-22 03:27:48
  • File size: 999KB
  • Downloads: 1
  • Last downloaded: 2024-08-22 03:27:48
  • Magnet link:

File list

  1. train_gpt2.cu 87KB
  2. train_gpt2_fp32.cu 75KB
  3. dev/cuda/layernorm_backward.cu 70KB
  4. dev/cuda/attention_forward.cu 53KB
  5. train_gpt2.c 50KB
  6. dev/cuda/attention_backward.cu 48KB
  7. train_gpt2.py 41KB
  8. dev/cuda/classifier_fused.cu 36KB
  9. dev/cuda/fused_residual_forward.cu 27KB
  10. dev/cuda/trimat_forward.cu 27KB
  11. dev/cuda/matmul_backward_bias.cu 27KB
  12. dev/cuda/softmax_forward.cu 24KB
  13. llmc/dataloader.h 21KB
  14. dev/cuda/layernorm_forward.cu 20KB
  15. llmc/layernorm.cuh 19KB
  16. doc/layernorm/layernorm.md 18KB
  17. dev/cuda/matmul_forward.cu 18KB
  18. test_gpt2.cu 15KB
  19. README.md 13KB
  20. llmc/attention.cuh 13KB
  21. llmc/cudnn_att.cpp 12KB
  22. dev/cuda/common.h 12KB
  23. test_gpt2_fp32.cu 11KB
  24. dev/cuda/matmul_backward.cu 11KB
  25. llmc/encoder.cuh 11KB
  26. Makefile 10KB
  27. dev/cuda/adamw.cu 9KB
  28. llmc/matmul.cuh 9KB
  29. llmc/zero.cuh 9KB
  30. profile_gpt2cu.py 8KB
  31. dev/cuda/encoder_forward.cu 8KB
  32. llmc/cuda_utils.cuh 8KB
  33. test_gpt2.c 8KB
  34. dev/data/hellaswag.py 7KB
  35. dev/cuda/nccl_all_reduce.cu 7KB
  36. dev/cpu/matmul_forward.c 7KB
  37. llmc/rand.h 7KB
  38. dev/cuda/global_norm.cu 7KB
  39. dev/cuda/gelu_backward.cu 7KB
  40. dev/cuda/encoder_backward.cu 6KB
  41. llmc/mfu.h 6KB
  42. llmc/fused_classifier.cuh 6KB
  43. doc/layernorm/layernorm.c 6KB
  44. dev/cuda/crossentropy_softmax_backward.cu 6KB
  45. dev/data/mmlu.py 6KB
  46. dev/cuda/benchmark_on_modal.py 6KB
  47. dev/cuda/gelu_forward.cu 5KB
  48. dev/cuda/residual_forward.cu 5KB
  49. llmc/utils.h 5KB
  50. dev/cuda/crossentropy_forward.cu 5KB
  51. dev/unistd.h 5KB
  52. dev/data/data_common.py 5KB
  53. llmc/adamw.cuh 4KB
  54. dev/vislog.ipynb 4KB
  55. dev/data/fineweb.py 4KB
  56. dev/data/tinystories.py 4KB
  57. llmc/tokenizer.h 4KB
  58. llmc/cuda_common.h 4KB
  59. scripts/README.md 3KB
  60. llmc/gelu.cuh 3KB
  61. llmc/global_norm.cuh 3KB
  62. dev/cuda/Makefile 2KB
  63. dev/cuda/README.md 2KB
  64. dev/data/tinyshakespeare.py 2KB
  65. profile_gpt2.cu 2KB
  66. doc/layernorm/layernorm.py 2KB
  67. llmc/logger.h 2KB
  68. llmc/cublas_common.h 1KB
  69. scripts/run_gpt2_774M.sh 1KB
  70. scripts/run_gpt2_350M.sh 1KB
  71. scripts/run_gpt3_124M.sh 1KB
  72. scripts/run_gpt2_124M.sh 1KB
  73. llmc/sampler.h 1KB
  74. LICENSE 1KB
  75. scripts/pyrun_gpt2_124M.sh 876B
  76. llmc/cudnn_att.h 799B
  77. dev/data/README.md 618B
  78. requirements.txt 57B