|
87f75f2dfe
|
Fix huggingface download error
|
2024-08-18 06:46:44 +00:00 |
|
|
9b8fe37cd1
|
Update requirements.txt
|
2024-08-18 06:21:38 +00:00 |
|
|
eef8ae1477
|
Update Dockerfile
|
2024-06-15 18:59:00 +00:00 |
|
|
e6df81ae9d
|
Add docker-compose.yml
|
2024-06-15 18:32:23 +00:00 |
|
|
bbb1d210aa
|
Add Dockerfile
|
2024-06-15 18:31:53 +00:00 |
|
|
26a0ea81c0
|
Update requirements.txt
|
2024-06-15 17:12:47 +00:00 |
|
|
a24271d48e
|
Update data/download_the_pile.sh
|
2024-06-14 09:08:55 +00:00 |
|
|
5635f7d08d
|
Update data/download_wudao.sh
|
2024-06-14 09:08:03 +00:00 |
|
LiangSong
|
0157b6938d
|
update readme
|
2023-05-17 22:45:04 +07:00 |
|
LiangSong
|
95973b5de1
|
update header
|
2023-05-17 22:21:46 +07:00 |
|
LiangSong
|
d269affb42
|
update readme
|
2023-05-17 21:17:49 +07:00 |
|
s-JoL
|
6988c69884
|
Merge pull request #64 from eltociear/patch-1
Update README.md
|
2023-05-16 22:24:13 +07:00 |
|
Ikko Eltociear Ashimine
|
7bacd6cb93
|
Update README.md
HuggingFace -> Hugging Face
|
2023-05-17 00:11:35 +09:00 |
|
LiangSong
|
77b1c552c3
|
add discord invite link
|
2023-05-15 23:00:25 +07:00 |
|
LiangSong
|
82c845a8ce
|
update readme
|
2023-05-15 00:21:13 +08:00 |
|
LiangSong
|
1ce8c18d83
|
add logo
|
2023-05-14 10:52:43 +08:00 |
|
LiangSong
|
52e8df9a8d
|
update readme
|
2023-05-14 10:48:49 +08:00 |
|
LiangSong
|
a07d9b0ac8
|
update readme
|
2023-05-14 01:06:03 +08:00 |
|
LiangSong
|
bf2cac0a45
|
update config
|
2023-05-14 01:00:50 +08:00 |
|
LiangSong
|
e18ead00cc
|
update server
|
2023-05-12 15:07:46 +08:00 |
|
LiangSong
|
7231d53ca4
|
update readme add new model
|
2023-05-12 11:32:42 +08:00 |
|
LiangSong
|
ceb1fd067b
|
update vocab_size
|
2023-05-11 14:15:12 +08:00 |
|
LiangSong
|
73dafa7ad6
|
add rounding vocab_size
|
2023-05-10 17:49:52 +08:00 |
|
LiangSong
|
26f7421f05
|
add star history to readme
|
2023-05-10 15:52:55 +08:00 |
|
LiangSong
|
72a6f81b61
|
update readme
|
2023-05-09 18:47:29 +08:00 |
|
LiangSong
|
7d505ea303
|
update readme
|
2023-05-09 17:03:13 +08:00 |
|
LiangSong
|
59b79af9d7
|
add comment
|
2023-05-09 16:53:05 +08:00 |
|
LiangSong
|
f6ac834ef9
|
update default config
|
2023-05-09 15:16:50 +08:00 |
|
LiangSong
|
21fdd25b94
|
update Citation
|
2023-05-09 15:12:49 +08:00 |
|
LiangSong
|
30ab306c56
|
update readme
|
2023-05-09 15:06:47 +08:00 |
|
LiangSong
|
32583a41a7
|
update wudao download and preprocess
|
2023-05-09 14:47:59 +08:00 |
|
LiangSong
|
7dc90c2558
|
fix typo
|
2023-05-09 10:46:11 +08:00 |
|
LiangSong
|
6814fdb59e
|
support gradient ckpt for peft
|
2023-05-08 23:40:03 +08:00 |
|
LiangSong
|
3ba0c77053
|
update optimizer for lora
|
2023-05-08 22:56:37 +08:00 |
|
LiangSong
|
58586112c1
|
fix table
|
2023-05-08 22:30:02 +08:00 |
|
LiangSong
|
16811d0efe
|
update readme
|
2023-05-08 22:29:24 +08:00 |
|
LiangSong
|
92caa94490
|
support peft
|
2023-05-08 22:26:39 +08:00 |
|
LiangSong
|
7da40f1c83
|
fix typo
|
2023-05-08 19:00:06 +08:00 |
|
LiangSong
|
2df3e622e9
|
update readme
|
2023-05-08 18:59:01 +08:00 |
|
LiangSong
|
ec2b4d6ee7
|
fix split by shard bug
|
2023-05-08 14:03:05 +08:00 |
|
LiangSong
|
4a1e7bb44b
|
Optimized the structure of configs, added support for deepspeed stage3, reduced memory usage by using Auto class to load models, and added support for training 65B models.
|
2023-05-06 23:37:17 +08:00 |
|
LiangSong
|
5b1f6a4861
|
fix epoch bug
|
2023-05-06 09:45:37 +08:00 |
|
LiangSong
|
f893a0f5b8
|
update dataset
|
2023-05-05 19:23:16 +08:00 |
|
LiangSong
|
758af69c73
|
update science instruct-tuning datasets
|
2023-05-05 19:00:37 +08:00 |
|
LiangSong
|
d24b4cce54
|
update preprocess format
|
2023-05-05 18:20:59 +08:00 |
|
LiangSong
|
85caa97a6a
|
add xP3 dataset and belle_2M
|
2023-05-05 17:05:41 +08:00 |
|
LiangSong
|
00cbdbbf26
|
fix typo
|
2023-05-04 22:55:40 +08:00 |
|
LiangSong
|
693e3970d9
|
update readme
|
2023-05-04 22:54:10 +08:00 |
|
LiangSong
|
fbb7997607
|
fix typo
|
2023-05-04 22:32:15 +08:00 |
|
LiangSong
|
98ffab3a97
|
update readme and add half to server
|
2023-05-04 22:28:36 +08:00 |
|