Commit Graph

142 Commits

Author SHA1 Message Date
87f75f2dfe Fix huggingface download error 2024-08-18 06:46:44 +00:00
9b8fe37cd1 Update requirements.txt 2024-08-18 06:21:38 +00:00
eef8ae1477 Update Dockerfile 2024-06-15 18:59:00 +00:00
e6df81ae9d Add docker-compose.yml 2024-06-15 18:32:23 +00:00
bbb1d210aa Add Dockerfile 2024-06-15 18:31:53 +00:00
26a0ea81c0 Update requirements.txt 2024-06-15 17:12:47 +00:00
a24271d48e Update data/download_the_pile.sh 2024-06-14 09:08:55 +00:00
5635f7d08d Update data/download_wudao.sh 2024-06-14 09:08:03 +00:00
LiangSong
0157b6938d update readme 2023-05-17 22:45:04 +07:00
LiangSong
95973b5de1 update header 2023-05-17 22:21:46 +07:00
LiangSong
d269affb42 update readme 2023-05-17 21:17:49 +07:00
s-JoL
6988c69884
Merge pull request #64 from eltociear/patch-1
Update README.md
2023-05-16 22:24:13 +07:00
Ikko Eltociear Ashimine
7bacd6cb93
Update README.md
HuggingFace -> Hugging Face
2023-05-17 00:11:35 +09:00
LiangSong
77b1c552c3 add discord invite link 2023-05-15 23:00:25 +07:00
LiangSong
82c845a8ce update readme 2023-05-15 00:21:13 +08:00
LiangSong
1ce8c18d83 add logo 2023-05-14 10:52:43 +08:00
LiangSong
52e8df9a8d update readme 2023-05-14 10:48:49 +08:00
LiangSong
a07d9b0ac8 update readme 2023-05-14 01:06:03 +08:00
LiangSong
bf2cac0a45 update config 2023-05-14 01:00:50 +08:00
LiangSong
e18ead00cc update server 2023-05-12 15:07:46 +08:00
LiangSong
7231d53ca4 update readme add new model 2023-05-12 11:32:42 +08:00
LiangSong
ceb1fd067b update vocab_size 2023-05-11 14:15:12 +08:00
LiangSong
73dafa7ad6 add rounding vocab_size 2023-05-10 17:49:52 +08:00
LiangSong
26f7421f05 add star history to readme 2023-05-10 15:52:55 +08:00
LiangSong
72a6f81b61 update readme 2023-05-09 18:47:29 +08:00
LiangSong
7d505ea303 update readme 2023-05-09 17:03:13 +08:00
LiangSong
59b79af9d7 add comment 2023-05-09 16:53:05 +08:00
LiangSong
f6ac834ef9 update default config 2023-05-09 15:16:50 +08:00
LiangSong
21fdd25b94 update Citation 2023-05-09 15:12:49 +08:00
LiangSong
30ab306c56 update readme 2023-05-09 15:06:47 +08:00
LiangSong
32583a41a7 update wudao download and preprocess 2023-05-09 14:47:59 +08:00
LiangSong
7dc90c2558 fix typo 2023-05-09 10:46:11 +08:00
LiangSong
6814fdb59e support gradient ckpt for peft 2023-05-08 23:40:03 +08:00
LiangSong
3ba0c77053 update optimizer for lora 2023-05-08 22:56:37 +08:00
LiangSong
58586112c1 fix table 2023-05-08 22:30:02 +08:00
LiangSong
16811d0efe update readme 2023-05-08 22:29:24 +08:00
LiangSong
92caa94490 support peft 2023-05-08 22:26:39 +08:00
LiangSong
7da40f1c83 fix typo 2023-05-08 19:00:06 +08:00
LiangSong
2df3e622e9 update readme 2023-05-08 18:59:01 +08:00
LiangSong
ec2b4d6ee7 fix split by shard bug 2023-05-08 14:03:05 +08:00
LiangSong
4a1e7bb44b Optimized the structure of configs, added support for deepspeed stage3, reduced memory usage by using Auto class to load models, and added support for training 65B models. 2023-05-06 23:37:17 +08:00
LiangSong
5b1f6a4861 fix epoch bug 2023-05-06 09:45:37 +08:00
LiangSong
f893a0f5b8 update dataset 2023-05-05 19:23:16 +08:00
LiangSong
758af69c73 update science instruct-tuning datasets 2023-05-05 19:00:37 +08:00
LiangSong
d24b4cce54 update preprocess format 2023-05-05 18:20:59 +08:00
LiangSong
85caa97a6a add xP3 dataset and belle_2M 2023-05-05 17:05:41 +08:00
LiangSong
00cbdbbf26 fix typo 2023-05-04 22:55:40 +08:00
LiangSong
693e3970d9 update readme 2023-05-04 22:54:10 +08:00
LiangSong
fbb7997607 fix typo 2023-05-04 22:32:15 +08:00
LiangSong
98ffab3a97 update readme and add half to server 2023-05-04 22:28:36 +08:00