LiangSong
|
4a1e7bb44b
|
Optimized the structure of configs, added support for deepspeed stage3, reduced memory usage by using Auto class to load models, and added support for training 65B models.
|
2023-05-06 23:37:17 +08:00 |
|
LiangSong
|
5c876121cb
|
update gradio, fix code format bug
|
2023-05-04 18:18:52 +08:00 |
|
LiangSong
|
a1acc90988
|
fix train_tokenizer bug
|
2023-05-04 16:00:56 +08:00 |
|
LiangSong
|
f0d41f937b
|
update instruct_config and set all random seed to 42
|
2023-05-04 08:45:21 +08:00 |
|
LiangSong
|
ae0691c509
|
update utils
|
2023-04-12 17:15:40 +08:00 |
|