Commit Graph

8 Commits

Author SHA1 Message Date
LiangSong
4a1e7bb44b Optimized the structure of configs, added support for deepspeed stage3, reduced memory usage by using Auto class to load models, and added support for training 65B models. 2023-05-06 23:37:17 +08:00
LiangSong
5c876121cb update gradio, fix code format bug 2023-05-04 18:18:52 +08:00
LiangSong
a1acc90988 fix train_tokenizer bug 2023-05-04 16:00:56 +08:00
LiangSong
f0d41f937b update instruct_config and set all random seed to 42 2023-05-04 08:45:21 +08:00
LiangSong
3f62a23ee2 update format 2023-04-12 22:16:15 +08:00
LiangSong
a4aa109dd3 add trainer and utils 2023-04-12 17:59:05 +08:00
LiangSong
ae0691c509 update utils 2023-04-12 17:15:40 +08:00
LiangSong
da1c927016 update speed test 2023-04-12 17:15:07 +08:00