Commit Graph

71 Commits

Author SHA1 Message Date
LiangSong
8c85535db3 update readme 2023-04-28 19:49:51 +08:00
LiangSong
c890bce69c update readme 2023-04-28 15:10:41 +08:00
LiangSong
9baebfd49c Merge branch 'main' into dev 2023-04-28 15:08:25 +08:00
LiangSong
2fd13ff075 fix typo 2023-04-28 15:05:33 +08:00
LiangSong
0fdca8b949 update readme 2023-04-28 15:01:01 +08:00
LiangSong
49118aad42 update header config and add padding to concat_multiple_sequence 2023-04-27 23:42:11 +08:00
LiangSong
db6cdb51d0 unified pre-training and instrcution-tuning both use train_lm and dataset 2023-04-27 19:42:06 +08:00
LiangSong
97aff0e051 use split_dataset_by_node instead accelerate.prepare to accelerate data loading by 50% 2023-04-27 00:04:11 +08:00
LiangSong
0377b43628 update tokenizer to LlamaTokenizer 2023-04-26 18:53:30 +08:00
LiangSong
f41f5558ec update header 2023-04-24 23:19:07 +08:00
LiangSong
f8f4cde228 using huggingface datasets to accelerate training, using open-llama to pretrain 2023-04-24 19:13:53 +08:00
s-JoL
92af968637
Update README.md 2023-04-23 16:26:58 +08:00
s-JoL
cf852bc459
Update README.md 2023-04-23 16:26:21 +08:00
LiangSong
ad3d943a7d update readme add ckpt from hf 2023-04-16 23:50:36 +08:00
LiangSong
b21441b14b disable concat docs 2023-04-15 19:35:24 +08:00
LiangSong
3f62a23ee2 update format 2023-04-12 22:16:15 +08:00
LiangSong
a4aa109dd3 add trainer and utils 2023-04-12 17:59:05 +08:00
LiangSong
ae0691c509 update utils 2023-04-12 17:15:40 +08:00
LiangSong
da1c927016 update speed test 2023-04-12 17:15:07 +08:00
LiangSong
0ee9612f40 add speed test 2023-04-11 21:59:18 +08:00
s-JoL
4cb94d2687
Update README.md 2023-04-11 15:59:02 +08:00
s-JoL
d2632467ec
Update README_en.md 2023-04-11 15:53:43 +08:00
s-JoL
ce8bc5249f
Update README_en.md 2023-04-11 15:53:05 +08:00
s-JoL
be2f0960c7
Update README.md 2023-04-11 15:51:43 +08:00
S
5f7a4a69d3
Merge pull request #3 from Bayes-Song/dev
update readme
2023-04-09 22:49:40 +08:00
LiangSong
f9e7a3376a update readme 2023-04-09 22:48:56 +08:00
S
ce06d9feab
Merge pull request #2 from Bayes-Song/dev
update readme_en
2023-04-08 00:05:02 +08:00
LiangSong
00cda9e265 update readme_en 2023-04-08 00:04:11 +08:00
S
56f71e24df
Merge pull request #1 from Bayes-Song/dev
update instruct-tuning
2023-04-07 23:21:06 +08:00
LiangSong
c67d365db3 update format 2023-04-07 23:20:20 +08:00
LiangSong
f4ba4b6ff2 update chinese readme 2023-04-07 23:19:42 +08:00
LiangSong
1a731953da update server 2023-04-07 10:04:05 +08:00
LiangSong
bc16df4751 add more instruction data 2023-04-06 03:45:24 +08:00
LiangSong
9f140dc99f update preprocess_instruction, add math/code/multiturn_chat and etc. 2023-04-05 23:51:56 +08:00
LiangSong
562067230f update dataset, add concat sequence from multiple docs 2023-04-05 22:42:34 +08:00
LiangSong
0b0028097d update demo 2023-04-04 18:04:35 +08:00
LiangSong
7c7c4765fc update server demo 2023-04-04 17:35:51 +08:00
LiangSong
e89f5a5aa0 update server 2023-04-02 21:55:18 +08:00
LiangSong
9857ad03f1 update server 2023-04-02 21:51:32 +08:00
LiangSong
3d0d4e8af7 update server.py 2023-04-02 21:39:26 +08:00
LiangSong
529f063234 update readme 2023-04-02 21:32:39 +08:00
LiangSong
ab9d68a3ed update readme 2023-04-02 20:29:49 +08:00
LiangSong
7f3f80dc59 update readme 2023-04-02 11:44:05 +08:00
LiangSong
73db126384 update demo link 2023-04-01 10:36:37 +08:00
LiangSong
a8ee5ca732 fix typo 2023-03-31 19:14:03 +08:00
LiangSong
4dfc2a3045 update readme add limitation 2023-03-31 15:11:12 +08:00
LiangSong
d25b34c280 update readme 2023-03-31 14:58:07 +08:00
LiangSong
5dc1e77c66 add server 2023-03-31 14:11:07 +08:00
LiangSong
b9bc7eaf35 fix long seq bug 2023-03-31 10:12:28 +08:00
LiangSong
a62ac2658f add instruction-tuning 2023-03-30 23:43:12 +08:00