LiangSong
|
724265b435
|
update readme
|
2023-04-28 19:54:14 +08:00 |
|
LiangSong
|
8c85535db3
|
update readme
|
2023-04-28 19:49:51 +08:00 |
|
LiangSong
|
c890bce69c
|
update readme
|
2023-04-28 15:10:41 +08:00 |
|
LiangSong
|
9baebfd49c
|
Merge branch 'main' into dev
|
2023-04-28 15:08:25 +08:00 |
|
LiangSong
|
2fd13ff075
|
fix typo
|
2023-04-28 15:05:33 +08:00 |
|
LiangSong
|
0fdca8b949
|
update readme
|
2023-04-28 15:01:01 +08:00 |
|
LiangSong
|
49118aad42
|
update header config and add padding to concat_multiple_sequence
|
2023-04-27 23:42:11 +08:00 |
|
LiangSong
|
db6cdb51d0
|
unified pre-training and instrcution-tuning both use train_lm and dataset
|
2023-04-27 19:42:06 +08:00 |
|
LiangSong
|
97aff0e051
|
use split_dataset_by_node instead accelerate.prepare to accelerate data loading by 50%
|
2023-04-27 00:04:11 +08:00 |
|
LiangSong
|
0377b43628
|
update tokenizer to LlamaTokenizer
|
2023-04-26 18:53:30 +08:00 |
|
LiangSong
|
f41f5558ec
|
update header
|
2023-04-24 23:19:07 +08:00 |
|
LiangSong
|
f8f4cde228
|
using huggingface datasets to accelerate training, using open-llama to pretrain
|
2023-04-24 19:13:53 +08:00 |
|
s-JoL
|
92af968637
|
Update README.md
|
2023-04-23 16:26:58 +08:00 |
|
s-JoL
|
cf852bc459
|
Update README.md
|
2023-04-23 16:26:21 +08:00 |
|
LiangSong
|
ad3d943a7d
|
update readme add ckpt from hf
|
2023-04-16 23:50:36 +08:00 |
|
LiangSong
|
b21441b14b
|
disable concat docs
|
2023-04-15 19:35:24 +08:00 |
|
LiangSong
|
3f62a23ee2
|
update format
|
2023-04-12 22:16:15 +08:00 |
|
LiangSong
|
a4aa109dd3
|
add trainer and utils
|
2023-04-12 17:59:05 +08:00 |
|
LiangSong
|
ae0691c509
|
update utils
|
2023-04-12 17:15:40 +08:00 |
|
LiangSong
|
da1c927016
|
update speed test
|
2023-04-12 17:15:07 +08:00 |
|
LiangSong
|
0ee9612f40
|
add speed test
|
2023-04-11 21:59:18 +08:00 |
|
s-JoL
|
4cb94d2687
|
Update README.md
|
2023-04-11 15:59:02 +08:00 |
|
s-JoL
|
d2632467ec
|
Update README_en.md
|
2023-04-11 15:53:43 +08:00 |
|
s-JoL
|
ce8bc5249f
|
Update README_en.md
|
2023-04-11 15:53:05 +08:00 |
|
s-JoL
|
be2f0960c7
|
Update README.md
|
2023-04-11 15:51:43 +08:00 |
|
S
|
5f7a4a69d3
|
Merge pull request #3 from Bayes-Song/dev
update readme
|
2023-04-09 22:49:40 +08:00 |
|
LiangSong
|
f9e7a3376a
|
update readme
|
2023-04-09 22:48:56 +08:00 |
|
S
|
ce06d9feab
|
Merge pull request #2 from Bayes-Song/dev
update readme_en
|
2023-04-08 00:05:02 +08:00 |
|
LiangSong
|
00cda9e265
|
update readme_en
|
2023-04-08 00:04:11 +08:00 |
|
S
|
56f71e24df
|
Merge pull request #1 from Bayes-Song/dev
update instruct-tuning
|
2023-04-07 23:21:06 +08:00 |
|
LiangSong
|
c67d365db3
|
update format
|
2023-04-07 23:20:20 +08:00 |
|
LiangSong
|
f4ba4b6ff2
|
update chinese readme
|
2023-04-07 23:19:42 +08:00 |
|
LiangSong
|
1a731953da
|
update server
|
2023-04-07 10:04:05 +08:00 |
|
LiangSong
|
bc16df4751
|
add more instruction data
|
2023-04-06 03:45:24 +08:00 |
|
LiangSong
|
9f140dc99f
|
update preprocess_instruction, add math/code/multiturn_chat and etc.
|
2023-04-05 23:51:56 +08:00 |
|
LiangSong
|
562067230f
|
update dataset, add concat sequence from multiple docs
|
2023-04-05 22:42:34 +08:00 |
|
LiangSong
|
0b0028097d
|
update demo
|
2023-04-04 18:04:35 +08:00 |
|
LiangSong
|
7c7c4765fc
|
update server demo
|
2023-04-04 17:35:51 +08:00 |
|
LiangSong
|
e89f5a5aa0
|
update server
|
2023-04-02 21:55:18 +08:00 |
|
LiangSong
|
9857ad03f1
|
update server
|
2023-04-02 21:51:32 +08:00 |
|
LiangSong
|
3d0d4e8af7
|
update server.py
|
2023-04-02 21:39:26 +08:00 |
|
LiangSong
|
529f063234
|
update readme
|
2023-04-02 21:32:39 +08:00 |
|
LiangSong
|
ab9d68a3ed
|
update readme
|
2023-04-02 20:29:49 +08:00 |
|
LiangSong
|
7f3f80dc59
|
update readme
|
2023-04-02 11:44:05 +08:00 |
|
LiangSong
|
73db126384
|
update demo link
|
2023-04-01 10:36:37 +08:00 |
|
LiangSong
|
a8ee5ca732
|
fix typo
|
2023-03-31 19:14:03 +08:00 |
|
LiangSong
|
4dfc2a3045
|
update readme add limitation
|
2023-03-31 15:11:12 +08:00 |
|
LiangSong
|
d25b34c280
|
update readme
|
2023-03-31 14:58:07 +08:00 |
|
LiangSong
|
5dc1e77c66
|
add server
|
2023-03-31 14:11:07 +08:00 |
|
LiangSong
|
b9bc7eaf35
|
fix long seq bug
|
2023-03-31 10:12:28 +08:00 |
|