|  | 5635f7d08d | Update data/download_wudao.sh | 2024-06-14 09:08:03 +00:00 |  | 
			
				
					| 
							
							
								 LiangSong | 0157b6938d | update readme | 2023-05-17 22:45:04 +07:00 |  | 
			
				
					| 
							
							
								 LiangSong | 95973b5de1 | update header | 2023-05-17 22:21:46 +07:00 |  | 
			
				
					| 
							
							
								 LiangSong | d269affb42 | update readme | 2023-05-17 21:17:49 +07:00 |  | 
			
				
					| 
							
							
								 s-JoL | 6988c69884 | Merge pull request #64 from eltociear/patch-1 Update README.md | 2023-05-16 22:24:13 +07:00 |  | 
			
				
					| 
							
							
								 Ikko Eltociear Ashimine | 7bacd6cb93 | Update README.md HuggingFace -> Hugging Face | 2023-05-17 00:11:35 +09:00 |  | 
			
				
					| 
							
							
								 LiangSong | 77b1c552c3 | add discord invite link | 2023-05-15 23:00:25 +07:00 |  | 
			
				
					| 
							
							
								 LiangSong | 82c845a8ce | update readme | 2023-05-15 00:21:13 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 1ce8c18d83 | add logo | 2023-05-14 10:52:43 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 52e8df9a8d | update readme | 2023-05-14 10:48:49 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | a07d9b0ac8 | update readme | 2023-05-14 01:06:03 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | bf2cac0a45 | update config | 2023-05-14 01:00:50 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | e18ead00cc | update server | 2023-05-12 15:07:46 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 7231d53ca4 | update readme add new model | 2023-05-12 11:32:42 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | ceb1fd067b | update vocab_size | 2023-05-11 14:15:12 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 73dafa7ad6 | add rounding vocab_size | 2023-05-10 17:49:52 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 26f7421f05 | add star history to readme | 2023-05-10 15:52:55 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 72a6f81b61 | update readme | 2023-05-09 18:47:29 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 7d505ea303 | update readme | 2023-05-09 17:03:13 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 59b79af9d7 | add comment | 2023-05-09 16:53:05 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | f6ac834ef9 | update default config | 2023-05-09 15:16:50 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 21fdd25b94 | update Citation | 2023-05-09 15:12:49 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 30ab306c56 | update readme | 2023-05-09 15:06:47 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 32583a41a7 | update wudao download and preprocess | 2023-05-09 14:47:59 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 7dc90c2558 | fix typo | 2023-05-09 10:46:11 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 6814fdb59e | support gradient ckpt for peft | 2023-05-08 23:40:03 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 3ba0c77053 | update optimizer for lora | 2023-05-08 22:56:37 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 58586112c1 | fix table | 2023-05-08 22:30:02 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 16811d0efe | update readme | 2023-05-08 22:29:24 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 92caa94490 | support peft | 2023-05-08 22:26:39 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 7da40f1c83 | fix typo | 2023-05-08 19:00:06 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 2df3e622e9 | update readme | 2023-05-08 18:59:01 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | ec2b4d6ee7 | fix split by shard bug | 2023-05-08 14:03:05 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 4a1e7bb44b | Optimized the structure of configs, added support for deepspeed stage3, reduced memory usage by using Auto class to load models, and added support for training 65B models. | 2023-05-06 23:37:17 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 5b1f6a4861 | fix epoch bug | 2023-05-06 09:45:37 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | f893a0f5b8 | update dataset | 2023-05-05 19:23:16 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 758af69c73 | update science instruct-tuning datasets | 2023-05-05 19:00:37 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | d24b4cce54 | update preprocess format | 2023-05-05 18:20:59 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 85caa97a6a | add xP3 dataset and belle_2M | 2023-05-05 17:05:41 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 00cbdbbf26 | fix typo | 2023-05-04 22:55:40 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 693e3970d9 | update readme | 2023-05-04 22:54:10 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | fbb7997607 | fix typo | 2023-05-04 22:32:15 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 98ffab3a97 | update readme and add half to server | 2023-05-04 22:28:36 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 5c876121cb | update gradio, fix code format bug | 2023-05-04 18:18:52 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | a1acc90988 | fix train_tokenizer bug | 2023-05-04 16:00:56 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 51686b5fb8 | add split dataset by shard option to accelerate data loading | 2023-05-04 09:20:23 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | f0d41f937b | update instruct_config and set all random seed to 42 | 2023-05-04 08:45:21 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | dba2e2d680 | update ShareGPT_90K preprocess | 2023-05-04 08:34:38 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | 154456c976 | set dataset shuffle seed to 42 | 2023-05-04 00:31:12 +08:00 |  | 
			
				
					| 
							
							
								 LiangSong | c2184c6dd1 | support multiple epochs | 2023-05-03 00:02:01 +08:00 |  |