imyzx a61d2999cf "UPDATE for 8K long-context exp1, SFT-PI run passed!" 9 months ago
core first 9 months ago
data first 9 months ago
fp16_deprecated first 9 months ago
fused_kernels first 9 months ago
model "UPDATE for 8K long-context exp1, SFT-PI run passed!" 9 months ago
mpu/tests first 9 months ago
optimizer first 9 months ago
static first 9 months ago
text_generation first 9 months ago
tokenizer first 9 months ago
__init__.py first 9 months ago
arguments.py first 9 months ago
checkpointing.py first 9 months ago
dist_signal_handler.py first 9 months ago
global_vars.py first 9 months ago
indexer.py first 9 months ago
initialize.py first 9 months ago
memory.py first 9 months ago
microbatches.py first 9 months ago
optimizer_param_scheduler.py first 9 months ago
text_generation_server.py first 9 months ago
timers.py first 9 months ago
training.py "UPDATE for 8K long-context exp" 9 months ago
utils.py first 9 months ago

Languages: Python, C++, Text, Shell, Cuda, other

Contributors (1)