PyDigger - unearthing stuff about Python


NameVersionSummarydate
shtec-rlhf 0.0.4.dev0 shtec-rlhf: Safe Reinforcement Learning from Human Feedback 2024-06-17 03:24:23
trl 0.9.4 Train transformer language models with reinforcement learning. 2024-06-06 14:14:32
nemo-aligner 0.3.1 NeMo-Aligner - a toolkit for model alignment 2024-06-03 20:17:32
hourdayweektotal
10118669453219412
Elapsed time: 1.13101s