PyDigger - unearthing stuff about Python


Name          Version  Summary                                                         Date
trl           0.11.1   Train transformer language models with reinforcement learning.  2024-09-24 17:01:16
nemo-aligner  0.4.0    NeMo-Aligner - a toolkit for model alignment                    2024-09-23 16:15:00
shtec-rlhf    1.0.5    shtec-rlhf: Safe Reinforcement Learning from Human Feedback     2024-06-24 05:55:07