PyDigger - unearthing stuff about Python


Name | Version | Summary | Date
shtec-rlhf | 1.0.5 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-06-24 05:55:07
trl | 0.9.4 | Train transformer language models with reinforcement learning. | 2024-06-06 14:14:32
nemo-aligner | 0.3.1 | NeMo-Aligner - a toolkit for model alignment | 2024-06-03 20:17:32