Name | Version | Summary | Date |
---|---|---|---|
shtec-rlhf | 0.0.4.dev0 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-06-17 03:24:23 |
trl | 0.9.4 | Train transformer language models with reinforcement learning. | 2024-06-06 14:14:32 |
nemo-aligner | 0.3.1 | NeMo-Aligner - a toolkit for model alignment | 2024-06-03 20:17:32 |