Name | Version | Summary | Date |
---|---|---|---|
shtec-rlhf | 0.0.4.dev0 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-06-17 03:24:23 |
trl | 0.9.4 | Train transformer language models with reinforcement learning. | 2024-06-06 14:14:32 |
nemo-aligner | 0.3.1 | NeMo-Aligner - a toolkit for model alignment | 2024-06-03 20:17:32 |

Hour | Day | Week | Total |
---|---|---|---|
72 | 1975 | 9464 | 219444 |