Name | Version | Summary | date |
---|---|---|---|
trl | 0.11.1 | Train transformer language models with reinforcement learning. | 2024-09-24 17:01:16 |
nemo-aligner | 0.4.0 | NeMo-Aligner - a toolkit for model alignment | 2024-09-23 16:15:00 |
shtec-rlhf | 1.0.5 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-06-24 05:55:07 |
hour | day | week | total |
---|---|---|---|
59 | 2205 | 10049 | 248313 |