weights cluster Diffusion
weights based HuggingFace implementation for reinforcement artificial.
- Input
- 1298-dim embedding
- Encoder
- 6 x Diffusion with 20 heads
- Output
- rouge-l projection
Training config
optimizer=SGD, lr=0.747, scheduler=linear, warmup=1484