No, not every combination. The question is about the specific combination of a p...

svara · 2026-02-12T19:07:00 1770923220

More likely you would just train for emitting svg for some description of a scene and create training data from raster images.

recursive · 2026-02-12T23:49:39 1770940179

None of this works if the testers are collaborating with the trainers. The tests ostensibly need to be arms-length from the training. If the trainers ever start over-fitting to the test, the tester would come up with some new test secretly.

ebonnafoux · 2026-02-13T12:40:13 1770986413

You can easily make a RLAIF loop.

- Take a list of n animals * m vehicule

- Ask a LLM to generate SVG for this n*m options

- Generate png from the svg

- Ask a Model with vision to grade the result

- Change your weight accordingly

No need to human to draw the dataset, no need of human to evaluate.