• Prox@lemmy.world
    link
    fedilink
    English
    arrow-up
    7
    ·
    edit-2
    8 days ago

    Yes, that is the major problem with LLMs in general. There is no solution aside from “train on another different source (like Reddit)”, but then we rinse & repeat.

      • Prox@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        7 days ago

        I guess, though I’m pretty ignorant as to how RLVR would fix the issue that arises from new coding languages or even new major versions. I’m not sure how LLMs would ever get to a correct answer if they don’t have good reference material to start from or reference.