• LukeZaz@beehaw.org
    2 days ago

    The changelog lists 30 significant changes, of which the top new feature is integrating Whisper. This means whisper.cpp, which is Georgi Gerganov’s entirely local and offline version of OpenAI’s Whisper automatic speech recognition model. The bottom line is that FFmpeg can now automatically subtitle videos for you.

    Yeah hey, can anyone chime in if this is at all based off LLMs? Because my problems with the incorrect plagiarism machine don’t end just because it’s now the offline incorrect plagiarism machine. Making OpenAI’s garbage hockey open source doesn’t make it okay. Or should I just start calling this shit FOSSwashing?

    I dug around for a bit and couldn’t find much of anything, but judging by the GitHub pages for both versions of Whisper, the two look closely related. If that’s the case, fuck right off. I don’t want AI in FFmpeg, either.

    • Kay Ohtie@pawb.social
      2 days ago

      It’s not AI, it’s the same kind of neural network models that voice recognition in devices has been using for over a decade. Even Dragon has been utilizing language model vectors for a very long time, just requiring voice training instead of a premade research or open-source data set.

      I hate generative AI and its slop too, but getting angry about neural network models in general is not only absurd, it plays exactly into what corporations want – conflation of the underlying basic technology concepts with the capitalistic vampirism of art.

      EDIT: to add, “research” here can be closed source – the voice models used with these tend to be internally sourced, or at least the earlier ones were.

      • drosophila@lemmy.blahaj.zone
        1 day ago

        It’s not AI, it’s neural network models

        These used to be called AI before people decided that only LLMs and diffusion models were AI, both of which are types of neural networks.

        • Kay Ohtie@pawb.social
          12 hours ago

          But much more loosely, and not nearly as heavily. It was more of a seldom-used term, a loose way of describing what was actually machine learning.

          Now they’re all being called it heavily, forcefully, by corporations that started using it for capitalistic hype reasons. Hence the push for a strong distinction between a field that’s been around for quite a while as a variety of mathematical algorithms, and lazy slop of the “just one more prompt bro” and “we can replace workers” kind. Even DLSS wasn’t called “AI” until the hype train started, and now Jensen Huang can’t call it that often enough lest he be unable to afford yet another leather jacket as if they’re disposable glasses wipes.