• mozz@mbin.grits.dev
    link
    fedilink
    arrow-up
    16
    ·
    edit-2
    9 months ago

    What the heck is this source. Excerpts:

    recall the recent condensing of regulation and timescales in the biomedical industry, ‘Operation Warp Speed’

    in chess or Go I expect that in the overwhelming majority of positions (even excluding the opening book & endgame databases), the best move is known and the engines aren’t going to change the choice no matter how long you run them

    (that second one is him quoting some researcher, but this transparently absurd statement simply slides past unnoticed, and indeed is cited as support for something similar he’s saying which also seem dubious to me.)

    This month, the US was fairly quiet. In fact, only a few US-based AI labs announced new models (generally, I don’t count Llama finetunes, nor OpenAI’s minor model updates).

    I barely pay attention to this stuff, and I noticed the CodeLlama 70B release, which I would describe as significant – simply stacking up the number of papers and saying one side of the equation is making more progress because they’re releasing more things (and specifically saying that he “doesn’t count” the two most prolific US sources in terms of commonly-used models), is very weird. You can look at benchmarks, or try the models yourselves or see what results they claim in their papers, if you’re going to write an article saying something about the comparative output.

    There is a whole conversation to be had about AI research in China, and I’m 100% open to the idea that I and the rest of the West is missing something important, but it would have been nice to see this citation-less statement:

    many of them significantly outperforming ChatGPT

    Backed up by something more than:

    I’m not sure what China’s prolific output means

    He also compares things to GPT 3.5 (in his mind, not by testing). Personally I dislike using 3.5 for anything, because there’s something already available to consumers that’s clearly way better and has been for quite some time. GPT-4 is clearly the model to beat and the model that most US researchers compare their stuff to when they’re publishing stuff.

    Etc etc. In short:

    BOOOOOOOOOOOOOOO

    BOOOOOOOOOOOOOOOO

  • Possibly linux@lemmy.zip
    link
    fedilink
    English
    arrow-up
    5
    ·
    9 months ago

    Honestly these metrics are pretty useless.

    It is a bit concerning that China could use machine learning to censor images automatically country wide. Imagine a protest where the police just disappear out of the photo. All you would see is people running.

    • mozz@mbin.grits.dev
      link
      fedilink
      arrow-up
      2
      ·
      9 months ago

      Kind of yeah. There’s a whole conversation about how the Chinese government was at one point hamstringing all Chinese AI development by trying to punish researchers (with for-real, Chinese-prison punishment) if their models ever say anything politically suspect. I don’t know if that’s still going on, but past and present it is asinine and counterproductive to a degree that’s honestly a little hard to believe. At the same time, China’s command economy has the ability to muster resources towards a priority like AI development in a way that simply doesn’t happen in the West, so I could easily believe that there are good things coming out that people here are unaware of. So it’d be good to learn about these things; I wish this article was reporting some of the substance of them so we could.

    • Lugh@futurology.todayOPM
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      9 months ago

      Saying all Chinese language media is propaganda just seems like a conspiracy theory.

  • Lugh@futurology.todayOPM
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    3
    ·
    edit-2
    9 months ago

    Most English language media is poor at reporting on China, so sometimes you come across facts like this that sit up and make you take notice; though I wish I had more context for them.