• wise_pancake@lemmy.ca
      link
      fedilink
      English
      arrow-up
      2
      ·
      11 days ago

      Additional is fairly trivial for a neural network to learn.

      Weight 1 plus weight 2 equals output is literally the baseline model structure.

      • Zos_Kia@lemmynsfw.com
        link
        fedilink
        English
        arrow-up
        5
        ·
        10 days ago

        It’s actually a fairly involved process because the tokens representing 1 and 4 don’t have any mathematical correlation with the numbers 1 and 4 so you can’t math them directly to get to 5.

        Apparently how they do it is by a series of approximations from big numbers to small numbers, not too dissimilar from the way a human would do it. The anthropic team published a paper about it recently, I can dig it up if you’re interested.