the lemmy.monster
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Timely_Jellyfish_2077@programming.dev to Technology@lemmy.worldEnglish · 9 个月前

Reasoning failures highlighted by Apple research on LLMs

appleinsider.com

external-link
message-square
59
link
fedilink
222
external-link

Reasoning failures highlighted by Apple research on LLMs

appleinsider.com

Timely_Jellyfish_2077@programming.dev to Technology@lemmy.worldEnglish · 9 个月前
message-square
59
link
fedilink
A new paper from Apple's artificial intelligence scientists has found that engines based on large language models, such as those from Meta and OpenAI, still lack basic reasoning skills.
  • Rimu@piefed.social
    link
    fedilink
    English
    arrow-up
    8
    ·
    edit-2
    9 个月前

    I tried it myself (changing the name and changing the values) but lost interest after 3 attempts and always getting the right answer:

    https://chatgpt.com/share/670af65d-da08-800f-8ad4-c67782ee5477

    https://chatgpt.com/share/670af672-45dc-800f-ac91-cc2811fa89c7

    https://chatgpt.com/share/6709e80b-e5a8-800f-90d0-1af3418675ef

    • A_A@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      9 个月前

      Errors from your links like this :
      Unable to load conversation 670a…6ed2c

      • Rimu@piefed.social
        link
        fedilink
        arrow-up
        2
        ·
        9 个月前

        Sorry! I’ve updated my links now.

        • A_A@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          9 个月前

          “… So, Mary has 190 kiwifruit.”
          nice 😋🥝

    • tinsukE@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      1
      ·
      9 个月前

      I wouldn’t doubt that LLMs got some special input to deal with the specific examples of this paper, or similar enough.

      • alienanimals@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        9 个月前

        This is just improving LLMs, but with more steps.

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @[email protected]
  • @[email protected]
  • @[email protected]
  • @[email protected]
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 3.31K users / day
  • 10.2K users / week
  • 18.1K users / month
  • 38.8K users / 6 months
  • 2 local subscribers
  • 72K subscribers
  • 11.6K Posts
  • 430K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • enu@lemmy.world
  • Technopagan@lemmy.world
  • L4sBot@lemmy.world
  • L3s@hackingne.ws
  • L4s@hackingne.ws
  • BE: 0.19.11
  • Modlog
  • Legal
  • Instances
  • Docs
  • Code
  • join-lemmy.org