Replies

This profile is from a federated server and may be incomplete. Browse more on the original instance.

tante, to random
@tante@tldr.nettime.org avatar

Today in "LLMs can't do even simple reasoning":

Prompt: Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?

See a whole bunch of LLMs fail: https://benchmarks.llmonitor.com/sally

tante,
@tante@tldr.nettime.org avatar

Many LLMs answer "6", mostly because "each" triggers a lot of programming/math wording.

Embeddings can be very finicky and LLms don't handle extra information well.

jollyorc, to random
@jollyorc@social.5f9.de avatar

I wonder: is there GAIA-X discourse happening the fediverse?

tante,
@tante@tldr.nettime.org avatar

@jollyorc is that still a thing with funding running out?

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • uselessserver093
  • Food
  • aaaaaaacccccccce
  • test
  • CafeMeta
  • testmag
  • MUD
  • RhythmGameZone
  • RSS
  • dabs
  • KamenRider
  • Ask_kbincafe
  • TheResearchGuardian
  • KbinCafe
  • Socialism
  • oklahoma
  • SuperSentai
  • feritale
  • All magazines