cross-posted from: https://futurology.today/post/2910566

Alibaba’s Qwen team just released QwQ-32B-Preview, a powerful new open-source AI reasoning model that can reason step-by-step through challenging problems and directly competes with OpenAI’s o1 series across benchmarks.

The details:

QwQ features a 32K context window, outperforming o1-mini and competing with o1-preview on key math and reasoning benchmarks.

The model was tested across several of the most challenging math and programming benchmarks, showing major advances in deep reasoning.

QwQ demonstrates ‘deep introspection,’ talking through problems step-by-step and questioning and examining its own answers to reason to a solution.

The Qwen team noted several issues in the Preview model, including getting stuck in reasoning loops, struggling with common sense, and language mixing.

Why it matters: Between QwQ and DeepSeek, open-source reasoning models are here — and Chinese firms are absolutely cooking with new models that nearly match the current top closed leaders. Has OpenAI’s moat dried up, or does the AI leader have something special up its sleeve before the end of the year?

    • yetAnotherUser@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      24
      arrow-down
      1
      ·
      26 days ago

      It’s a little on the rambling side. While it does show the steps it takes, I’m sure it gets annoying when it considers more and more edge cases.

      Q: What is the longest word in this sentence?

      Let’s see. The sentence is: “What is the longest word in this sentence?” I need to find out which word is the longest in this sentence.

      First, I should probably list out all the words in the sentence. Let’s break it down:

      1. What

      2. is

      3. the

      4. longest

      5. word

      6. in

      7. this

      8. sentence

      Now, I need to count the number of letters in each word.

      • What: 4 letters

      • is: 2 letters

      • the: 3 letters

      • longest: 7 letters

      • word: 4 letters

      • in: 2 letters

      • this: 4 letters

      • sentence: 8 letters

      Okay, so “sentence” has 8 letters, and “longest” has 7 letters. So “sentence” is longer.

      Wait, but is “sentence” the longest? Let me check again.

      What: 4

      is: 2

      the: 3

      longest: 7

      word: 4

      in: 2

      this: 4

      sentence: 8

      Yes, “sentence” has the most letters with 8 letters.

      But wait, is there any word with more than 8 letters? Let’s make sure.

      Looking back at the list:

      • What: 4

      • is: 2

      • the: 3

      • longest: 7

      • word: 4

      • in: 2

      • this: 4

      • sentence: 8

      No, “sentence” is the longest word in this sentence.

      But now I’m thinking, maybe there’s a word that’s longer that I’m missing. Maybe there’s a word within “sentence” that’s longer, like if “sentence” contained another word within it.

      Wait, no. Each word is separate. I think I listed all the words correctly.

      Alternatively, maybe there are multiple words with the same length, but in this case, “sentence” is the longest.

      Wait, perhaps I should consider punctuation. Is there any punctuation that might be considered part of a word?

      The sentence ends with a question mark, so “sentence?” would be 9 letters if the question mark is included. But typically, punctuation is not considered part of the word. So, “sentence” is still 8 letters.

      Alright, so “sentence” is the longest word with 8 letters.

      But just to be thorough, maybe there’s a word that I missed or miscounted.

      Let me count again:

      • What: W-H-A-T, 4 letters

      • is: I-S, 2 letters

      • the: T-H-E, 3 letters

      • longest: L-O-N-G-E-S-T, 7 letters

      • word: W-O-R-D, 4 letters

      • in: I-N, 2 letters

      • this: T-H-I-S, 4 letters

      • sentence: S-E-N-T-E-N-C-E, 8 letters

      Yes, definitely 8 letters.

      So, the longest word is “sentence.”

      But now I’m thinking, maybe there’s a word with a hyphen or something that I need to consider. Like “well-known” would be two words, well and known, but without a hyphen, it’s one word.

      But in this sentence, there are no hyphenated words.

      Alternatively, maybe there are contractions, but there are no contractions in this sentence.

      So, “sentence” is clearly the longest word.

      But to make sure, perhaps I can measure the length programmatically.

      If I were to write a simple program to find the longest word in a sentence, I would split the sentence into words and then find the word with the maximum length.

      In Python, for example, I could do something like:

      sentence = “What is the longest word in this sentence?”

      words = sentence.split()

      max_word = max(words, key=len)

      print(max_word)

      That would output “sentence?” but since punctuation is not part of the word, I should remove punctuation.

      So, perhaps:

      import string

      sentence = “What is the longest word in this sentence?”

      words = sentence.translate(str.maketrans(‘’, ‘’, string.punctuation)).split()

      max_word = max(words, key=len)

      print(max_word)

      This would output “sentence” without the question mark.

      So, yes, “sentence” is the longest word with 8 letters.

      But now, out of curiosity, I wonder if there are any words with more than 8 letters in this sentence. Clearly not, but maybe in a different sentence.

      Alternatively, perhaps there’s a way to make “sentence” longer by considering it with punctuation, but as established, punctuation is not part of the word.

      Wait, maybe there are words with apostrophes, like “it’s,” but in this sentence, there are no such words.

      So, in conclusion, “sentence” is the longest word in the sentence “What is the longest word in this sentence?” with 8 letters.

      Final Answer

      [ \boxed{\text{sentence}} ]

        • pirat@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          ·
          25 days ago

          The model just didn’t stop generating.

          You must wait until it reaches a theory where “i” could somehow refer to some Chinese dissidents…

        • pirat@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          25 days ago

          […]

          But again, that’s specific to that field.

          […]

          But that’s getting too technical.

          […]

          But still, without context, it’s hard to be precise.

      • Asuka@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        24 days ago

        It REALLY rambles, but man, it feels really hard to claim this is just an autocomplete not - it seems like it actually understands the logic.

    • SkaveRat@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      1
      ·
      25 days ago

      Love the step by step reasoning. Especially when you give it some weird stuff.

      I asked:

      How many strawberries can you fit into the most common sedan car from 2023, while also being able to drive it?

      The answer is too long to post directly:

      https://pastebin.com/mTiyQkmY

      • jcg@halubilo.social
        link
        fedilink
        English
        arrow-up
        10
        ·
        25 days ago

        I thought it’d drop the “just the trunk space” thing eventually but it reaffirms it towards the end

        But the question specifies that the car should still be drivable, which probably means that the rear seats need to be in place for passengers to sit.

        And the reasoning broke down, you don’t need passengers to drive a car. Pretty interesting reading it’s “thought” process with the little humanisms like “hmm” and “but wait!”