PixelVerse t1 – CoT prompting outperforms flagship LLMs

(ai.pixelverse.tech)

8 points | by hayden_k 8 hours ago

8 comments

  • growt 7 hours ago

    First try with llama 70b found two R's in strawberry :) Gemma did better

  • hayden_k 8 hours ago

    An OpenAI o1-like CoT and logical-thinking prompting strategy significantly enhances LLM responses.

    PixelVerse t1 is powered by Llama 3.1 70b and 3.2 90b. However, with a detailed CoT prompt, it answers complex questions correctly, much better than its base model, and sometimes even beats flagship models like GPT-4o, Claude 3 and Gemini.

    Try it at: https://ai.pixelverse.tech/app/cortexchat

  • dylanjcastillo 6 hours ago

    I asked the 9.8 vs. 9.11 question from the examples and it got it wrong :/

  • Lienetic 4 hours ago

    Can we see the detailed CoT prompt?

  • satisfice 3 hours ago

    I guess there really are two r’s in strawbery.

  • 6 hours ago
    [deleted]
  • brianjking 3 hours ago

    lol, this feels like Reflection 70b all over again.

  • 7 hours ago
    [deleted]