PixelVerse t1 – CoT prompting outperforms flagship LLMs

(ai.pixelverse.tech)

8 points | by hayden_k 8 hours ago ago

8 comments

growt 7 hours ago ago
First try with llama 70b found two R's in strawberry :) Gemma did better
hayden_k 8 hours ago ago
OpenAI o1-like CoT and logical thinking prompting strategy significantly enhances llm responses.
PixelVerse t1 is powered by Llama 3.1 70b and 3.2 90b. However, with a detailed CoT prompt, it answers complex questions correctly, much better than it's base model and even sometimes beating flagship models like GPT 4o, Claude 3 and Gemini.
Try it at: https://ai.pixelverse.tech/app/cortexchat
dylanjcastillo 6 hours ago ago
I asked the 9.8 vs. 9.11 question from the examples and it got it wrong :/
Lienetic 4 hours ago ago
Can we see the detailed CoT prompt?
satisfice 3 hours ago ago
I guess there really are two r’s in strawbery.
6 hours ago ago
[deleted]
brianjking 3 hours ago ago
lol, this feels like Reflection 70b all over again.
7 hours ago ago
[deleted]