Comments

Staatlicher Nagetierfunktionär

The rock-paper-scissors test is nice. Another one is: "I have 5 oranges today. Last week I ate 3. How many oranges do I have now?" Almost every chatbot fails it (the answer is still 5, since what I ate last week doesn't change what I have today). Even if one gets it right, open a new chat and try again and it fails; none of them is consistent. Tested with GPT-3.5, GPT-4, Bing Copilot, and Gemini (Bard). Instead of answering, they lecture you on basic math and cheer your healthy diet. Also, Bard costs 22+ USD per month, while 2 TB of storage alone costs less than 10. But I guess people will fail at that math too, because of the previously taught example.

Jacob Wood

GPT-3.5 got it wrong, but GPT-4 got it right every time I tried.