A real example: ChatGPT is impressive at solving math competition problems, but gets the fraction arithmetic wrong

Source: 孺将 2023-03-19 13:20:05 (2564 bytes)

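My kid is taking part in Beestar's online math competition for elementary and middle schoolers. A few problems in the second round were quite hard, and I wasn't sure of the answers myself, so I decided to try the currently trendy ChatGPT on them. The result was an eye-opener: ChatGPT solved a hard Grade 5 competition problem in a few seconds and gave detailed solution steps. But it also got a simple fraction calculation wrong, for no apparent reason. Sharing it here: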

Problem (a hard Grade 5 competition question):

Box A contains 5 red balls and 7 green balls. Box B contains 3 red balls and 2 green balls. A box is chosen at random, and one ball removed at random from this box. Given that the ball removed was green, what is the probability that it came from box A?

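Honestly, even parents would have to think carefully about this one. I typed it into ChatGPT, and within seconds it produced a very detailed solution: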

Let's use Bayes' theorem to solve this problem:

Let A be the event that the ball was drawn from box A, and G be the event that a green ball was drawn. We want to find P(A|G), the probability that the ball came from box A given that it was green.

By Bayes' theorem:

P(A|G) = P(G|A)P(A) / P(G)

where:

P(G|A) is the probability of drawing a green ball given that the box is A. This is 7/12 since box A has 7 green balls out of 12 total balls.

P(A) is the prior probability of choosing box A, which is 1/2 since either box could have been chosen at random.

P(G) is the total probability of drawing a green ball, which can be calculated using the law of total probability:

P(G) = P(G|A)P(A) + P(G|B)P(B)

where P(G|B) is the probability of drawing a green ball given that the box is B, and P(B) is the prior probability of choosing box B, which is also 1/2.

We can calculate P(G|B) as 2/5 since box B has 2 green balls out of 5 total balls.

Plugging these values into Bayes' theorem, we get:

P(A|G) = (7/12) * (1/2) / [(7/12)(1/2) + (2/5)(1/2)]

≈ 0.636

Therefore, the probability that the ball came from box A given that it was green is approximately 0.636 or 63.6%.

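Amazing! ChatGPT really startled me. But when I took the result back to Beestar, none of the answer choices matched. I later found that ChatGPT had botched the simple calculation it had set up itself: (7/12) * (1/2) / [(7/12)(1/2) + (2/5)(1/2)] = (7/24) / (59/120) = 35/59 ≈ 0.593, not the 0.636 that ChatGPT gave. The approach and the steps for the hard part were all correct, yet the simple final calculation was wrong, which left me speechless. Pretty comical! Could the AI experts here explain: is AI a bit weak at fraction arithmetic?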
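For anyone who wants to double-check the arithmetic, here is a minimal Python sketch that plugs the same values into Bayes' theorem using exact fractions from the standard library. The variable names are my own choice for illustration; they are not from ChatGPT's output or from Beestar.

```python
from fractions import Fraction

# Probability of drawing green from each box (from the problem statement)
p_g_given_a = Fraction(7, 12)  # box A: 7 green out of 12 balls
p_g_given_b = Fraction(2, 5)   # box B: 2 green out of 5 balls

# Each box is equally likely to be chosen
p_a = Fraction(1, 2)
p_b = Fraction(1, 2)

# Law of total probability: P(G) = P(G|A)P(A) + P(G|B)P(B)
p_g = p_g_given_a * p_a + p_g_given_b * p_b

# Bayes' theorem: P(A|G) = P(G|A)P(A) / P(G)
p_a_given_g = p_g_given_a * p_a / p_g

print(p_g)                           # 59/120
print(p_a_given_g)                   # 35/59
print(round(float(p_a_given_g), 3))  # 0.593
```

Working in exact fractions avoids any rounding along the way, so the 35/59 comes out directly and can be compared against the answer choices.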

 

All replies:

ChatGPT's accuracy is under 50% -Lucky2022- (114 bytes) 03/19/2023 13:49:42

ChatGPT generates an answer; it does not calculate one. -光猜不想- (180 bytes) 03/20/2023 13:08:58
